Javascript must be enabled to continue!
Hadoop-based Online Shopping Behavior Analysis: Design and Implementation
View through CrossRef
This research conducts big data analysis based on open-source Taobao user behavior data. Leveraging the Hadoop big data analytics platform, we performed multi-dimensional user behavior analysis on the publicly available Alibaba Tianchi dataset to provide actionable insights for e-commerce sales decisions.
The study utilizes open-source Taobao user behavior data, where each row represents an individual user action. The dataset was first uploaded to Hadoop's HDFS storage. Subsequently, we configured Hadoop's Flume component to automate data ingestion, loading the data into a Hive database for comprehensive analysis. Key e-commerce metrics—including PV (Page Views), UV (Unique Visitors), bounce rate, and repurchase rate—were statistically analyzed. A multi-dimensional perspective was applied to examine user behavior patterns and activity levels across time dimensions. Additionally, we conducted statistical analyses on top-selling item IDs, popular product categories, and user geographic distribution.
The resulting analytical tables were stored in Hive. Using Sqoop, these result tables were automatically exported to a relational MySQL database for efficient storage and analytical presentation.
For visualization, Python's PyEcharts library was employed to create front-end interactive displays. By querying datasets from MySQL, we generated multi-dimensional visualizations to enhance data interpretability. Finally, PyEcharts' `Page` method facilitated the design of an interactive dashboard, while static HTML deployment enabled a dynamic large-screen visualization interface. These visually rich presentations empower decision-makers to rapidly derive strategic insights.
Title: Hadoop-based Online Shopping Behavior Analysis: Design and Implementation
Description:
This research conducts big data analysis based on open-source Taobao user behavior data.
Leveraging the Hadoop big data analytics platform, we performed multi-dimensional user behavior analysis on the publicly available Alibaba Tianchi dataset to provide actionable insights for e-commerce sales decisions.
The study utilizes open-source Taobao user behavior data, where each row represents an individual user action.
The dataset was first uploaded to Hadoop's HDFS storage.
Subsequently, we configured Hadoop's Flume component to automate data ingestion, loading the data into a Hive database for comprehensive analysis.
Key e-commerce metrics—including PV (Page Views), UV (Unique Visitors), bounce rate, and repurchase rate—were statistically analyzed.
A multi-dimensional perspective was applied to examine user behavior patterns and activity levels across time dimensions.
Additionally, we conducted statistical analyses on top-selling item IDs, popular product categories, and user geographic distribution.
The resulting analytical tables were stored in Hive.
Using Sqoop, these result tables were automatically exported to a relational MySQL database for efficient storage and analytical presentation.
For visualization, Python's PyEcharts library was employed to create front-end interactive displays.
By querying datasets from MySQL, we generated multi-dimensional visualizations to enhance data interpretability.
Finally, PyEcharts' `Page` method facilitated the design of an interactive dashboard, while static HTML deployment enabled a dynamic large-screen visualization interface.
These visually rich presentations empower decision-makers to rapidly derive strategic insights.
Related Results
Pengaruh Hedonic Shopping Value dan Shopping Lifestyle terhadap Impulsive Buying pada Konsumen Shopee
Pengaruh Hedonic Shopping Value dan Shopping Lifestyle terhadap Impulsive Buying pada Konsumen Shopee
Abstract. The development of increasingly advanced information technology has made the internet not only a medium of communication, but also a shopping center for consumers online....
ADDRESSING THE FIRE SAFETY PROBLEMS IN SHOPPING CENTRES OF DHAKA CITY: A CASE STUDY ON SOME SELECTED SHOPPING CENTRES IN MIRPUR AREA
ADDRESSING THE FIRE SAFETY PROBLEMS IN SHOPPING CENTRES OF DHAKA CITY: A CASE STUDY ON SOME SELECTED SHOPPING CENTRES IN MIRPUR AREA
The city of dhaka has been seeing a significant number of fire incidents. The exacerbation of the crisis is mostly attributed to a combination of institutional incompetence, insuff...
Factors Influencing Online Shopping behavior of Online Fashion Retailers
Factors Influencing Online Shopping behavior of Online Fashion Retailers
The COVID-19 pandemic has changed online shopping behaviors, according to a survey of about 3,700 consumers in nine emerging and developed economies. As people embraced social dist...
Hadoop Tools
Hadoop Tools
As the name indicates, this chapter explains the various additional tools provided by Hadoop. The additional tools provided by Hadoop distribution are Hadoop Streaming, Hadoop Arch...
Factors affecting online shopping behavior in Bangladesh: A demographic perspective
Factors affecting online shopping behavior in Bangladesh: A demographic perspective
Prior studies in Bangladesh examined how several non-demographic factors influenced consumers' online buying behavior. However, no specific research has been conducted to examine a...
Determining shopping mall visitors’ perceptions on mall attributes
Determining shopping mall visitors’ perceptions on mall attributes
The challenging retail environment requires a need to manage shopping malls effectively to understand the attributes that attract shopping mall visitors to visit shopping malls. Th...
Secure Cloud Data with Attribute-based Honey Encryption
Secure Cloud Data with Attribute-based Honey Encryption
Abstract
Encryption is a Technique to convert plain text into Cipher text, which is unreadable without an appropriate decryption key. Hadoop is a platform to process and st...

