Javascript must be enabled to continue!
Study of the Yahoo-Yahoo Hash-Tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
View through CrossRef
Mining opinion on social media microblogs presents opportunities to extract meaningful insight from the public from trending issues like the “yahoo-yahoo” which in Nigeria, is synonymous to cybercrime. In this study, content analysis of selected historical tweets from “yahoo-yahoo” hash-tag was conducted for sentiment and topic modelling. A corpus of 5500 tweets was obtained and pre-processed using a pre-trained tweet tokenizer while Valence Aware Dictionary for Sentiment Reasoning (VADER), Liu Hu method, Latent Dirichlet Allocation (LDA), Latent Semantic Indexing (LSI) and Multidimensional Scaling (MDS) graphs were used for sentiment analysis, topic modelling and topic visualization. Results showed the corpus had 173 unique tweet clusters, 5327 duplicates tweets and a frequency of 9555 for “yahoo”. Further validation using the mean sentiment scores of ten volunteers returned R and R2 of 0.8038 and 0.6402; 0.5994 and 0.3463; 0.5999 and 0.3586 for Human and VADER; Human and Liu Hu; Liu Hu and VADER sentiment scores, respectively. While VADER outperforms Liu Hu in sentiment analysis, LDA and LSI returned similar results in the topic modelling. The study confirms VADER’s performance on unstructured social media data containing non-English slangs, conjunctions, emoticons, etc. and proved that emojis are more representative of sentiments in tweets than the texts.
Title: Study of the Yahoo-Yahoo Hash-Tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Description:
Mining opinion on social media microblogs presents opportunities to extract meaningful insight from the public from trending issues like the “yahoo-yahoo” which in Nigeria, is synonymous to cybercrime.
In this study, content analysis of selected historical tweets from “yahoo-yahoo” hash-tag was conducted for sentiment and topic modelling.
A corpus of 5500 tweets was obtained and pre-processed using a pre-trained tweet tokenizer while Valence Aware Dictionary for Sentiment Reasoning (VADER), Liu Hu method, Latent Dirichlet Allocation (LDA), Latent Semantic Indexing (LSI) and Multidimensional Scaling (MDS) graphs were used for sentiment analysis, topic modelling and topic visualization.
Results showed the corpus had 173 unique tweet clusters, 5327 duplicates tweets and a frequency of 9555 for “yahoo”.
Further validation using the mean sentiment scores of ten volunteers returned R and R2 of 0.
8038 and 0.
6402; 0.
5994 and 0.
3463; 0.
5999 and 0.
3586 for Human and VADER; Human and Liu Hu; Liu Hu and VADER sentiment scores, respectively.
While VADER outperforms Liu Hu in sentiment analysis, LDA and LSI returned similar results in the topic modelling.
The study confirms VADER’s performance on unstructured social media data containing non-English slangs, conjunctions, emoticons, etc.
and proved that emojis are more representative of sentiments in tweets than the texts.
Related Results
Study of the Yahoo-yahoo Hash-tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Study of the Yahoo-yahoo Hash-tag Tweets Using Sentiment Analysis and Opinion Mining Algorithms
Abstract
BackgroundSocial media opinion has become a medium to quickly access large, valuable, and rich details of information on any subject matter within a short period. ...
Sentiment Analysis of Tweets on Soda Taxes
Sentiment Analysis of Tweets on Soda Taxes
Context:
As a primary source of added sugars, sugar-sweetened beverage (SSB) consumption may contribute to the obesity epidemic. A soda tax is an excise tax cha...
Sentiment/tone (Automated Content Analysis)
Sentiment/tone (Automated Content Analysis)
Sentiment/tone describes the way issues or specific actors are described in coverage. Many analyses differentiate between negative, neutral/balanced or positive sentiment/tone as b...
Using Social Media to Predict Food Deserts in the United States: Infodemiology Study of Tweets
Using Social Media to Predict Food Deserts in the United States: Infodemiology Study of Tweets
Background
The issue of food insecurity is becoming increasingly important to public health practitioners because of the adverse health outcomes and underlying ...
Using Social Media to Predict Food Deserts in the United States: Infodemiology Study of Tweets (Preprint)
Using Social Media to Predict Food Deserts in the United States: Infodemiology Study of Tweets (Preprint)
BACKGROUND
The issue of food insecurity is becoming increasingly important to public health practitioners because of the adverse health outcomes and underly...
Feasible Sentiment Analysis of Real Time Twitter Data
Feasible Sentiment Analysis of Real Time Twitter Data
Sentiment analysis plays a significant role in understanding public opinion, trends, and sentiments expressed on social media platforms. In this paper, we focus on performing senti...
Does X Mark the Spot? Investigating discussions about cancer screening programs on X/Twitter through corpus analysis (Preprint)
Does X Mark the Spot? Investigating discussions about cancer screening programs on X/Twitter through corpus analysis (Preprint)
BACKGROUND
While cancer screening is proven to be effective in the early detection of the disease and early detection enables better treatment options, screening ...
Evaluation of Medical Confidentiality Breaches on Twitter Among Anesthesiology and Intensive Care Health Care Workers
Evaluation of Medical Confidentiality Breaches on Twitter Among Anesthesiology and Intensive Care Health Care Workers
BACKGROUND:
With the generalization of social network use by health care workers, we observe the emergence of breaches in medical confidentiality. Our objective was to ...

