Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A Comparative Study of Some Selected Classifiers on an Imbalanced Dataset for Sentiment Analysis

View through CrossRef
Extracting subjective data from online user generated text documents is made quite easy with the use of sentiment analysis. For a classification task different individual algorithms are applied to a review dataset in which most classifiers produce accurate results while others produce limited and inaccurate predictions. This research is to evaluate various machine learning algorithms for online dataset classification, where same set of data will be used to test four different machine learning algorithms: Naive Bayes, Support Vector machine, K-nearest neighbor and Decision tree. In order to determine which machine learning model will perform best in sentiment analysis as a constant issue. In this research, our primary goal is to identify the most effective machine learning model for sentiment analysis of English texts among the aforementioned classifiers. Their robustness will be tested and classified with an imbalanced dataset Kaggle.com a Machine learning repository. The dataset will first undergo data preprocessing in order to enable analysis, and then feature extraction for the base classifiers performance and accuracy which will be carried out in Jupyter notebook from Anaconda. Each machine learning algorithm performance scores will be calculated for higher accuracy using confusion matrix, F1-score, precision and recall respectively.
Title: A Comparative Study of Some Selected Classifiers on an Imbalanced Dataset for Sentiment Analysis
Description:
Extracting subjective data from online user generated text documents is made quite easy with the use of sentiment analysis.
For a classification task different individual algorithms are applied to a review dataset in which most classifiers produce accurate results while others produce limited and inaccurate predictions.
This research is to evaluate various machine learning algorithms for online dataset classification, where same set of data will be used to test four different machine learning algorithms: Naive Bayes, Support Vector machine, K-nearest neighbor and Decision tree.
In order to determine which machine learning model will perform best in sentiment analysis as a constant issue.
In this research, our primary goal is to identify the most effective machine learning model for sentiment analysis of English texts among the aforementioned classifiers.
Their robustness will be tested and classified with an imbalanced dataset Kaggle.
com a Machine learning repository.
The dataset will first undergo data preprocessing in order to enable analysis, and then feature extraction for the base classifiers performance and accuracy which will be carried out in Jupyter notebook from Anaconda.
Each machine learning algorithm performance scores will be calculated for higher accuracy using confusion matrix, F1-score, precision and recall respectively.

Related Results

Primerjalna književnost na prelomu tisočletja
Primerjalna književnost na prelomu tisočletja
In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...
Sentiment Analysis with Python: A Hands-on Approach
Sentiment Analysis with Python: A Hands-on Approach
Sentiment Analysis is a rapidly growing field in Natural Language Processing (NLP) that aims to extract opinions, emotions, and attitudes expressed in text. It has a wide range o...
Selective Ensemble Learning Algorithm for Imbalanced Dataset
Selective Ensemble Learning Algorithm for Imbalanced Dataset
Abstract Under the imbalanced dataset, the performance of the base-classifier, the computingmethod of weight of base-classifier and the selection method of the base-classif...
Weak tagging and imbalanced networks for online review sentiment classification
Weak tagging and imbalanced networks for online review sentiment classification
Sentiment classification aims to complete the automatic judgment task of text sentiment tendency. In the sentiment classification task of online reviews, traditional deep learning ...
Machine Learning and Semantic Orientation Ensemble Methods for Egyptian Telecom Tweets Sentiment Analysis
Machine Learning and Semantic Orientation Ensemble Methods for Egyptian Telecom Tweets Sentiment Analysis
The vast amount of data currently available online attracted many parties to analyze sentiments expressed in these data extracting valuable knowledge. Many approaches have been pro...
Text Sentiment Analysis with Event Information
Text Sentiment Analysis with Event Information
Abstract In recent years, the research of text sentiment analysis has recognized the importance of associating events with sentiment polarity, that is, the sentiment trigge...
Forex Sentiment Analysis with Python
Forex Sentiment Analysis with Python
The most important catalysts for forex market movements are news, economic data, and also market sentiment. Market sentiment refers to the overall attitude of traders toward a part...
Imbalanced sentiment classification of online reviews based on SimBERT
Imbalanced sentiment classification of online reviews based on SimBERT
The purpose of sentiment classification is to accomplish automatic judssssgment of the sentiment tendency of text. In the sentiment classification task of online reviews, tradition...

Back to Top