Sentiment Analysis of Finnish Twitter Discussions on COVID-19 During the Pandemic

Abstract

With the outbreak of the COVID-19 pandemic, researchers have studied how people reacted on social media, and sentiment analysis has been leveraged to gain insight. However, much of the research on both sentiment analysis and social media analysis of COVID-19 focuses on widespread languages, such as English and Chinese. This is partly due to the scarcity of resources for natural language processing and sentiment analysis for morphologically complex and less prevalent languages such as Finnish. This paper aims to analyze sentiments on Twitter in the Finnish language during the COVID-19 pandemic. We manually annotate the sentiment of a random sample of 1943 Finnish tweets about COVID-19. We use this sample to build binomial and multinomial logistic regression models with Lasso penalty by exploiting n-grams and two existing sentiment lexicons. We also build two similar models using an existing (pre-COVID-19) Twitter dataset for comparison. The best-performing model for the Finnish language is then used to determine the trends of positive, negative, and neutral opinions on a collection of tweets in Finnish extracted between April 21 and June 18, 2020. The best sentiment polarity prediction model for the Finnish language attains 0.785 AUC, 0.710 balanced accuracy, and 0.723 macro-averaged F1 for predicting positive and negative polarity (binomial classification), and 0.667 AUC, 0.607 balanced accuracy, and 0.475 F1 when adding neutral tweets (multinomial classification). On the other hand, the pre-COVID-19 model trained on the same number of tweets exhibits higher accuracy for the multinomial model (0.687 balanced accuracy and 0.588 F1). We hypothesize that this loss of performance is due to the COVID-19 context, which makes the sentiment of neutral tweets more difficult for the machine learning algorithm to predict.
Running the model on all the extracted Finnish tweets, we observe a decrease in negativity and an increase in positivity over the observed period as the Finnish government lifts restrictions. Our results show that applying an existing general-purpose sentiment analyzer to domain-specific tweets, such as those about COVID-19, yields lower accuracy. Future work should invest more effort in using and developing sentiment analysis tools tailored to their application domain when conducting large-scale social media analysis of specific medical issues, such as a global pandemic.
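The abstract reports balanced accuracy and macro-averaged F1 for a three-class (positive, negative, neutral) setting. As a minimal sketch of how these two metrics are commonly defined, the following plain-Python example computes them from scratch. The function names and the toy labels are illustrative assumptions; they are not the paper's code or data, and the paper's exact evaluation procedure may differ.

```python
from collections import Counter


def per_class_counts(y_true, y_pred):
    """Count true positives, false positives and false negatives per class."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    return tp, fp, fn


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall over classes present in y_true."""
    tp, _, fn = per_class_counts(y_true, y_pred)
    classes = sorted(set(y_true))
    recalls = [tp[c] / (tp[c] + fn[c]) for c in classes]
    return sum(recalls) / len(recalls)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all classes seen in either list."""
    tp, fp, fn = per_class_counts(y_true, y_pred)
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        denom = 2 * tp[c] + fp[c] + fn[c]
        scores.append(2 * tp[c] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy labels, chosen only to exercise the functions.
y_true = ["pos", "pos", "neg", "neg", "neu", "neu"]
y_pred = ["pos", "neg", "neg", "neg", "pos", "neu"]

print(f"balanced accuracy: {balanced_accuracy(y_true, y_pred):.3f}")
print(f"macro F1:          {macro_f1(y_true, y_pred):.3f}")
```

Both metrics weight every class equally, which is why they are often preferred over plain accuracy when one sentiment class (for example, neutral) dominates the sample.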

Springer Science and Business Media LLC

Maëlick Claes, Umar Farooq, Iflaah Salman, Anna Teern, Minna Isomursu, Raija Halonen

SN Computer Science

2024
