Javascript must be enabled to continue!
Examining LDA2Vec and Tweet Pooling for Topic Modeling on Twitter Data
View through CrossRef
The short lengths of tweets present a challenge for topic modeling to extend beyond what is provided explicitly from hashtag information. This is particularly true for LDAbased methods because the amount of information available from pertweet statistical analysis is severely limited. In this paper we present LDA2Vec paired with temporal tweet pooling (LDA2VecTTP) and assess its performance on this problem relative to traditional LDA and to Biterm Topic Model (Biterm), which was developed specifically for topic modeling on short text documents. We paired each of the three topic modeling algorithms with three tweet pooling schemes: no pooling, authorbased pooling, and temporal pooling. We then conducted topic modeling on two Twitter datasets using each of the algorithms and the tweet pooling schemes. Our results on the largest dataset suggest that LDA2VecTTP can produce higher coherence scores and more logically coherent and interpretable topics.
World Scientific and Engineering Academy and Society (WSEAS)
Title: Examining LDA2Vec and Tweet Pooling for Topic Modeling on Twitter Data
Description:
The short lengths of tweets present a challenge for topic modeling to extend beyond what is provided explicitly from hashtag information.
This is particularly true for LDAbased methods because the amount of information available from pertweet statistical analysis is severely limited.
In this paper we present LDA2Vec paired with temporal tweet pooling (LDA2VecTTP) and assess its performance on this problem relative to traditional LDA and to Biterm Topic Model (Biterm), which was developed specifically for topic modeling on short text documents.
We paired each of the three topic modeling algorithms with three tweet pooling schemes: no pooling, authorbased pooling, and temporal pooling.
We then conducted topic modeling on two Twitter datasets using each of the algorithms and the tweet pooling schemes.
Our results on the largest dataset suggest that LDA2VecTTP can produce higher coherence scores and more logically coherent and interpretable topics.
Related Results
Faith Tweets: Ambient Religious Communication and Microblogging Rituals
Faith Tweets: Ambient Religious Communication and Microblogging Rituals
There’s no reason to think that Jesus wouldn’t have Facebooked or twittered if he came into the world now. Can you imagine his killer status updates? Reverend Schenck, New York, Al...
Alts and Automediality: Compartmentalising the Self through Multiple Social Media Profiles
Alts and Automediality: Compartmentalising the Self through Multiple Social Media Profiles
IntroductionAlt, or alternative, accounts are secondary profiles people use in addition to a main account on a social media platform. They are a kind of automediation, a way of rep...
Pooling Operations in Deep Learning: From “Invariable” to “Variable”
Pooling Operations in Deep Learning: From “Invariable” to “Variable”
Deep learning has become a research hotspot in multimedia, especially in the field of image processing. Pooling operation is an important operation in deep learning. Pooling operat...
A Twitter Sentimen Analysis on Islamic Banking Using Drone Emprit Academic (DEA): Evidence from Indonesia
A Twitter Sentimen Analysis on Islamic Banking Using Drone Emprit Academic (DEA): Evidence from Indonesia
ABSTRACT
The research aimed to identify and collect issues discussed regarding Islamic banking from user activity, sentimen, and content on Twitter. This study used a qualitative a...
MENGELOMPOKKAN TOPIK TWEET AKUN POLITISI DENGAN PEMODELAN TOPIK METODE AUTHOR-TOPIC MODELS
MENGELOMPOKKAN TOPIK TWEET AKUN POLITISI DENGAN PEMODELAN TOPIK METODE AUTHOR-TOPIC MODELS
Twitter merupakan sebuah platform untuk berhubungan satu sama lain agar tetap berkomunikasi dan tetap terhubung dengan mengirim pesan cepat dan daring. Pada Twitter juga dapat memb...
Analisis Dampak Kabut Asap dari Kebakaran Hutan dan Lahan dengan Pendekatan Text Mining
Analisis Dampak Kabut Asap dari Kebakaran Hutan dan Lahan dengan Pendekatan Text Mining
Kebakaran hutan dan lahan (karhutla) berdampak buruk bagi lingkungan serta ekosistem. Kabut asap merupakan salah satu akibat yang ditimbulkan dari kebakaran hutan dan lahan. Keresa...
Analisis Dampak Kabut Asap dari Kebakaran Hutan dan Lahan dengan Pendekatan Text Mining
Analisis Dampak Kabut Asap dari Kebakaran Hutan dan Lahan dengan Pendekatan Text Mining
Kebakaran hutan dan lahan (karhutla) berdampak buruk bagi lingkungan serta ekosistem. Kabut asap merupakan salah satu akibat yang ditimbulkan dari kebakaran hutan dan lahan. Keresa...
Public engagement of scientists (Science Communication)
Public engagement of scientists (Science Communication)
Public engagement of scientists is defined as “all kinds of publicly accessible communication carried out by people presenting themselves as scientists. This includes scholarly com...

