Javascript must be enabled to continue!
Examining LDA2Vec and Tweet Pooling for Topic Modeling on Twitter Data
View through CrossRef
The short lengths of tweets present a challenge for topic modeling to extend beyond what is provided explicitly from hashtag information. This is particularly true for LDAbased methods because the amount of information available from pertweet statistical analysis is severely limited. In this paper we present LDA2Vec paired with temporal tweet pooling (LDA2VecTTP) and assess its performance on this problem relative to traditional LDA and to Biterm Topic Model (Biterm), which was developed specifically for topic modeling on short text documents. We paired each of the three topic modeling algorithms with three tweet pooling schemes: no pooling, authorbased pooling, and temporal pooling. We then conducted topic modeling on two Twitter datasets using each of the algorithms and the tweet pooling schemes. Our results on the largest dataset suggest that LDA2VecTTP can produce higher coherence scores and more logically coherent and interpretable topics.
World Scientific and Engineering Academy and Society (WSEAS)
Title: Examining LDA2Vec and Tweet Pooling for Topic Modeling on Twitter Data
Description:
The short lengths of tweets present a challenge for topic modeling to extend beyond what is provided explicitly from hashtag information.
This is particularly true for LDAbased methods because the amount of information available from pertweet statistical analysis is severely limited.
In this paper we present LDA2Vec paired with temporal tweet pooling (LDA2VecTTP) and assess its performance on this problem relative to traditional LDA and to Biterm Topic Model (Biterm), which was developed specifically for topic modeling on short text documents.
We paired each of the three topic modeling algorithms with three tweet pooling schemes: no pooling, authorbased pooling, and temporal pooling.
We then conducted topic modeling on two Twitter datasets using each of the algorithms and the tweet pooling schemes.
Our results on the largest dataset suggest that LDA2VecTTP can produce higher coherence scores and more logically coherent and interpretable topics.
Related Results
Pooling Operations in Deep Learning: From “Invariable” to “Variable”
Pooling Operations in Deep Learning: From “Invariable” to “Variable”
Deep learning has become a research hotspot in multimedia, especially in the field of image processing. Pooling operation is an important operation in deep learning. Pooling operat...
Public engagement of scientists (Science Communication)
Public engagement of scientists (Science Communication)
Public engagement of scientists is defined as “all kinds of publicly accessible communication carried out by people presenting themselves as scientists. This includes scholarly com...
FPGA implementation of AAD pooling unit and performance analysis
FPGA implementation of AAD pooling unit and performance analysis
Convolutional Neural Network (CNN) has been witnessing a massive growth for its various applications in different fields. It is a category of Neural Network or Deep learning that i...
Tweet-SCAN: An event discovery technique for geo-located tweets
Tweet-SCAN: An event discovery technique for geo-located tweets
Twitter has become one of the most popular Location-Based Social Networks (LBSNs) that enables bridging physical and virtual worlds. Tweets, 140-character-long messages published i...
Twitter User Rank Using Keyword Search
Twitter User Rank Using Keyword Search
Twitter has attracted attention recently as a new way of collecting, providing and sharing information on the Internet. However, it is difficult for us to find Twitter users to fol...
Support Analysis of Weighted Discussions in Twitter
Support Analysis of Weighted Discussions in Twitter
The analysis of opinions on social networks has recently received a considerable attention on many application fields. Although there exist many specialized and generalist social n...
Remaja dan Literasi Media Sosial
Remaja dan Literasi Media Sosial
Abstract. This study aims to describe teenager’s media literacy in Pekan baru through the 'Twitter Please Do Your Magic' movement on Twitter. Media literacy is a person's ability t...
Exploring the topical structure of short text through probability models : from tasks to fundamentals
Exploring the topical structure of short text through probability models : from tasks to fundamentals
Recent technological advances have radically changed the way we communicate. Today’s
communication has become ubiquitous and it has fostered the need for information that is easie...


