Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Tweet-SCAN: An event discovery technique for geo-located tweets

View through CrossRef
Twitter has become one of the most popular Location-Based Social Networks (LBSNs) that enables bridging physical and virtual worlds. Tweets, 140-character-long messages published in Twitter, are aimed to provide basic responses to the What's happening? question. Occurrences and events in the real life are usually reported through geo-located tweets by users on site. Uncovering event-related tweets from the rest is a challenging problem that necessarily requires exploiting different tweet features. With that in mind, we propose Tweet-SCAN, a novel event discovery technique based on the density-based clustering algorithm called DB-SCAN. Tweet-SCAN takes into account four main features from a tweet, namely content, time, location and user to cluster homogeneously event-related tweets. This new technique models textual content through a probabilistic topic model called Hierarchical Dirichlet Process and introduces Jensen-Shannon distance for the task of neighborhood identification in the textual dimension. As a matter of fact, we show Tweet-SCAN performance in a real data set of geo-located tweets posted during Barcelona local festivities in 2014, for which some of the events were known beforehand. By means of this data set, we are able to assess Tweet-SCAN capabilities to discover events, justify using a textual component and highlight the effects of several parameters.
Title: Tweet-SCAN: An event discovery technique for geo-located tweets
Description:
Twitter has become one of the most popular Location-Based Social Networks (LBSNs) that enables bridging physical and virtual worlds.
Tweets, 140-character-long messages published in Twitter, are aimed to provide basic responses to the What's happening? question.
Occurrences and events in the real life are usually reported through geo-located tweets by users on site.
Uncovering event-related tweets from the rest is a challenging problem that necessarily requires exploiting different tweet features.
With that in mind, we propose Tweet-SCAN, a novel event discovery technique based on the density-based clustering algorithm called DB-SCAN.
Tweet-SCAN takes into account four main features from a tweet, namely content, time, location and user to cluster homogeneously event-related tweets.
This new technique models textual content through a probabilistic topic model called Hierarchical Dirichlet Process and introduces Jensen-Shannon distance for the task of neighborhood identification in the textual dimension.
As a matter of fact, we show Tweet-SCAN performance in a real data set of geo-located tweets posted during Barcelona local festivities in 2014, for which some of the events were known beforehand.
By means of this data set, we are able to assess Tweet-SCAN capabilities to discover events, justify using a textual component and highlight the effects of several parameters.

Related Results

Sentiment Analysis of Tweets on Soda Taxes
Sentiment Analysis of Tweets on Soda Taxes
Context: As a primary source of added sugars, sugar-sweetened beverage (SSB) consumption may contribute to the obesity epidemic. A soda tax is an excise tax charged on ...
Evaluation of Medical Confidentiality Breaches on Twitter Among Anesthesiology and Intensive Care Health Care Workers
Evaluation of Medical Confidentiality Breaches on Twitter Among Anesthesiology and Intensive Care Health Care Workers
BACKGROUND: With the generalization of social network use by health care workers, we observe the emergence of breaches in medical confidentiality. Our objective was to ...
Sentiment Analysis of Russia-Ukraine Conflict Tweets Using RoBERTa
Sentiment Analysis of Russia-Ukraine Conflict Tweets Using RoBERTa
[Objective] The moment Russia officially invaded Ukraine, the world experienced a period of tension and uncertainty. As a social release valve digital communication, channels incre...
#Menopause: The Menopause Ontology Project
#Menopause: The Menopause Ontology Project
ABSTRACT Introduction Medical professionals and patients increasingly utilize social media to connect and share healthcare infor...
Support Analysis of Weighted Discussions in Twitter
Support Analysis of Weighted Discussions in Twitter
The analysis of opinions on social networks has recently received a considerable attention on many application fields. Although there exist many specialized and generalist social n...
Public engagement of scientists (Science Communication)
Public engagement of scientists (Science Communication)
Public engagement of scientists is defined as “all kinds of publicly accessible communication carried out by people presenting themselves as scientists. This includes scholarly com...
Geological and geomorphological objects of the Ukrainian Carpathians’ Beskid Mountains and their tourist attractiveness
Geological and geomorphological objects of the Ukrainian Carpathians’ Beskid Mountains and their tourist attractiveness
The article explores the geological and geomorphological objects of the Beskidy Ukrainian Carpathians for the further creation of geo-tourist routes. Geo-tourist areas combining se...
Temporal and Thematic Analysis of Promotional Waterpipe-Related Posts on Twitter/X in the US
Temporal and Thematic Analysis of Promotional Waterpipe-Related Posts on Twitter/X in the US
AbstractIntroductionWaterpipe tobacco smoking (WTS), also known as hookah, shisha, or narghile, is particularly popular among young people in the United States (US). WTS poses seri...

Back to Top