
Who is Tweeting? A Scoping Review of Methods to Establish Race and Ethnicity from Twitter Datasets

Background: A growing amount of health research uses social media data. Critics of social media research often argue that it may be unrepresentative of the population. Identifying the demographics of social media users enables us to measure that representativeness. Extracting race or ethnicity from social media data can be difficult, and researchers may choose from a multitude of different approaches.
Methods: We present a scoping review to identify the methods used to extract race or ethnicity from Twitter datasets. We searched 16 electronic databases and carried out reference checking to identify relevant articles. Each record was sifted independently by at least two researchers, with any disagreement discussed. The research could be grouped by the methods applied to extract race or ethnicity.
Results: From 1093 records we identified 56 that met our inclusion criteria. The majority focus on Twitter users based in the US. A range of data types were used, including Twitter profile pictures, bios, and/or location, and the content of the tweets themselves. The methods used were wide ranging and included manual inference, linkage to census data, commercial software, language/dialect recognition, and machine learning. Not all studies evaluated their methods. Those that did found accuracy to vary from 45% to 93%, with significantly lower accuracy when identifying non-white race categories. Some of the methods used, particularly those relying on photos or dialect, may raise ethical questions as well as questions about accuracy.
Conclusion: There is no standard approach or guideline for extracting race or ethnicity from Twitter or other social media. Social media researchers must interpret race or ethnicity carefully and not over-promise what can be achieved, as even manual screening is a subjective, imperfect method. Future research should establish the accuracy of methods to inform evidence-based best practice guidelines for social media researchers, and be guided by concerns of equity and social justice.

Related Results

Methods to Establish Race or Ethnicity of Twitter Users: Scoping Review (Preprint)
BACKGROUND A growing amount of health research uses social media data. Those critical of social media research often cite that it may be unrepresentative of...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Mindy Calling: Size, Beauty, Race in The Mindy Project
When characters in the Fox Television sitcom The Mindy Project call Mindy Lahiri fat, Mindy sees it as a case of misidentification. She reminds the character that she is a “petite ...
Well-being focused interventions for caregivers of children with developmental disabilities-a scoping review protocol
Abstract Introduction Children with developmental disabilities (DD) have complex health needs which imply that they will need assistance in many areas of their lives, a role usually ...
Race, Ethnicity, and War
Numerous forms of violence and armed conflict in human history have been pursued and justified by deploying the concepts of ethnic and racial difference. Race and ethnicity are soc...
Osteopathic medical students’ understanding of race-based medicine
Abstract Context Race is a social construct, not a biological or genetic construct, utilized to categorize people based on obser...
Remaja dan Literasi Media Sosial
Abstract. This study aims to describe teenagers' media literacy in Pekanbaru through the 'Twitter Please Do Your Magic' movement on Twitter. Media literacy is a person's ability t...
A scoping review on the methodological and reporting quality of scoping reviews in China
Abstract Background Scoping reviews have emerged as a valuable method for synthesizing emerging evidence, offering a comprehensive contextual overview, and influencing pol...
