Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

The Technological Bridge: R Programming’s Utility in Converting Social Media Data for Quantitative Financial Analysis

View through CrossRef
Abstract Research purpose. This study explores whether R programming can transform unstructured qualitative social media data into a quantitative format suitable for econometric modelling. It specifically examines how elements such as text, emojis, and sentiment from Reddit and X (formerly Twitter) can be converted into variables for regression analysis. With the aim to enhance the predictive power of traditional financial models using alternative data sources, the paper outlines comprehensive guidelines with specific technical steps, from scripting an API to extracting data from Reddit and X, through cleaning and tokenising to incorporating the data into regression models using R programming. The study addresses the growing need in financial economics to incorporate alternative data streams by offering a structured, replicable process for transforming high-volume, unstructured online content into statistically valid variables, thereby bridging the gap between qualitative market sentiment and quantitative modelling. Design / Methodology / Approach. Focusing on the methodology and R scripts, this research adopts a quantitative approach, transforming qualitative social media data into a format suitable for multiple linear and instrumental variable regression models to assess the effect of social media signals on asset prices, with GameStop (GME) and Best Buy (BBY) as case studies. The process ensures reproducibility and includes open-source code, enhancing transparency and applicability for both academic and professional financial data analysis contexts. Findings. The findings demonstrate that qualitative social media data can be quantified for financial analysis. It was effectively extracted, cleaned, and used for regression analysis. Results show that traditional market indicators fail to explain GME’s price shifts, while the frequency of rocket emojis (interpreted as speculative sentiment) was statistically significant. BBY’s returns, however, aligned more closely with market and industry indices, suggesting a lower influence of private sentiment. Originality / Value / Practical implications. The research provides a replicable method for integrating social media data into econometric models, contributing new tools for analysing market sentiment. By adapting classical financial models to modern data sources, the paper opens new directions for asset pricing research. The paper provides technical tools created in R for use in econometric analysis, useful both for academics and practitioners.
Title: The Technological Bridge: R Programming’s Utility in Converting Social Media Data for Quantitative Financial Analysis
Description:
Abstract Research purpose.
This study explores whether R programming can transform unstructured qualitative social media data into a quantitative format suitable for econometric modelling.
It specifically examines how elements such as text, emojis, and sentiment from Reddit and X (formerly Twitter) can be converted into variables for regression analysis.
With the aim to enhance the predictive power of traditional financial models using alternative data sources, the paper outlines comprehensive guidelines with specific technical steps, from scripting an API to extracting data from Reddit and X, through cleaning and tokenising to incorporating the data into regression models using R programming.
The study addresses the growing need in financial economics to incorporate alternative data streams by offering a structured, replicable process for transforming high-volume, unstructured online content into statistically valid variables, thereby bridging the gap between qualitative market sentiment and quantitative modelling.
Design / Methodology / Approach.
Focusing on the methodology and R scripts, this research adopts a quantitative approach, transforming qualitative social media data into a format suitable for multiple linear and instrumental variable regression models to assess the effect of social media signals on asset prices, with GameStop (GME) and Best Buy (BBY) as case studies.
The process ensures reproducibility and includes open-source code, enhancing transparency and applicability for both academic and professional financial data analysis contexts.
Findings.
The findings demonstrate that qualitative social media data can be quantified for financial analysis.
It was effectively extracted, cleaned, and used for regression analysis.
Results show that traditional market indicators fail to explain GME’s price shifts, while the frequency of rocket emojis (interpreted as speculative sentiment) was statistically significant.
BBY’s returns, however, aligned more closely with market and industry indices, suggesting a lower influence of private sentiment.
Originality / Value / Practical implications.
The research provides a replicable method for integrating social media data into econometric models, contributing new tools for analysing market sentiment.
By adapting classical financial models to modern data sources, the paper opens new directions for asset pricing research.
The paper provides technical tools created in R for use in econometric analysis, useful both for academics and practitioners.

Related Results

Financial Advisory LLM Model for Modernizing Financial Services and Innovative Solutions for Financial Literacy in India
Financial Advisory LLM Model for Modernizing Financial Services and Innovative Solutions for Financial Literacy in India
Abstract Dynamically evolving financial conditions in India place sophisticated models of financial advisory services relative to its own peculiar conditions more in demand...
LITERASI KEUANGAN SERTA PENGGUNAAN PRODUK DAN JASA LEMBAGA KEUANGAN
LITERASI KEUANGAN SERTA PENGGUNAAN PRODUK DAN JASA LEMBAGA KEUANGAN
This study entitled: "Financial Literacy and Utility Products and Services Financial Institutions". The purpose of this research are: 1) Knowing and analyzing the financial literac...
Interventions designed to improve financial capability: A systematic review
Interventions designed to improve financial capability: A systematic review
AbstractBackgroundThere is growing recognition that people need stronger financial capability to avoid and recover from financial difficulties and poverty. Researchers are testing ...
ECONOMIC ESSENCE OF THE FINANCIAL STABILITY OF THE BANKING SYSTEM
ECONOMIC ESSENCE OF THE FINANCIAL STABILITY OF THE BANKING SYSTEM
Introduction. The article examines the essence of financial stability and stability of the banking system in order to analyze and understand them. The main approaches to interpreti...
Decoding Millennial Financial Behavior: Factors Shaping Financial Management Nexus
Decoding Millennial Financial Behavior: Factors Shaping Financial Management Nexus
This study investigates the influence of Financial Literacy, Financial Knowledge, Financial Attitude, Locus of Control, and Income on Financial Management Behavior among millennial...
Numerical Simulation of Barge Impact on a Continuous Girder Bridge and Bridge Damage Detection
Numerical Simulation of Barge Impact on a Continuous Girder Bridge and Bridge Damage Detection
Vessel collisions on bridge piers have been frequently reported. As many bridges are vital in transportation networks and serve as lifelines, bridge damage might leads to catastrop...
Financial Strain and Health
Financial Strain and Health
One of the most fundamental results in health economics is that a greater socio-economic status is associated with better health outcomes. However, the experience of financial pres...

Back to Top