Theme
This chapter demonstrates how big data and computation can be used to identify and track recurrent themes as the products of external influence. It first considers the limitations of the Google Ngram Viewer as a tool for tracing thematic trends over time, then turns to Douglas Biber's Corpus Linguistics: Investigating Language Structure and Use, a primer on the factors that complicate word-focused text analysis and the conclusions one might draw from it about word meanings. It then discusses the results of the author's application of latent Dirichlet allocation (LDA) to a corpus of 3,346 nineteenth-century novels using the open-source MALLET (MAchine Learning for LanguagE Toolkit), a software package for topic modeling. It also explains the different types of analyses the author performed, including text segmentation, word chunking, and analyses of how themes relate to author nationality, author gender, and time. The thematic data from the LDA model reveal the degree to which author nationality, author gender, and date of publication could be predicted from the thematic signals expressed in the corpus.
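The abstract does not say how the text segmentation was carried out, so the following is only a minimal sketch of one common approach: splitting each novel into consecutive fixed-size word chunks before topic modeling. The function name `segment_text`, the default `chunk_size` of 1000 words, and the rule for merging a short final chunk are all assumptions made for illustration, not the author's documented procedure.

```python
from typing import List


def segment_text(text: str, chunk_size: int = 1000) -> List[List[str]]:
    """Split a text into consecutive chunks of about `chunk_size` words.

    Hypothetical helper: the chunk size and the merge rule below are
    illustrative assumptions, not values taken from the chapter.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")

    # Lowercase and split on whitespace; a real pipeline would likely
    # also strip punctuation and stop words.
    words = text.lower().split()

    # Slice the word list into consecutive, non-overlapping chunks.
    chunks = [words[i:i + chunk_size] for i in range(0, len(words), chunk_size)]

    # Fold a short trailing chunk (under half the target size) into the
    # previous chunk so that no segment is much smaller than the rest.
    if len(chunks) > 1 and len(chunks[-1]) < chunk_size // 2:
        tail = chunks.pop()
        chunks[-1].extend(tail)

    return chunks
```

Each resulting chunk could then be written out as its own document for a topic-modeling tool such as MALLET, so that a topic is inferred from local passages rather than from a whole novel at once.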

