Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Topic Analysis and Classification of EGU Conference Abstracts

View through CrossRef
The corpus of Abstracts from the EGU General Assemblies 2000 - 2023 covers a wide range of Earth, planetary and space sciences topics, each with multiple subtopics. The abstracts are all in English, fairly uniform in length, cover one broad subject area, and are licenced under a permissive licence that allows further processing (CC BY 4.0), making this a high-quality text corpus for studies using natural language processing (NLP) and for the finetuning of Large Language Models (LLM). Our study makes use of openly available NLP software libraries and LLMs.In the first phase of this study, we were interested in finding out how well abstracts map to the topics covered by EGU Divisions and whether co-organisation of sessions contributes to or dilutes topics. The abstracts are available only in unstructured formats such as Portable Document Format (PDF) or plain text in XML extracts from the conference database. They are identified by abstract numbers but carry no information on the session or division where they were originally presented. We reconstructed this information from the online conference programme.To be able to employ a supervised learning approach of matching abstracts to topics, we defined the topics to be synonymous with the 23 scientific divisions of the EGU, using the division and co-listed divisions as topic labels.We finetuned the Bidirectional Encoder Representations from Transformers (BERT) and the slightly simplified DistillBERT language models for our topic modelling exercise. We also compared the machine classifications against a random association of abstracts and topics. Preliminary results obtained from our experiments show that using a machine learning model performs well in classifying the conference abstracts (accuracy = 0.66). The accuracy varies between divisions (0.40 for NP to 0.96 for G) and improves when taking co-organisation between divisions into account. Starting from one year of abstracts (EGU 2015), we plan to expand our analysis to cover all abstracts from all EGU General Assemblies (EGU 2000 - 2024).
Title: Topic Analysis and Classification of EGU Conference Abstracts
Description:
The corpus of Abstracts from the EGU General Assemblies 2000 - 2023 covers a wide range of Earth, planetary and space sciences topics, each with multiple subtopics.
The abstracts are all in English, fairly uniform in length, cover one broad subject area, and are licenced under a permissive licence that allows further processing (CC BY 4.
0), making this a high-quality text corpus for studies using natural language processing (NLP) and for the finetuning of Large Language Models (LLM).
Our study makes use of openly available NLP software libraries and LLMs.
In the first phase of this study, we were interested in finding out how well abstracts map to the topics covered by EGU Divisions and whether co-organisation of sessions contributes to or dilutes topics.
The abstracts are available only in unstructured formats such as Portable Document Format (PDF) or plain text in XML extracts from the conference database.
They are identified by abstract numbers but carry no information on the session or division where they were originally presented.
We reconstructed this information from the online conference programme.
To be able to employ a supervised learning approach of matching abstracts to topics, we defined the topics to be synonymous with the 23 scientific divisions of the EGU, using the division and co-listed divisions as topic labels.
We finetuned the Bidirectional Encoder Representations from Transformers (BERT) and the slightly simplified DistillBERT language models for our topic modelling exercise.
We also compared the machine classifications against a random association of abstracts and topics.
Preliminary results obtained from our experiments show that using a machine learning model performs well in classifying the conference abstracts (accuracy = 0.
66).
The accuracy varies between divisions (0.
40 for NP to 0.
96 for G) and improves when taking co-organisation between divisions into account.
Starting from one year of abstracts (EGU 2015), we plan to expand our analysis to cover all abstracts from all EGU General Assemblies (EGU 2000 - 2024).

Related Results

Status and development of the demographics of EGU General Assemblies’ presenters and EGU awardees
Status and development of the demographics of EGU General Assemblies’ presenters and EGU awardees
The EGU recognises the importance of equality, diversity, and inclusion (EDI)
 as a crucial foundation for scientific research. Since it’s foundation in 2021, the EGU EDI Committee...
Equality of opportunities in EGU recognitions: The EGU Awards Committee experience
Equality of opportunities in EGU recognitions: The EGU Awards Committee experience
The European Geosciences Union (EGU) is the leading organisation supporting Earth, planetary and space science research in Europe, upholding and promoting the highest standards of ...
Equality of opportunities in EGU recognitions: The EGU Awards Committee experience
Equality of opportunities in EGU recognitions: The EGU Awards Committee experience
The European Geosciences Union (EGU) is the leading organisation supporting Earth, planetary and space science research in Europe, upholding and promoting the highest standards of ...
DAMPAK TEKNOLOGI TERHADAP PROSES BELAJAR MENGAJAR
DAMPAK TEKNOLOGI TERHADAP PROSES BELAJAR MENGAJAR
DAFTAR PUSTAKAAditama, M. H. R., & Selfiardy, S. (2022). Kehidupan Mahasiswa Kuliah Sambil Bekerja di Masa Pandemi Covid-19. Kidspedia: Jurnal Pendidikan Anak Usia Dini, 3(...
Equality of opportunities in geosciences: The EGU Awards Committee experience
Equality of opportunities in geosciences: The EGU Awards Committee experience
<p>EGU, the European Geosciences Union, is Europe’s premier geosciences union, dedicated to the pursuit of excellence in the Earth, planetary, and space...
Equality of opportunities in geosciences: The EGU Awards Committee experience
Equality of opportunities in geosciences: The EGU Awards Committee experience
EGU, the European Geosciences Union, is Europe’s premier geosciences union, dedicated to the pursuit of excellence in the Earth, planetary, and space sciences for the ben...
Equality of opportunities in geosciences: The EGU Awards Committee experience
Equality of opportunities in geosciences: The EGU Awards Committee experience
<p>EGU, the European Geosciences Union, is Europe’s premier geosciences union, dedicated to the pursuit of excellence in the Earth, planetary, and space...
Equality of opportunities in geosciences: The EGU Awards Committee experience
Equality of opportunities in geosciences: The EGU Awards Committee experience
EGU, the European Geosciences Union, is Europe’s premier geosciences union, dedicated to the pursuit of excellence in the Earth, planetary, and space sciences for the ben...

Back to Top