
A Comparative Study of Machine Learning, Natural Language Processing, and Hybrid Models for Academic Paper Acceptance Prediction

The exponential increase in submissions to top-tier conferences and journals has placed unprecedented strain on editorial systems. To address this challenge, the present study explores computational modelling for predicting paper acceptance decisions, using peer review content as textual input and the confidence score and recommendation score as numerical input. We used the PeerConf dataset by Hasan et al., which contains 3,242 reviews across 1,236 papers. We design and evaluate three modelling approaches: traditional ML models; transformer-based and sentiment-integrated NLP models (BERT, DistilBERT); and a novel hybrid model that incorporates structured features, textual inputs and sentiment within ML pipelines. Accuracy and F1 scores are used to compare the predictive effectiveness of the models. The machine learning models were built with the scikit-learn library in a Python 3.10 environment, and the transformer-based models with Hugging Face Transformers v4.x; all models were trained in a GPU-enabled environment using PyTorch and scikit-learn. The study contributes to the understanding of how hybrid models compare with ML and NLP-based models, provides a viable solution for predicting paper acceptance decisions, and suggests the viability of different approaches for designing editorial support systems. We found that hybrid models outperformed the ML and sentiment-integrated NLP models, with 83.51% accuracy and an F1 score of 72.91%.
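As a rough illustration of the hybrid idea described in the abstract, the sketch below concatenates text-derived features, the two numeric review scores, and a sentiment value into one feature row, and computes the two reported metrics (accuracy and F1). It is not the authors' pipeline, which used scikit-learn and Hugging Face Transformers. The bag-of-words encoding, the tiny sentiment lexicon, and the function names `sentiment_score` and `hybrid_features` are assumptions made for this example, and the sample review and labels are invented toy inputs, not PeerConf data.

```python
import re
from collections import Counter

# Hypothetical toy lexicon. The abstract does not say how sentiment was derived.
POSITIVE = {"novel", "clear", "strong", "convincing"}
NEGATIVE = {"weak", "unclear", "flawed", "limited"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def sentiment_score(text):
    """Crude lexicon polarity in [-1, 1]; 0.0 when no lexicon word occurs."""
    tokens = tokenize(text)
    pos = sum(t in POSITIVE for t in tokens)
    neg = sum(t in NEGATIVE for t in tokens)
    total = pos + neg
    return 0.0 if total == 0 else (pos - neg) / total


def hybrid_features(review_text, recommendation, confidence, vocabulary):
    """Concatenate bag-of-words counts, numeric scores, and sentiment."""
    counts = Counter(tokenize(review_text))
    text_part = [counts[word] for word in vocabulary]
    structured_part = [float(recommendation), float(confidence)]
    return text_part + structured_part + [sentiment_score(review_text)]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Invented toy inputs, for illustration only.
    vocab = ["novel", "weak"]
    print(hybrid_features("Novel idea, but weak evaluation.", 3, 4, vocab))
    print(accuracy([1, 0, 1, 0], [1, 0, 0, 0]))
    print(f1_score([1, 0, 1, 0], [1, 0, 0, 0]))
```

Rows produced this way could be passed to any tabular classifier. Reporting F1 alongside accuracy is useful when accepted and rejected papers are imbalanced, since accuracy alone can look high for a model that mostly predicts the majority class.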

Defence Scientific Information and Documentation Centre

Chandra Shekhar Pandey, Shriram Pandey, Tejash Pandey, Shweta Pandey, Harish Pandey, Patanjali Mishra

DESIDOC Journal of Library & Information Technology

2025



Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)

BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...

Primerjalna književnost na prelomu tisočletja

In a comprehensive and at times critical manner, this volume seeks to shed light on the development of events in Western (i.e., European and North American) comparative literature ...

Učinak poučavanja razrednomu jeziku u izobrazbi nastavnika njemačkoga

The actual use of classroom language is principally limited to the classroom environment. As far as foreign language learning is concerned, the classroom often turns out to be the ...

Increased life expectancy of heart failure patients in a rural center by a multidisciplinary program

Abstract Funding Acknowledgements Type of funding sources: None. INTRODUCTION Patients with heart failure (HF)...

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021

The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

Abstract The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...

Investigating the Psychological Impact of Corrective Feedback on ESL Students’ Language Anxiety

This study investigates the psychological impact of corrective feedback on English as a Second Language (ESL) students' language anxiety using a quantitative research approach. Con...
