Javascript must be enabled to continue!
Using binary classification to evaluate the quality of machine translators
View through CrossRef
Machine translators have become increasingly popular and currently play an important role because of their great assistance in cross-cultural communication. However, machine translators often produces some unnatural texts, and an evaluation of machine translators is thus needed to avoid the abuse of machine-translated texts. This paper presents the use of binary classification to evaluate the quality of machine translators without references. First, we construct a large-scale dataset including humangenerated texts and machine-translated texts. Second, the dataset is used to train the multiple binary classifiers, e.g., decision tree, random forest, extreme gradient boosting, support vector machines, logistic regression, etc. Finally, these trained classifiers constitute the ensemble model by majority voting, and this ensemble model is used to evaluate the qualities of machine-translated texts. Experimental results show that the proposed evaluation method better measures the qualities of translated texts by some commercial machine translators.
Title: Using binary classification to evaluate the quality of machine translators
Description:
Machine translators have become increasingly popular and currently play an important role because of their great assistance in cross-cultural communication.
However, machine translators often produces some unnatural texts, and an evaluation of machine translators is thus needed to avoid the abuse of machine-translated texts.
This paper presents the use of binary classification to evaluate the quality of machine translators without references.
First, we construct a large-scale dataset including humangenerated texts and machine-translated texts.
Second, the dataset is used to train the multiple binary classifiers, e.
g.
, decision tree, random forest, extreme gradient boosting, support vector machines, logistic regression, etc.
Finally, these trained classifiers constitute the ensemble model by majority voting, and this ensemble model is used to evaluate the qualities of machine-translated texts.
Experimental results show that the proposed evaluation method better measures the qualities of translated texts by some commercial machine translators.
Related Results
BINARY TOPOLOGY BASED ON SOME NEW SETS
BINARY TOPOLOGY BASED ON SOME NEW SETS
In this chapter, we introduce and some new sets called binary -open sets, binary -sets, binary -sets, binary -closed sets, binary -sets and binary -sets , which are simple forms of...
A Critical Analysis of Adaptation, Domestication and Foreignization as Effective Strategies for Translating Shakespeare’s Plays into Assamese
A Critical Analysis of Adaptation, Domestication and Foreignization as Effective Strategies for Translating Shakespeare’s Plays into Assamese
One of the major challenges faced by the translators is finding equivalence in the target language. The translators of Shakespeare plays have used Assamese words as appropriate equ...
Exploring translators’ visibility via peritexts in the English (re)translations of Xi You Ji
Exploring translators’ visibility via peritexts in the English (re)translations of Xi You Ji
Background and Purpose: Research on translators’ visibility traditionally focuses on the analysis of the main text, with very little attention paid to the analysis of paratext. As ...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND
As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
A CORPUS-BASED STUDY OF THE STYLISTIC DIFFERENCES BETWEEN IN-COMING TRANSLATORS AND OUT-GOING TRANSLATORS FOR ENGLISH VERSIONS OF TIANWEN IN THE CONTEXT OF “TRANSLATING CHINA”
A CORPUS-BASED STUDY OF THE STYLISTIC DIFFERENCES BETWEEN IN-COMING TRANSLATORS AND OUT-GOING TRANSLATORS FOR ENGLISH VERSIONS OF TIANWEN IN THE CONTEXT OF “TRANSLATING CHINA”
<p>This study, based on a self-compiled parallel Chinese-English corpus of multiple translations of Tianwen and the Brown Reference Corpus, conducts an in-depth comparative a...
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Abstract
The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...
Audiovisual and literary translators' attitudes toward proofreading
Audiovisual and literary translators' attitudes toward proofreading
Before a translation is published, it undergoes the process of revision. There are two types of revision, self-revision, and other-revision, and the latter can be carried out by a ...

