Javascript must be enabled to continue!

Using binary classification to evaluate the quality of machine translators

Machine translators have become increasingly popular and currently play an important role because of their great assistance in cross-cultural communication. However, machine translators often produces some unnatural texts, and an evaluation of machine translators is thus needed to avoid the abuse of machine-translated texts. This paper presents the use of binary classification to evaluate the quality of machine translators without references. First, we construct a large-scale dataset including humangenerated texts and machine-translated texts. Second, the dataset is used to train the multiple binary classifiers, e.g., decision tree, random forest, extreme gradient boosting, support vector machines, logistic regression, etc. Finally, these trained classifiers constitute the ensemble model by majority voting, and this ensemble model is used to evaluate the qualities of machine-translated texts. Experimental results show that the proposed evaluation method better measures the qualities of translated texts by some commercial machine translators.

Elsevier BV

Ran Li Yihao Yang Kelin Shen Mohammed Hijji

Kuwait Journal of Science

2022

Title: Using binary classification to evaluate the quality of machine translators

Description:

Machine translators have become increasingly popular and currently play an important role because of their great assistance in cross-cultural communication.

However, machine translators often produces some unnatural texts, and an evaluation of machine translators is thus needed to avoid the abuse of machine-translated texts.

This paper presents the use of binary classification to evaluate the quality of machine translators without references.

First, we construct a large-scale dataset including humangenerated texts and machine-translated texts.

Second, the dataset is used to train the multiple binary classifiers, e.

, decision tree, random forest, extreme gradient boosting, support vector machines, logistic regression, etc.

Finally, these trained classifiers constitute the ensemble model by majority voting, and this ensemble model is used to evaluate the qualities of machine-translated texts.

Experimental results show that the proposed evaluation method better measures the qualities of translated texts by some commercial machine translators.

Back

Related Results

BINARY TOPOLOGY BASED ON SOME NEW SETS

In this chapter, we introduce and some new sets called binary -open sets, binary -sets, binary -sets, binary -closed sets, binary -sets and binary -sets , which are simple forms of...

A Critical Analysis of Adaptation, Domestication and Foreignization as Effective Strategies for Translating Shakespeare’s Plays into Assamese

One of the major challenges faced by the translators is finding equivalence in the target language. The translators of Shakespeare plays have used Assamese words as appropriate equ...

Exploring translators’ visibility via peritexts in the English (re)translations of Xi You Ji

Background and Purpose: Research on translators’ visibility traditionally focuses on the analysis of the main text, with very little attention paid to the analysis of paratext. As ...

Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing

Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)

BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...

A CORPUS-BASED STUDY OF THE STYLISTIC DIFFERENCES BETWEEN IN-COMING TRANSLATORS AND OUT-GOING TRANSLATORS FOR ENGLISH VERSIONS OF TIANWEN IN THE CONTEXT OF “TRANSLATING CHINA”

<p>This study, based on a self-compiled parallel Chinese-English corpus of multiple translations of Tianwen and the Brown Reference Corpus, conducts an in-depth comparative a...

Abstract The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...

Audiovisual and literary translators' attitudes toward proofreading

Before a translation is published, it undergoes the process of revision. There are two types of revision, self-revision, and other-revision, and the latter can be carried out by a ...

Email:
Password:

Email: