Javascript must be enabled to continue!

Chinese Medical Paraphrase Generation: Based on Neural Machine Translation

Abstract Background: As people prefer to obtain medical knowledge online, medical intelligence question-answer systems based on question matching have attracted more and more attention, especially in China. However, due to the lack of paraphrase corpus of medical question, the development of this field is limited.Objective: We propose a method for paraphrase generation which suitable for the Chinese medical field and use deep learning models instead of artificial evaluation for the first time. The method is designed to be able to automatically construct high quality Chinese medical paraphrase.Methods: Validation experiments were carried out on two Chinese paraphrase data (one is general data, the other is medical data). Neural machine translation is used to generated paraphrase, that is, translate a sentence into other languages, and then reverse-translate it back to the original language to get the corresponding paraphrase. BLUE, ROUGEs, are used as quantitative evaluation metrics. Three deep text matching models are used to evaluate the generated paraphrase, instead of manual. Precision, Recall, F1 and AUC are used as qualitative evaluation metrics.Results: 49908 and 4062 paraphrases were generated on the two datasets, and the generated efficiency was 97.03% and 98.38%, respectively. For the data in the two fields, the generated and original paraphrase pairs are very similar at the quantitative and qualitative evaluation metrics, especially the medical field. Take medical data as example, BLUE of generated and original paraphrase pairs are 0.556 and 0.626, respectively; the mean difference of AUC between the two groups was 0.015. Conclusions: We first propose a paraphrase generation method based on neural machine translation and use deep text matching model instead of manual evaluation to evaluate the generated paraphrase. By analyzing the evaluation metrics, it can be concluded that：the paraphrase generated method has reached or even exceeded the level of artificial construction at the semantic level, especially in medical field; the deep text matching model can replace manual evaluation and realize automated paraphrase generation. This is of great significance to the development of Chinese medical paraphrase generation.

Research Square Platform LLC

Bo Sun Fei Zhang Jing Yuan Zhao Wei Shu Ting

2021

Title: Chinese Medical Paraphrase Generation: Based on Neural Machine Translation

Description:

However, due to the lack of paraphrase corpus of medical question, the development of this field is limited.

Objective: We propose a method for paraphrase generation which suitable for the Chinese medical field and use deep learning models instead of artificial evaluation for the first time.

The method is designed to be able to automatically construct high quality Chinese medical paraphrase.

Methods: Validation experiments were carried out on two Chinese paraphrase data (one is general data, the other is medical data).

Neural machine translation is used to generated paraphrase, that is, translate a sentence into other languages, and then reverse-translate it back to the original language to get the corresponding paraphrase.

BLUE, ROUGEs, are used as quantitative evaluation metrics.

Three deep text matching models are used to evaluate the generated paraphrase, instead of manual.

Precision, Recall, F1 and AUC are used as qualitative evaluation metrics.

Results: 49908 and 4062 paraphrases were generated on the two datasets, and the generated efficiency was 97.

03% and 98.

38%, respectively.

For the data in the two fields, the generated and original paraphrase pairs are very similar at the quantitative and qualitative evaluation metrics, especially the medical field.

Take medical data as example, BLUE of generated and original paraphrase pairs are 0.

556 and 0.

626, respectively; the mean difference of AUC between the two groups was 0.

015.

Conclusions: We first propose a paraphrase generation method based on neural machine translation and use deep text matching model instead of manual evaluation to evaluate the generated paraphrase.

By analyzing the evaluation metrics, it can be concluded that：the paraphrase generated method has reached or even exceeded the level of artificial construction at the semantic level, especially in medical field; the deep text matching model can replace manual evaluation and realize automated paraphrase generation.

This is of great significance to the development of Chinese medical paraphrase generation.

Back

We have entered into the era of artificial intelligence, neural machine translation, and especially large language models which have dramatically changed the landscape of human tra...

Translation

The theoretical, empirical, and pedagogic study of translation is the concern of the interdisciplinary and international field of scholarship known, since 1972, as translation stud...

SPECIFIC TRAITS OF HUNGARIAN-UKRAINIAN POETRY TRANSLATION (BASED ON YURII SHKROBYNETS’ TRANSLATIONS)

The article addresses matters related to the peculiarities of Hungarian-Ukrainian poetic translation. It was noted that the quality, complexity and overall mastery of literary tran...

Cultranslatology in China

Culture has long been noticed in translation practice, and theoretical research on translation and culture has a history of over 40 years. Unlike the cultural schools of translatio...

Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika

Genre implies formal and stylistic conventions of a particular text type, which inevitably affects the translation process. This „force of genre bias“ (Prieto Ramos, 2014) has been...

Urdu Short Paraphrase Detection at Sentence Level

Paraphrase detection systems uncover the relationship between two text fragments and classify them as paraphrased when they convey the same idea; otherwise non-paraphrased. Previou...

CONSTRUCTION AND APPLICATION OF THE LACUNA’S TRANSLATION MODEL IN MODERN LINGUISTICS

The cultural turn makes translation shift from word – text to cultural register. According to the view of cultural translation, culture serves as the translational unit. The lacuna...

Translation Divergences in Chinese–English Machine Translation: An Empirical Investigation

In this article, we conduct an empirical investigation of translation divergences between Chinese and English relying on a parallel treebank. To do this, we first devise a hierarch...

Email:
Password:

Email:

Chinese Medical Paraphrase Generation: Based on Neural Machine Translation

Related Results