Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Chinese Medical Paraphrase Generation: Based on Neural Machine Translation

View through CrossRef
Abstract Background: As people prefer to obtain medical knowledge online, medical intelligence question-answer systems based on question matching have attracted more and more attention, especially in China. However, due to the lack of paraphrase corpus of medical question, the development of this field is limited.Objective: We propose a method for paraphrase generation which suitable for the Chinese medical field and use deep learning models instead of artificial evaluation for the first time. The method is designed to be able to automatically construct high quality Chinese medical paraphrase.Methods: Validation experiments were carried out on two Chinese paraphrase data (one is general data, the other is medical data). Neural machine translation is used to generated paraphrase, that is, translate a sentence into other languages, and then reverse-translate it back to the original language to get the corresponding paraphrase. BLUE, ROUGEs, are used as quantitative evaluation metrics. Three deep text matching models are used to evaluate the generated paraphrase, instead of manual. Precision, Recall, F1 and AUC are used as qualitative evaluation metrics.Results: 49908 and 4062 paraphrases were generated on the two datasets, and the generated efficiency was 97.03% and 98.38%, respectively. For the data in the two fields, the generated and original paraphrase pairs are very similar at the quantitative and qualitative evaluation metrics, especially the medical field. Take medical data as example, BLUE of generated and original paraphrase pairs are 0.556 and 0.626, respectively; the mean difference of AUC between the two groups was 0.015. Conclusions: We first propose a paraphrase generation method based on neural machine translation and use deep text matching model instead of manual evaluation to evaluate the generated paraphrase. By analyzing the evaluation metrics, it can be concluded that:the paraphrase generated method has reached or even exceeded the level of artificial construction at the semantic level, especially in medical field; the deep text matching model can replace manual evaluation and realize automated paraphrase generation. This is of great significance to the development of Chinese medical paraphrase generation.
Title: Chinese Medical Paraphrase Generation: Based on Neural Machine Translation
Description:
Abstract Background: As people prefer to obtain medical knowledge online, medical intelligence question-answer systems based on question matching have attracted more and more attention, especially in China.
However, due to the lack of paraphrase corpus of medical question, the development of this field is limited.
Objective: We propose a method for paraphrase generation which suitable for the Chinese medical field and use deep learning models instead of artificial evaluation for the first time.
The method is designed to be able to automatically construct high quality Chinese medical paraphrase.
Methods: Validation experiments were carried out on two Chinese paraphrase data (one is general data, the other is medical data).
Neural machine translation is used to generated paraphrase, that is, translate a sentence into other languages, and then reverse-translate it back to the original language to get the corresponding paraphrase.
BLUE, ROUGEs, are used as quantitative evaluation metrics.
Three deep text matching models are used to evaluate the generated paraphrase, instead of manual.
Precision, Recall, F1 and AUC are used as qualitative evaluation metrics.
Results: 49908 and 4062 paraphrases were generated on the two datasets, and the generated efficiency was 97.
03% and 98.
38%, respectively.
For the data in the two fields, the generated and original paraphrase pairs are very similar at the quantitative and qualitative evaluation metrics, especially the medical field.
Take medical data as example, BLUE of generated and original paraphrase pairs are 0.
556 and 0.
626, respectively; the mean difference of AUC between the two groups was 0.
015.
Conclusions: We first propose a paraphrase generation method based on neural machine translation and use deep text matching model instead of manual evaluation to evaluate the generated paraphrase.
By analyzing the evaluation metrics, it can be concluded that:the paraphrase generated method has reached or even exceeded the level of artificial construction at the semantic level, especially in medical field; the deep text matching model can replace manual evaluation and realize automated paraphrase generation.
This is of great significance to the development of Chinese medical paraphrase generation.

Related Results

Cultranslatology in China
Cultranslatology in China
Culture has long been noticed in translation practice, and theoretical research on translation and culture has a history of over 40 years. Unlike the cultural schools of translatio...
Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika
Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika
Genre implies formal and stylistic conventions of a particular text type, which inevitably affects the translation process. This „force of genre bias“ (Prieto Ramos, 2014) has been...
Urdu Short Paraphrase Detection at Sentence Level
Urdu Short Paraphrase Detection at Sentence Level
Paraphrase detection systems uncover the relationship between two text fragments and classify them as paraphrased when they convey the same idea; otherwise non-paraphrased. Previou...
Metode dan Prosedur Penerjemahan Quote Anime Bahasa Jepang ke dalam Bahasa Indonesia pada Official Akun LINE Bahasa Jepang Bersama
Metode dan Prosedur Penerjemahan Quote Anime Bahasa Jepang ke dalam Bahasa Indonesia pada Official Akun LINE Bahasa Jepang Bersama
The title of this research is “Method and Procedure of Translation used in Anime Quotes from Japanese to Indonesia at Official Account LINE Bahasa Jepang Bersama”. The purpose of t...
Audiovisual translation and media accessibility training in the EMT network
Audiovisual translation and media accessibility training in the EMT network
The increase in demand for the localisation of audiovisual media content has led to increased incorporation of audiovisual translation and accessibility modules into university cur...
Neural Machine Translation and Multilingual NLP: A Survey of Methods, Architectures, and Applications
Neural Machine Translation and Multilingual NLP: A Survey of Methods, Architectures, and Applications
Neural machine translation (NMT) has revolutionized the field of natural language processing by enabling high-quality automatic translation between languages using deep neural netw...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
Betti Alver Maksim Gorki „Lapsepõlve“ tõlkijana / Betti Alver as a Maksim Gorky’s “My Childhood” translator
Betti Alver Maksim Gorki „Lapsepõlve“ tõlkijana / Betti Alver as a Maksim Gorky’s “My Childhood” translator
Artiklis vaadeldakse Maksim Gorki eestindusi nende ajaloolises kontekstis ja tuuakse välja põhimõttelised erinevused 20. sajandi alguse tõlgete ja varaste nõukogudeaegsete tõlgete ...

Back to Top