Recognizing Textual Entailment: Challenges in the Portuguese Language

Recognizing textual entailment comprises the task of determining semantic entailment relations between text fragments. A text fragment entails another text fragment if, from the meaning of the former, one can infer the meaning of the latter. If such a relation is bidirectional, then we are in the presence of a paraphrase. Automatically recognizing textual entailment relations captures major semantic inference needs in several natural language processing (NLP) applications. As in many NLP tasks, textual entailment corpora for English abound, while the same is not true for more resource-scarce languages such as Portuguese. In this paper, exploiting what seems to be the only Portuguese corpus for textual entailment and paraphrases (the ASSIN corpus), we address the task of automatically recognizing textual entailment (RTE) and paraphrases in text written in Portuguese by employing supervised machine learning techniques. We employ lexical, syntactic, and semantic features, and analyze the impact of semantic-based approaches on the performance of the system. We then try to take advantage of the bi-dialect nature of ASSIN to compensate for its limited size. With the same aim, we explore modeling the task of recognizing textual entailment and paraphrases as a binary classification problem, by treating a paraphrase as a bidirectional entailment relationship. Addressing the task as a multi-class classification problem, we achieve results in line with the winner of the ASSIN Challenge. In addition, we conclude that semantic-based approaches are promising in this task, and that combining data from European and Brazilian Portuguese is less straightforward than it may initially seem. The binary classification modeling of the problem does not seem to bring advantages over the original multi-class model, despite the outstanding results obtained by the binary classifier for recognizing textual entailments.
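The binary reformulation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `Pair` type, the label strings, and the choice to emit only the forward direction for entailment pairs are all assumptions made for illustration. A paraphrase pair is expanded into two directional entailment instances, and `word_overlap` stands in for one very simple lexical feature of the kind a supervised classifier might consume.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pair:
    """A labeled sentence pair (hypothetical container, not the ASSIN schema)."""
    text: str
    hypothesis: str
    label: str  # assumed label names: "entailment", "paraphrase", "none"


def to_binary_instances(pairs):
    """Expand multi-class pairs into (premise, hypothesis, 0/1) instances."""
    instances = []
    for p in pairs:
        if p.label == "paraphrase":
            # A paraphrase is read as entailment in both directions.
            instances.append((p.text, p.hypothesis, 1))
            instances.append((p.hypothesis, p.text, 1))
        elif p.label == "entailment":
            # Assumption: only the annotated direction is emitted.
            instances.append((p.text, p.hypothesis, 1))
        else:
            instances.append((p.text, p.hypothesis, 0))
    return instances


def word_overlap(text, hypothesis):
    """Fraction of hypothesis tokens that also occur in the text."""
    text_tokens = set(text.lower().split())
    hyp_tokens = set(hypothesis.lower().split())
    if not hyp_tokens:
        return 0.0
    return len(text_tokens & hyp_tokens) / len(hyp_tokens)


if __name__ == "__main__":
    pairs = [
        Pair("O menino comprou um livro", "O rapaz adquiriu um livro", "paraphrase"),
        Pair("O menino comprou um livro caro", "O menino comprou um livro", "entailment"),
        Pair("O menino comprou um livro", "Choveu ontem em Lisboa", "none"),
    ]
    for premise, hypothesis, target in to_binary_instances(pairs):
        print(target, round(word_overlap(premise, hypothesis), 2), premise, "=>", hypothesis)
```

Doubling the paraphrase pairs is one way such a reformulation can enlarge a small training set, which matches the stated aim of compensating for the limited size of the corpus.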
