Javascript must be enabled to continue!
SIMPITIKI: a Simplification corpus for Italian
View through CrossRef
In this work, we analyse whether Wikipedia can be used to leverage simplification pairs instead of Simple Wikipedia, which has proved unreliable for assessing automatic simplification systems, and is available only in English. We focus on sentence pairs in which the target sentence is the outcome of a Wikipedia edit marked as ‘simplified’, and manually annotate simplification phenomena following an existing scheme proposed for previous simplification corpora in Italian. The outcome of this work is the SIMPITIKI corpus, which we make freely available, with pairs of sentences extracted from Wikipedia edits and annotated with simplification types. The resource contains also another corpus with roughly the same number of simplifications, which was manually created by simplifying documents in the administrative domain.
Title: SIMPITIKI: a Simplification corpus for Italian
Description:
In this work, we analyse whether Wikipedia can be used to leverage simplification pairs instead of Simple Wikipedia, which has proved unreliable for assessing automatic simplification systems, and is available only in English.
We focus on sentence pairs in which the target sentence is the outcome of a Wikipedia edit marked as ‘simplified’, and manually annotate simplification phenomena following an existing scheme proposed for previous simplification corpora in Italian.
The outcome of this work is the SIMPITIKI corpus, which we make freely available, with pairs of sentences extracted from Wikipedia edits and annotated with simplification types.
The resource contains also another corpus with roughly the same number of simplifications, which was manually created by simplifying documents in the administrative domain.
Related Results
Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika
Žanrovska analiza pomorskopravnih tekstova i ostvarenje prijevodnih univerzalija u njihovim prijevodima s engleskoga jezika
Genre implies formal and stylistic conventions of a particular text type, which inevitably affects the translation process. This „force of genre bias“ (Prieto Ramos, 2014) has been...
Numerical Simplification and its Effect on Fragment Distributions in Genetic Programming
Numerical Simplification and its Effect on Fragment Distributions in Genetic Programming
<p>In tree-based genetic programming (GP) there is a tendency for the program trees to increase in size from one generation to the next. If this increase in program size is n...
Semantically Enriched Simplification of Trajectories
Semantically Enriched Simplification of Trajectories
Abstract. Moving objects that are equipped with GPS devices generate huge volumes of spatio-temporal data. This spatial and temporal information is used in tracing the path travell...
L’insegnamento dell’italiano a stranieri
Alcune coordinate di riferimento per gli anni Venti
L’insegnamento dell’italiano a stranieri
Alcune coordinate di riferimento per gli anni Venti
This book develops the theme of teaching Italian abroad, starting from the awareness of the motivations for foreign students to study the Italian language and the different methodo...
Corpus Linguistics at Work
Corpus Linguistics at Work
The book offers a combined discussion of the main theoretical, methodological and application issues related to corpus work. Thus, starting from the definition of what is a corpus ...
Research Status and Current Problems of Corpus Linguistics in China
Research Status and Current Problems of Corpus Linguistics in China
After more than 40 years of development, China has made significant achievements in corpus-based research, while problems still remain: corpus-based studies of linguistic phenomena...
I ovdi i ondi, i gori i doli. Poimanje doma, domovine i zavičaja kod hrvatskih iseljenika, povratnika i transmigranata
I ovdi i ondi, i gori i doli. Poimanje doma, domovine i zavičaja kod hrvatskih iseljenika, povratnika i transmigranata
The aim of this article is to present one aspect discussed by informants from three different studies on Croatian (trans)migrants and returnees: a study of recent generations of Cr...
LOOPS: LOcally Optimized Polygon Simplification
LOOPS: LOcally Optimized Polygon Simplification
AbstractDisplaying polygonal vector data is essential in various application scenarios such as geometry visualization, vector graphics rendering, CAD drawing and in particular geog...

