Javascript must be enabled to continue!
Vojko Gorjanc, Korpus FidaPLUS: nova generacija slovenskega referenčnega korpusa
View through CrossRef
The article presents the FidaPLUS corpus, an upgrade of the first Slovene reference corpusFida. FidaPLUS is an extensive collection of electronic texts which represents a balanced sample of heterogeneous contemporary texts in Slovene. In addition, the corpus is accompanied by a powerful, information-based concordancer, freely available for general use on the internet. The article focuses on the presentation of the improvements made to the new reference corpus, i.e. the improved lemmatization of corpus texts, new statistical methods for the collocator search, an upgrade of the concordancer interface, and the creation of an information network needed for the corpus analysis by the user. Information concerning the very structure of the corpus is also provided, since comprehension of the corpus composition is vital for the interpretation of language information. An attempt is also made to place the new corpus within the Slovene research area as a significant milestone not only for corpus studies, but for linguistics in general.
Title: Vojko Gorjanc, Korpus FidaPLUS: nova generacija slovenskega referenčnega korpusa
Description:
The article presents the FidaPLUS corpus, an upgrade of the first Slovene reference corpusFida.
FidaPLUS is an extensive collection of electronic texts which represents a balanced sample of heterogeneous contemporary texts in Slovene.
In addition, the corpus is accompanied by a powerful, information-based concordancer, freely available for general use on the internet.
The article focuses on the presentation of the improvements made to the new reference corpus, i.
e.
the improved lemmatization of corpus texts, new statistical methods for the collocator search, an upgrade of the concordancer interface, and the creation of an information network needed for the corpus analysis by the user.
Information concerning the very structure of the corpus is also provided, since comprehension of the corpus composition is vital for the interpretation of language information.
An attempt is also made to place the new corpus within the Slovene research area as a significant milestone not only for corpus studies, but for linguistics in general.
Related Results
Prezimljavanje potkornjaka
Prezimljavanje potkornjaka
Prezimljavanje potkornjaka je vrlo zamršeno pitanje. Ono je i interesantno i čudnovato. Uzrok tome leži u činjenici da većina vrsta potkornjaka ima više generacija godišnje, a broj...
SAVREMENI PRISTUP OBRAZOVANJU DJECE I MLADIH KROZ KONCEPT UČENJA NA DALJINU
SAVREMENI PRISTUP OBRAZOVANJU DJECE I MLADIH KROZ KONCEPT UČENJA NA DALJINU
Današnji učenici su se radikalno promijenili. Oni nisu promijenili samo odjeću i stil u odnosu na prijašnje, nego na potpuno drugačiji način razmišljaju i obrađuju informacije od s...
PERLUASAN MAKNA KATA “VIRAL” DALAM TEKS BERBASIS KORPUS LCC INDONESIA 2023 DI CQPWEB
PERLUASAN MAKNA KATA “VIRAL” DALAM TEKS BERBASIS KORPUS LCC INDONESIA 2023 DI CQPWEB
Perkembangan ilmu dan teknologi diiringi oleh perkembangan bahasa yang ditunjukkan dengan
munculya istilah baru atau konsep perubahan makna pada kata yang sudah ada sebelumnya.
Sal...
PENYUSUNAN KAMUS KEDOKTERAN ARAB-INDONESIA DENGAN PENDEKATAN LINGUISTIK KORPUS
PENYUSUNAN KAMUS KEDOKTERAN ARAB-INDONESIA DENGAN PENDEKATAN LINGUISTIK KORPUS
Kamus Kedokteran ini merupakan data khusus yang memuat tentang dunia kedokteran (mu’jam takshish) yang secara khusus memuat istilah-istilah yang populer dipakai di ranah kedoktera...
Gigafida in slWaC: tematska primerjava
Gigafida in slWaC: tematska primerjava
V prispevku analiziramo dvoje: (a) vključevanje besedil z interneta v obstoječe referenčne korpuse, ki ga soočamo z obstojem spletnih korpusov, ter (b) dva najnovejša korpusa slove...
Gradnja novega korpusa slovenščine
Gradnja novega korpusa slovenščine
The article presents the initial work in the construction of a new reference corpus for Slovene. This will be an upgrade of the FidaPLUS corpus and will be divided into two parts: ...
Comparative Analysis of Syntax and Semantics Sering with Jingchang
Comparative Analysis of Syntax and Semantics Sering with Jingchang
The learning of a foreign language cannot be separated from an understanding of the structure and characteristics of that language, especially in terms of understanding grammar. Th...
CLIP Identifies Nova-Regulated RNA Networks in the Brain
CLIP Identifies Nova-Regulated RNA Networks in the Brain
Nova proteins are neuron-specific antigens targeted in paraneoplastic opsoclonus myoclonus ataxia (POMA), an autoimmune neurologic disease characterized by abnormal motor inhibitio...

