Javascript must be enabled to continue!

Logično sklepanje v naravnem jeziku za slovenščino

In recent years, large language models have been the most successful approach to natural language processing. An important problem in this field is natural language inference, which requires models to contain relatively broad general knowledge. Moreover, the requirement for models to explain their reasoning can offer additional insights into their functioning. We tested several approaches for natural language inference in Slovene. We used two Slovene large language models, SloBERTa and SloT5, as well as a much larger English model GPT-3.5-turbo. Training data consisted of the Slovene dataset SI-NLI and an additional 50,000 machine-translated samples from the English dataset ESNLI. The SloBERTa model was fine-tuned on both datasets. Fine-tuning it on the SI-NLI dataset achieved a classification accuracy of 73.2% on the SI-NLI test set. Pretraining it on the ESNLI dataset improved its accuracy to 75.3%. We observe that models make different types of errors compared to humans and that they generalize poorly across different datasets. The SloT5 model was also fine-tuned on ESNLI to generate explanations for natural language inference samples. Less than a third of explanations were appropriate, with the model learning common sentence patterns from the domain and producing semantically meaningless explanations. We assume that the tested Slovene large language models with up to several hundred million parameters are capable of identifying and using language patterns, but their language understanding is not necessarily sufficient to understand reality. When the considerably larger GPT-3.5-turbo was used both for classification and explanation generation, it achieved an accuracy of 56.5% on the SI-NLI test set using zero-shot learning, but with 81% of the explanations being appropriate for the correctly classified samples. In comparison with smaller Slovene models, this model shows a reasonable understanding of reality but is limited by its lower Slovene proficiency.

University of Ljubljana

Tim Kmecl Marko Robnik-Šikonja

Slovenščina 2.0: empirične, aplikativne in interdisciplinarne raziskave

2024

Title: Logično sklepanje v naravnem jeziku za slovenščino

Description:

In recent years, large language models have been the most successful approach to natural language processing.

An important problem in this field is natural language inference, which requires models to contain relatively broad general knowledge.

Moreover, the requirement for models to explain their reasoning can offer additional insights into their functioning.

We tested several approaches for natural language inference in Slovene.

We used two Slovene large language models, SloBERTa and SloT5, as well as a much larger English model GPT-3.

5-turbo.

Training data consisted of the Slovene dataset SI-NLI and an additional 50,000 machine-translated samples from the English dataset ESNLI.

The SloBERTa model was fine-tuned on both datasets.

Fine-tuning it on the SI-NLI dataset achieved a classification accuracy of 73.

2% on the SI-NLI test set.

Pretraining it on the ESNLI dataset improved its accuracy to 75.

3%.

We observe that models make different types of errors compared to humans and that they generalize poorly across different datasets.

The SloT5 model was also fine-tuned on ESNLI to generate explanations for natural language inference samples.

Less than a third of explanations were appropriate, with the model learning common sentence patterns from the domain and producing semantically meaningless explanations.

We assume that the tested Slovene large language models with up to several hundred million parameters are capable of identifying and using language patterns, but their language understanding is not necessarily sufficient to understand reality.

When the considerably larger GPT-3.

5-turbo was used both for classification and explanation generation, it achieved an accuracy of 56.

5% on the SI-NLI test set using zero-shot learning, but with 81% of the explanations being appropriate for the correctly classified samples.

In comparison with smaller Slovene models, this model shows a reasonable understanding of reality but is limited by its lower Slovene proficiency.

Back

Namen: Namen članka je preveriti berljivost izvlečkov s področja bibliotekarstva in informacijske znanosti na primeru izvlečkov iz revije Knjižnica. Zanimala nas je razlika med ber...

Etimološko-semantički opis lekseme “hikmet” i drugih riječi deriviranih iz istoga korijena

U radu smo etimologizirali orijentalnu leksemu hikmet (ḥikma), semantički je opisali, potvrdili polisemičnost ove lekseme u jeziku izvorniku, ali i u jeziku primaocu – arapskom / b...

Dva jezika irske poezije na primeru pesnika Michaela Hartnetta

Članek na primeru pesnika Michaela Hartnetta prikazuje del problematike jezika na Irskem, ki je v književnosti velikokrat tematizirana. Hartnett se sprva uveljavi kot pesnik v angl...

Corrigendum / Popravek

At the paper »Silver fir (Abies alba Mill.) ectomycorrhiza across its geographic areal – a review approach« (»Ektomikorizni simbionti bele jelke (Abies alba Mill.) na naravnem obmo...

Djece je sve manje ali su potrebe sve veće

Evidentno je da je demografski trend Republike Hrvatske u silaznoj putanji kao i u većini Europskih zemalja. Jedan od razloga je smanjenje broja žena u fertilnoj dobi jer žene sve ...

Izvor pripon -ar, -lj-ar in -n-ar v slovenščini

V prispevku je obravnavan izvor pripon -ar (kajžar, gruntar, kočar, bajtar), -lj-ar (kajžljar, kočljar, bajtljar) in -n-ar (gruntnar, kočnar, bajtnar) v narečni in knjižni slovenšč...

Župnijske kronike — kdaj slovenščina prevlada nad nemščino?

V članku se lotevamo vprašanja, kdaj je v župnijskih kronikalnih besedilih nemščino nasledila slovenščina in nad njo postopoma prevladala ter v katerih zvrsteh kronik se je to v sl...

Vpliv informativne literature v izbranih elektronskih virih na razvoj gradnikov bralne pismenosti

V prispevku je preverjeno, kako informativna literatura v izbranih elektronskih virih (i-učbenik, učno e-okolje, spletna stran) vpliva na razvoj bralne pismenosti, tj. na zmožnost ...

Email:
Password:

Email:

Logično sklepanje v naravnem jeziku za slovenščino

Related Results