
Fusion of Fast-text and Indo-Wordnet for Disambiguation of Word Sense in the Marathi Language

This research combines the fastText model with Indo-WordNet to address word sense disambiguation (WSD) in Marathi. The initial version of the algorithm used word-pair matching to detect overlap between the items in the "context bag" and the "sense bag" derived from the lexical resource WordNet. The current method instead computes the overlap with a semantic similarity metric based on fastText subword embeddings. This approach handles previously unseen word forms while still capturing the meaning of the terms. WSD has seen significant progress for English and many European languages, but it remains a substantial challenge for Marathi and other languages spoken in India. The Marathi text corpus, sourced from the Government of India, comprises a large collection of Marathi sentences. The dataset used in this study consisted of the Indo-WordNet for Marathi and the Marathi Online Dictionary. The experimental results are promising. Target words whose WordNet synsets are semantically distinct achieve a high F1 score. The achieved F1 score of 89% is above the baseline and represents a substantial advance over previous knowledge-based methods for low-resource Indian languages.
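The difference between exact word-pair matching and a similarity-based overlap can be illustrated with a small sketch. This is not the authors' implementation. It assumes that character n-gram count vectors can stand in for trained fastText subword embeddings, and it uses English toy words for readability. The names `subword_vector`, `soft_overlap`, and `disambiguate` are hypothetical.

```python
import math
from collections import Counter


def subword_vector(word, n_min=3, n_max=5):
    """Toy stand-in for a fastText vector: counts of character n-grams.

    Assumption: a real system would look up trained subword embeddings instead.
    """
    padded = f"<{word}>"  # boundary markers, in the style of fastText
    grams = Counter()
    for n in range(n_min, n_max + 1):
        for i in range(len(padded) - n + 1):
            grams[padded[i:i + n]] += 1
    return grams


def cosine(u, v):
    """Cosine similarity between two sparse n-gram count vectors."""
    dot = sum(count * v[gram] for gram, count in u.items())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def exact_overlap(context_bag, sense_bag):
    """Overlap by exact word matching, as in the initial version."""
    return len(set(context_bag) & set(sense_bag))


def soft_overlap(context_bag, sense_bag):
    """Mean, over context words, of the best similarity to any sense-bag word."""
    if not context_bag or not sense_bag:
        return 0.0
    sense_vecs = [subword_vector(w) for w in sense_bag]
    best = [max(cosine(subword_vector(w), sv) for sv in sense_vecs)
            for w in context_bag]
    return sum(best) / len(best)


def disambiguate(context_bag, sense_bags):
    """Return the sense label whose bag has the highest soft overlap."""
    return max(sense_bags, key=lambda s: soft_overlap(context_bag, sense_bags[s]))


# Hypothetical toy data: inflected context words that never appear
# verbatim in either sense bag.
sense_bags = {
    "finance": ["money", "deposit", "loan"],
    "river": ["water", "shore", "stream"],
}
context = ["deposits", "loans"]
chosen = disambiguate(context, sense_bags)
```

With this toy data, exact matching finds no overlap for either sense, because "deposits" and "loans" do not occur verbatim in the sense bags. The soft overlap is positive for "finance" and zero for "river", so `chosen` is `"finance"`. Trained embeddings would additionally relate words that share meaning but no characters, which count vectors cannot do.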
