Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Handling Compound Hindi OOV words in web queries

View through CrossRef
Abstract Handling of Out of Vocabulary (OOV) words is still a problem in NLP. If there is a word for which morphological analyser is not able to find a morpheme, that word is known as OOV word. These words if not identified, may restrict to understand the proper meaning of the sentence. It may also have severe impact on the IR system involving the queries. Detection and identification of OOV words in information retrieval is a challenging task. This problem may become more challenging in case of cross lingual information retrieval (CLIR) due to issues in query translation. The objective of this paper is to understand the impact of web queries involving these words on the retrieval effectiveness of web searches. Subsequently, we have also proposed an algorithm to successfully detect and handle the impact of Hindi web queries involving compound OOV. Our results have shown increased precision of 8.53% for one-word web queries involving only OOV word and 15.68% with queries having more than one word having at least one OOV word.
Title: Handling Compound Hindi OOV words in web queries
Description:
Abstract Handling of Out of Vocabulary (OOV) words is still a problem in NLP.
If there is a word for which morphological analyser is not able to find a morpheme, that word is known as OOV word.
These words if not identified, may restrict to understand the proper meaning of the sentence.
It may also have severe impact on the IR system involving the queries.
Detection and identification of OOV words in information retrieval is a challenging task.
This problem may become more challenging in case of cross lingual information retrieval (CLIR) due to issues in query translation.
The objective of this paper is to understand the impact of web queries involving these words on the retrieval effectiveness of web searches.
Subsequently, we have also proposed an algorithm to successfully detect and handle the impact of Hindi web queries involving compound OOV.
Our results have shown increased precision of 8.
53% for one-word web queries involving only OOV word and 15.
68% with queries having more than one word having at least one OOV word.

Related Results

Isolation, characterization and semi-synthesis of natural products dimeric amide alkaloids
Isolation, characterization and semi-synthesis of natural products dimeric amide alkaloids
 Isolation, characterization of natural products dimeric amide alkaloids from roots of the Piper chaba Hunter. The synthesis of these products using intermolecular [4+2] cycloaddit...
Graph-based interactive bibliographic information retrieval systems
Graph-based interactive bibliographic information retrieval systems
In the big data era, we have witnessed the explosion of scholarly literature. This explosion has imposed challenges to the retrieval of bibliographic information. Retrieval of inte...
Compound Words Found in Seventy-Seven Thousand Service-Trees (Sri Chinmoy)
Compound Words Found in Seventy-Seven Thousand Service-Trees (Sri Chinmoy)
This research deals with compound words used in Seventy-Seven Thousand Service-Trees by Sri Chinmoy. The case is many people did not recognize and aware of using it. This research ...
Računalno potpomognuto usmjeravanje kod dvojezičnih govornika
Računalno potpomognuto usmjeravanje kod dvojezičnih govornika
This thesis investigates whether modern computer models can confirm how people encounter words and then use these findings in didactics. In recent years, computers have been used i...
The Making of Modern Hindi
The Making of Modern Hindi
The Making of Modern Hindi examines the politics and processes of making Hindi modern at a formative moment in India’s history, when British imperialism was at its peak and anti-co...
Ab. No. 60 Translation and Cross-Cultural Adaptation of Health Literacy Instrument for Adults in Hindi
Ab. No. 60 Translation and Cross-Cultural Adaptation of Health Literacy Instrument for Adults in Hindi
Introduction: Physiotherapists play a significant role in health promotion and wellness. Health literacy can help people prevent health problems, protect health and bet...
Analysis of The English Closed Compound Words
Analysis of The English Closed Compound Words
    Compound Words is a part of elements that finding in morphology. Morphology is learning about morpheme and morpheme is the element of  language  that have meaning and also su...

Back to Top