Javascript must be enabled to continue!
Khmer Semantic Search Engine (KSE): Digital Information Access and Document Retrieval
View through CrossRef
Abstract
The search engine process is crucial for document content retrieval. For Khmer documents, an effective tool is needed to extract essential keywords and facilitate accurate searches. Despite the daily generation of significant Khmer content, Cambodians struggle to find necessary documents due to the lack of an effective semantic searching tool. Even Google does not deliver high accuracy for Khmer content. Semantic search engines improve search results by employing advanced algorithms to understand various content types. With the rise in Khmer digital content—such as reports, articles, and social media feedback—enhanced search capabilities are essential. This research proposes the first Khmer Semantic Search Engine (KSE), designed to enhance traditional Khmer search methods. Utilizing semantic matching techniques and formally annotated semantic content, our tool extracts meaningful keywords from user queries, performs precise matching, and provides the best matching offline documents and online URLs. We propose three semantic search frameworks: semantic search based on a keyword dictionary, semantic search based on ontology, and semantic search based on ranking. Additionally, we developed tools for data preparation, including document addition and manual keyword extraction. To evaluate performance, we created a ground truth dataset and addressed issues related to searching and semantic search. Our findings demonstrate that understanding search term semantics can lead to significantly more accurate results.
Title: Khmer Semantic Search Engine (KSE): Digital Information Access and Document Retrieval
Description:
Abstract
The search engine process is crucial for document content retrieval.
For Khmer documents, an effective tool is needed to extract essential keywords and facilitate accurate searches.
Despite the daily generation of significant Khmer content, Cambodians struggle to find necessary documents due to the lack of an effective semantic searching tool.
Even Google does not deliver high accuracy for Khmer content.
Semantic search engines improve search results by employing advanced algorithms to understand various content types.
With the rise in Khmer digital content—such as reports, articles, and social media feedback—enhanced search capabilities are essential.
This research proposes the first Khmer Semantic Search Engine (KSE), designed to enhance traditional Khmer search methods.
Utilizing semantic matching techniques and formally annotated semantic content, our tool extracts meaningful keywords from user queries, performs precise matching, and provides the best matching offline documents and online URLs.
We propose three semantic search frameworks: semantic search based on a keyword dictionary, semantic search based on ontology, and semantic search based on ranking.
Additionally, we developed tools for data preparation, including document addition and manual keyword extraction.
To evaluate performance, we created a ground truth dataset and addressed issues related to searching and semantic search.
Our findings demonstrate that understanding search term semantics can lead to significantly more accurate results.
Related Results
Theoretical study of laser-cooled SH<sup>–</sup> anion
Theoretical study of laser-cooled SH<sup>–</sup> anion
The potential energy curves, dipole moments, and transition dipole moments for the <inline-formula><tex-math id="M13">\begin{document}${{\rm{X}}^1}{\Sigma ^ + }$\end{do...
The Connection and Extended Development in Making for Khmer Ceramics Culture: A Case Study of Thailand and the Kingdom of Cambodia
The Connection and Extended Development in Making for Khmer Ceramics Culture: A Case Study of Thailand and the Kingdom of Cambodia
The objectives of this research are to study the current status of the earthenware production profession, develop and disseminate knowledge gained from research about the cultural ...
Revisiting near-threshold photoelectron interference in argon with a non-adiabatic semiclassical model
Revisiting near-threshold photoelectron interference in argon with a non-adiabatic semiclassical model
<sec> <b>Purpose:</b> The interaction of intense, ultrashort laser pulses with atoms gives rise to rich non-perturbative phenomena, which are encoded within th...
Comparison of Market, Size and Value Premium of Random Samples in KSE and Non KSE 100 Companies
Comparison of Market, Size and Value Premium of Random Samples in KSE and Non KSE 100 Companies
This study is directed towards the identification of key risk variables that explains the variations in expected stocks’ returns and gives rise to Risk Premium for taking an extra ...
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
A Semantic Orthogonal Mapping Method Through Deep-Learning for Semantic Computing
In order to realize an artificial intelligent system, a basic mechanism should be provided for expressing and processing the semantic. We have presented semantic computing models i...
Development of the Tour Split-Cycle Internal Combustion Engine
Development of the Tour Split-Cycle Internal Combustion Engine
<div class="section abstract"><div class="htmlview paragraph">The Tour engine is a novel split-cycle internal combustion engine (ICE) that divides the four-stroke Otto ...
Integration of KSE Stock Index with Other Asian Countries: Empirical Evidence of Measuring Expected Returns on the Basis of Integration
Integration of KSE Stock Index with Other Asian Countries: Empirical Evidence of Measuring Expected Returns on the Basis of Integration
The purpose of this paper is to investigate the expected returns of KSE stock index based on integration among stock markets of Japan, China, India, Singapore and Malaysia from 200...
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The nature of automatic semantic retrieval in individuals with mild cognitive impairment
The number of people diagnosed with Alzheimer’s disease (AD), a progressive and terminal kind of dementia, continues to rise with an estimated 14 million Americans affected by 2050...

