Khmer Semantic Search Engine (KSE): Digital Information Access and Document Retrieval
Abstract
The search engine process is crucial for document content retrieval. For Khmer documents, an effective tool is needed to extract essential keywords and facilitate accurate searches. Despite the daily generation of significant Khmer content, Cambodians struggle to find necessary documents due to the lack of an effective semantic searching tool. Even Google does not deliver high accuracy for Khmer content. Semantic search engines improve search results by employing advanced algorithms to understand various content types. With the rise in Khmer digital content—such as reports, articles, and social media feedback—enhanced search capabilities are essential. This research proposes the first Khmer Semantic Search Engine (KSE), designed to enhance traditional Khmer search methods. Utilizing semantic matching techniques and formally annotated semantic content, our tool extracts meaningful keywords from user queries, performs precise matching, and provides the best matching offline documents and online URLs. We propose three semantic search frameworks: semantic search based on a keyword dictionary, semantic search based on ontology, and semantic search based on ranking. Additionally, we developed tools for data preparation, including document addition and manual keyword extraction. To evaluate performance, we created a ground truth dataset and addressed issues related to searching and semantic search. Our findings demonstrate that understanding search term semantics can lead to significantly more accurate results.
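As a rough illustration of how the dictionary-based and ranking-based frameworks described in the abstract might fit together, here is a minimal Python sketch. It is not the authors' implementation. The dictionary entries, the function names `extract_concepts` and `rank`, and the overlap-based score are all assumptions made for the example. Because Khmer text is normally written without spaces between words, the sketch uses a greedy longest-match scan against the dictionary instead of whitespace tokenization.

```python
from collections import Counter

# Hypothetical keyword dictionary: Khmer surface form -> canonical concept.
# These entries are illustrative only and are not taken from the paper.
KEYWORD_DICT = {
    "សាលា": "school",
    "សាលារៀន": "school",
    "សិស្ស": "student",
    "គ្រូ": "teacher",
}


def extract_concepts(text, dictionary=KEYWORD_DICT):
    """Greedy longest-match scan; returns the concepts found in text, in order."""
    max_len = max(len(key) for key in dictionary)
    concepts = []
    i = 0
    while i < len(text):
        # Try the longest candidate first so "សាលារៀន" wins over "សាលា".
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in dictionary:
                concepts.append(dictionary[piece])
                i += size
                break
        else:
            i += 1  # no dictionary entry starts here; skip one character
    return concepts


def rank(query, documents):
    """Rank documents by how many query concepts they share (assumed score)."""
    query_counts = Counter(extract_concepts(query))
    scored = []
    for doc_id, text in documents.items():
        doc_counts = Counter(extract_concepts(text))
        score = sum(min(query_counts[c], doc_counts[c]) for c in query_counts)
        if score > 0:
            scored.append((doc_id, score))
    # Highest score first; ties broken by document id for a stable order.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))
```

Mapping several surface forms to one concept is what lets a query for "សាលា" match a document that only contains "សាលារៀន". An ontology-based variant would presumably replace the flat dictionary with concept relations, which this sketch does not attempt.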

