Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Siamese Hybrid Network Approach for Sentence Similarity

View through CrossRef
This paper presents a novel Siamese Hybrid Network approach, namely Siamese Bidirectional Long Short Memory with Convolutional Neural Network (SiBiLConv), for evaluating the similarity in natural language. The model integrates a Siamese neural network architecture with similarity metrics, including Manhattan Distance and Cosine Similarity, to improve the accuracy of semantic relationships measurement between sentences. Evaluations were performed on Sinhala, a complex and under-resourced language spoken in Sri Lanka, which poses unique challenges due to its morphological richness and syntactic variability. The SiBiLConv model achieved an accuracy of 89.80%, an F1 score of 0.9041, and a mean squared error (MSE) of 0.0281 with the Cosine Distance metric outperforming baseline models such as MaLSTM, which achieved an accuracy of 78.99% and an F1 score of 0.7797. While existing methods for sentence similarity primarily focus on resource-rich languages, this work addresses the pressing need for tailored approaches in low-resource language contexts, where pre-trained models and annotated datasets are often limited. The novelty lies in SiBiLConv's hybrid architecture and metric integration, specifically designed to overcome the syntactic and semantic complexities of Sinhala. This research not only bridges a critical gap in the application of sentence similarity models for low-resource languages but also establishes a framework adaptable to other morphologically rich languages, advancing the broader scope of natural language processing. Keywords: Siamese Hybrid Network, Sentences similarity, Sinhala sentence similarity, Morphologically Rich Language Processing
Title: Siamese Hybrid Network Approach for Sentence Similarity
Description:
This paper presents a novel Siamese Hybrid Network approach, namely Siamese Bidirectional Long Short Memory with Convolutional Neural Network (SiBiLConv), for evaluating the similarity in natural language.
The model integrates a Siamese neural network architecture with similarity metrics, including Manhattan Distance and Cosine Similarity, to improve the accuracy of semantic relationships measurement between sentences.
Evaluations were performed on Sinhala, a complex and under-resourced language spoken in Sri Lanka, which poses unique challenges due to its morphological richness and syntactic variability.
The SiBiLConv model achieved an accuracy of 89.
80%, an F1 score of 0.
9041, and a mean squared error (MSE) of 0.
0281 with the Cosine Distance metric outperforming baseline models such as MaLSTM, which achieved an accuracy of 78.
99% and an F1 score of 0.
7797.
While existing methods for sentence similarity primarily focus on resource-rich languages, this work addresses the pressing need for tailored approaches in low-resource language contexts, where pre-trained models and annotated datasets are often limited.
The novelty lies in SiBiLConv's hybrid architecture and metric integration, specifically designed to overcome the syntactic and semantic complexities of Sinhala.
This research not only bridges a critical gap in the application of sentence similarity models for low-resource languages but also establishes a framework adaptable to other morphologically rich languages, advancing the broader scope of natural language processing.
Keywords: Siamese Hybrid Network, Sentences similarity, Sinhala sentence similarity, Morphologically Rich Language Processing.

Related Results

Pola Fungsi Kalimat pada Novel “Pulang” Karya Tere Liye dan Kelayakannya sebagai Materi Pengayaan Siswa Kelas Xll SMA
Pola Fungsi Kalimat pada Novel “Pulang” Karya Tere Liye dan Kelayakannya sebagai Materi Pengayaan Siswa Kelas Xll SMA
Understanding sentence function patterns plays a major role in reading a novel, especially in class XII. By studying the understanding of sentence function patterns, class XII stud...
Land Cover Change Detection using M Siamese Network
Land Cover Change Detection using M Siamese Network
Land cover change detection has been a topic of active research in the remote sensing community. Due to enormous amount of data available from satellites. The land cover change det...
Hybrid-Enhanced Siamese Similarity Models in Ligand-Based Virtual Screen
Hybrid-Enhanced Siamese Similarity Models in Ligand-Based Virtual Screen
Information technology has become an integral aspect of the drug development process. The virtual screening process (VS) is a computational technique for screening chemical compoun...
Study on Electromagnetic Shielding of Infrared /Visible Optical Window
Study on Electromagnetic Shielding of Infrared /Visible Optical Window
In allusion to electromagnetic radiation damage that existed in daily life, social safety and military field, electromagnetic shielding technology of infrared and infrared optical ...
Pengaruh Konsentrasi Sukrosa terhadap Karakteristik Wine Jeruk Siam Kintamani (Citrus nobilis L.)
Pengaruh Konsentrasi Sukrosa terhadap Karakteristik Wine Jeruk Siam Kintamani (Citrus nobilis L.)
Kintamani Siamese oranges are one of Bali’s local fruits. There is a problem during harvest season, namely the price of oranges falls due to oversupply. To solve the problem, it is...
KALIMAT TANYA DALAM BAHASA INDONESIA
KALIMAT TANYA DALAM BAHASA INDONESIA
Interrogative sentence is one kind of sentences in Indonesian, which formed as proposition that required answer from hearer. It also called as requesting question. The difference w...
STRUKTUR KALIMAT TUNGGAL BAHASA KANUM SOTA THE STRUCTURE OF THE SIMPLE SENTENCE OF KANUM SOTA LANGUAGE
STRUKTUR KALIMAT TUNGGAL BAHASA KANUM SOTA THE STRUCTURE OF THE SIMPLE SENTENCE OF KANUM SOTA LANGUAGE
Abstract Kanum Sota language is spoken by speaker aroun Sota District, Merauke, Papua Province. This study uses descriptive method to describe the structure of the simple sen...

Back to Top