Siamese Hybrid Network Approach for Sentence Similarity
This paper presents a novel Siamese Hybrid Network approach, namely Siamese Bidirectional Long Short-Term Memory with Convolutional Neural Network (SiBiLConv), for evaluating sentence similarity in natural language. The model integrates a Siamese neural network architecture with similarity metrics, including Manhattan Distance and Cosine Similarity, to measure semantic relationships between sentences more accurately. Evaluations were performed on Sinhala, a complex and under-resourced language spoken in Sri Lanka, which poses unique challenges due to its morphological richness and syntactic variability. With the Cosine Distance metric, the SiBiLConv model achieved an accuracy of 89.80%, an F1 score of 0.9041, and a mean squared error (MSE) of 0.0281, outperforming baseline models such as MaLSTM, which achieved an accuracy of 78.99% and an F1 score of 0.7797. While existing methods for sentence similarity primarily focus on resource-rich languages, this work addresses the need for tailored approaches in low-resource language contexts, where pre-trained models and annotated datasets are often limited. The novelty lies in SiBiLConv's hybrid architecture and metric integration, specifically designed to handle the syntactic and semantic complexities of Sinhala. This research bridges a gap in the application of sentence similarity models to low-resource languages and establishes a framework adaptable to other morphologically rich languages.
Keywords: Siamese Hybrid Network, Sentence similarity, Sinhala sentence similarity, Morphologically Rich Language Processing
University of Sri Jayewardenepura
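The abstract names two similarity metrics (Manhattan Distance and Cosine Similarity) and three evaluation measures (accuracy, F1 score, MSE) without giving formulas. The sketch below illustrates how such quantities are commonly computed on a pair of sentence encodings. It is not the authors' implementation: the function names, the `exp(-L1)` form of the Manhattan similarity (the form used by the MaLSTM baseline), and the 0.5 decision threshold are assumptions made for illustration, and the input vectors stand in for whatever the two Siamese branches would output.

```python
import math
from typing import Sequence


def manhattan_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """exp(-L1 distance): 1.0 for identical vectors, tending to 0 as they diverge.

    Assumption: this is the MaLSTM-style form; the paper may define it differently.
    """
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return math.exp(-sum(abs(a - b) for a, b in zip(u, v)))


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine of the angle between u and v; returns 0.0 if either vector is all zeros."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def evaluate(scores: Sequence[float], labels: Sequence[int], threshold: float = 0.5) -> dict:
    """Accuracy and F1 on thresholded scores, plus MSE of raw scores against 0/1 labels.

    Assumption: the threshold of 0.5 is illustrative, not taken from the paper.
    """
    if len(scores) != len(labels) or not scores:
        raise ValueError("scores and labels must be non-empty and equal in length")
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
    fn = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 1)
    correct = sum(1 for p, y in zip(preds, labels) if p == y)
    f1_denominator = 2 * tp + fp + fn
    return {
        "accuracy": correct / len(labels),
        "f1": 2 * tp / f1_denominator if f1_denominator else 0.0,
        "mse": sum((s - y) ** 2 for s, y in zip(scores, labels)) / len(labels),
    }
```

In a Siamese setup, both sentences pass through the same encoder with shared weights, and one of the two similarity functions is then applied to the resulting pair of vectors. Scoring a labeled set of sentence pairs this way and passing the scores to `evaluate` yields figures of the kind reported in the abstract.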

