Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Vowel Phoneme Segmentation for Speaker Identification Using an ANN-Based Framework

View through CrossRef
AbstractVowel phonemes are a part of any acoustic speech signal. Vowel sounds occur in speech more frequently and with higher energy. Therefore, vowel phoneme can be used to extract different amounts of speaker discriminative information in situations where acoustic information is noise corrupted. This article presents an approach to identify a speaker using the vowel sound segmented out from words spoken by the speaker. The work uses a combined self-organizing map (SOM)- and probabilistic neural network (PNN)-based approach to segment the vowel phoneme. The segmented vowel is later used to identify the speaker of the word by matching the patterns with a learning vector quantization (LVQ)-based code book. The LVQ code book is prepared by taking features of clean vowel phonemes uttered by the male and female speakers to be identified. The proposed work formulates a framework for the design of a speaker-recognition model of the Assamese language, which is spoken by ∼3 million people in the Northeast Indian state of Assam. The experimental results show that the segmentation success rates obtained using a SOM-based technique provides an increase of at least 7% compared with the discrete wavelet transform-based technique. This increase contributes to the improvement in overall performance of speaker identification by ∼3% compared with earlier related works.
Title: Vowel Phoneme Segmentation for Speaker Identification Using an ANN-Based Framework
Description:
AbstractVowel phonemes are a part of any acoustic speech signal.
Vowel sounds occur in speech more frequently and with higher energy.
Therefore, vowel phoneme can be used to extract different amounts of speaker discriminative information in situations where acoustic information is noise corrupted.
This article presents an approach to identify a speaker using the vowel sound segmented out from words spoken by the speaker.
The work uses a combined self-organizing map (SOM)- and probabilistic neural network (PNN)-based approach to segment the vowel phoneme.
The segmented vowel is later used to identify the speaker of the word by matching the patterns with a learning vector quantization (LVQ)-based code book.
The LVQ code book is prepared by taking features of clean vowel phonemes uttered by the male and female speakers to be identified.
The proposed work formulates a framework for the design of a speaker-recognition model of the Assamese language, which is spoken by ∼3 million people in the Northeast Indian state of Assam.
The experimental results show that the segmentation success rates obtained using a SOM-based technique provides an increase of at least 7% compared with the discrete wavelet transform-based technique.
This increase contributes to the improvement in overall performance of speaker identification by ∼3% compared with earlier related works.

Related Results

SISTEM FONOLOGI BAHASA TAE (The Phonology System of Tae Language)
SISTEM FONOLOGI BAHASA TAE (The Phonology System of Tae Language)
This study aims to identify and describe qualitatively the phonological system of Tae Rongkong dialect in North Luwu Regency, South Sulawesi. The analysis was carried out on 200 Sw...
Quarantine Powers, Biodefense, and Andrew Speaker
Quarantine Powers, Biodefense, and Andrew Speaker
In January 2007, “Andrew Speaker (“Speaker”) underwent a chest X-ray and CT scan, which revealed an abnormality in his lungs.” However, tests results indicated that he did not ha...
FONOLOGI BAHASA PRANCIS
FONOLOGI BAHASA PRANCIS
Understanding phonology is the pivotal thing in learning foreign language. By understanding the target language phonology, learners will be easier to learn foreign language pronunc...
Intrinsic fundamental frequency of vowels in children with Childhood Apraxia of Speech (CAS)
Intrinsic fundamental frequency of vowels in children with Childhood Apraxia of Speech (CAS)
Background Intrinsic pitch (IF0) is an inherent property of vowels where high vowels are produced with a higher fundamental frequency than low vowels. Although well studied in adul...
Vowel harmony in Yeyi
Vowel harmony in Yeyi
Yeyi (Bantu, R41) is an endangered language spoken in northwestern Botswana and northeastern Namibia. Yeyi exhibits two peculiar processes of regressive vowel harmony. The first ch...
Analisis Morfofonemik Dialek Lamalera
Analisis Morfofonemik Dialek Lamalera
This research is entitled "Morphophonemic Analysis of Lamalera Dialect". This title was taken as research because the Lmalera dialect is the Lamaholot language and has its own iden...
AI‐enabled precise brain tumor segmentation by integrating Refinenet and contour‐constrained features in MRI images
AI‐enabled precise brain tumor segmentation by integrating Refinenet and contour‐constrained features in MRI images
AbstractBackgroundMedical image segmentation is a fundamental task in medical image analysis and has been widely applied in multiple medical fields. The latest transformer‐based de...
Multiple surface segmentation using novel deep learning and graph based methods
Multiple surface segmentation using novel deep learning and graph based methods
<p>The task of automatically segmenting 3-D surfaces representing object boundaries is important in quantitative analysis of volumetric images, which plays a vital role in nu...

Back to Top