Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

A NOVEL APPROACH FOR SINGER IDENTIFICATION AND VOCAL RANGE ESTIMATION USING HYBRID DEEP LEARNING MODEL

View through CrossRef
Automatic singer identification and estimating vocal range  of singer are essential  in music information retrieval, with significant applications in music education, digital archiving, personalized training, vocal health monitoring and assisting composers in selecting suitable singers for specific compositions. While previous research focused on singer classification using timbre and spectral features, computational approaches for vocal range estimation of singer  remain underexplored. Identification of singer and analysis of vocal range will have a great impact on commercial and academic domains.  In this research, we propose a hybrid PA (Pitch Analysis) model that employs CREPE deep learning model for pitch extraction, capturing pitch characteristics directly from vocal tracks after isolating it from instruments in a polyphonic recording. The extracted pitch values are then transformed into scalar embeddings and provided as an input to a SRT (Singer Range Transformer) model, which is used to train a Transformer encoder architecture to identify singers and extract their highest and lowest range of pitch. A customized dataset was curated to ensure diversity and robustness, consisting of twenty distinct Carnatic music compositions by the same singer. Experimental results demonstrated that the proposed model achieved high accuracy in identifying the singer and provides reliable vocal range estimations consistent with musicological expectations. This research introduces a novel computational approach to singer identification and vocal range analysis, offering valuable contributions to music education, training, archiving and vocal-health wellness.
Title: A NOVEL APPROACH FOR SINGER IDENTIFICATION AND VOCAL RANGE ESTIMATION USING HYBRID DEEP LEARNING MODEL
Description:
Automatic singer identification and estimating vocal range  of singer are essential  in music information retrieval, with significant applications in music education, digital archiving, personalized training, vocal health monitoring and assisting composers in selecting suitable singers for specific compositions.
While previous research focused on singer classification using timbre and spectral features, computational approaches for vocal range estimation of singer  remain underexplored.
Identification of singer and analysis of vocal range will have a great impact on commercial and academic domains.
  In this research, we propose a hybrid PA (Pitch Analysis) model that employs CREPE deep learning model for pitch extraction, capturing pitch characteristics directly from vocal tracks after isolating it from instruments in a polyphonic recording.
The extracted pitch values are then transformed into scalar embeddings and provided as an input to a SRT (Singer Range Transformer) model, which is used to train a Transformer encoder architecture to identify singers and extract their highest and lowest range of pitch.
A customized dataset was curated to ensure diversity and robustness, consisting of twenty distinct Carnatic music compositions by the same singer.
Experimental results demonstrated that the proposed model achieved high accuracy in identifying the singer and provides reliable vocal range estimations consistent with musicological expectations.
This research introduces a novel computational approach to singer identification and vocal range analysis, offering valuable contributions to music education, training, archiving and vocal-health wellness.

Related Results

CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
FONOLOGI BAHASA PRANCIS
FONOLOGI BAHASA PRANCIS
Understanding phonology is the pivotal thing in learning foreign language. By understanding the target language phonology, learners will be easier to learn foreign language pronunc...
Embodying Voice: Singing Verdi, singing Wagner
Embodying Voice: Singing Verdi, singing Wagner
<p>Writers from diverse disciplines have rhapsodised over the impact of the operatic voice on the listener, while musicologists such as Abbate, Duncan, and Risi have explored...
Vocal tract allometry in a mammalian vocal learner
Vocal tract allometry in a mammalian vocal learner
Abstract Acoustic allometry occurs when features of animal vocalisations can be predicted from body size measurements. Despite this being conside...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
A CLINICO - HISTOPATHOLOGICAL STUDY OF VOCAL FOLD LESIONS
A CLINICO - HISTOPATHOLOGICAL STUDY OF VOCAL FOLD LESIONS
Background: This descriptive study aimed to understand the clinical and histopathological characteristics of various vocal fold lesions, which are vital vibratory structures in the...
Avaliação da Percepção do Envelhecimento Vocal em Idosos
Avaliação da Percepção do Envelhecimento Vocal em Idosos
Resumo: Este estudo objetiva avaliar a voz de um grupo de idosos relacionando a qualidade vocal e seu grau de alteração com o impacto causado em relação à vida particular, profissi...
Vocal Cord Palsy Post Chemoradiation in Head and Neck Cancer: Challenges After Cure
Vocal Cord Palsy Post Chemoradiation in Head and Neck Cancer: Challenges After Cure
Abstract Chemoradiotherapy plays an important role in treatment of head and neck cancer. Though it enables cure, it is also associated with range of side effects. Vocal c...

Back to Top