Javascript must be enabled to continue!

A NOVEL APPROACH FOR SINGER IDENTIFICATION AND VOCAL RANGE ESTIMATION USING HYBRID DEEP LEARNING MODEL

Automatic singer identification and estimating vocal range of singer are essential in music information retrieval, with significant applications in music education, digital archiving, personalized training, vocal health monitoring and assisting composers in selecting suitable singers for specific compositions. While previous research focused on singer classification using timbre and spectral features, computational approaches for vocal range estimation of singer remain underexplored. Identification of singer and analysis of vocal range will have a great impact on commercial and academic domains. In this research, we propose a hybrid PA (Pitch Analysis) model that employs CREPE deep learning model for pitch extraction, capturing pitch characteristics directly from vocal tracks after isolating it from instruments in a polyphonic recording. The extracted pitch values are then transformed into scalar embeddings and provided as an input to a SRT (Singer Range Transformer) model, which is used to train a Transformer encoder architecture to identify singers and extract their highest and lowest range of pitch. A customized dataset was curated to ensure diversity and robustness, consisting of twenty distinct Carnatic music compositions by the same singer. Experimental results demonstrated that the proposed model achieved high accuracy in identifying the singer and provides reliable vocal range estimations consistent with musicological expectations. This research introduces a novel computational approach to singer identification and vocal range analysis, offering valuable contributions to music education, training, archiving and vocal-health wellness.

Academic Publications

Renju K

International Journal of Applied Mathematics

2025

Title: A NOVEL APPROACH FOR SINGER IDENTIFICATION AND VOCAL RANGE ESTIMATION USING HYBRID DEEP LEARNING MODEL

Description:

While previous research focused on singer classification using timbre and spectral features, computational approaches for vocal range estimation of singer remain underexplored.

Identification of singer and analysis of vocal range will have a great impact on commercial and academic domains.

In this research, we propose a hybrid PA (Pitch Analysis) model that employs CREPE deep learning model for pitch extraction, capturing pitch characteristics directly from vocal tracks after isolating it from instruments in a polyphonic recording.

The extracted pitch values are then transformed into scalar embeddings and provided as an input to a SRT (Singer Range Transformer) model, which is used to train a Transformer encoder architecture to identify singers and extract their highest and lowest range of pitch.

A customized dataset was curated to ensure diversity and robustness, consisting of twenty distinct Carnatic music compositions by the same singer.

Experimental results demonstrated that the proposed model achieved high accuracy in identifying the singer and provides reliable vocal range estimations consistent with musicological expectations.

This research introduces a novel computational approach to singer identification and vocal range analysis, offering valuable contributions to music education, training, archiving and vocal-health wellness.

Back

The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...

FONOLOGI BAHASA PRANCIS

Understanding phonology is the pivotal thing in learning foreign language. By understanding the target language phonology, learners will be easier to learn foreign language pronunc...

Embodying Voice: Singing Verdi, singing Wagner

<p>Writers from diverse disciplines have rhapsodised over the impact of the operatic voice on the listener, while musicologists such as Abbate, Duncan, and Risi have explored...

Vocal tract allometry in a mammalian vocal learner

Abstract Acoustic allometry occurs when features of animal vocalisations can be predicted from body size measurements. Despite this being conside...

Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)

BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...

A CLINICO - HISTOPATHOLOGICAL STUDY OF VOCAL FOLD LESIONS

Background: This descriptive study aimed to understand the clinical and histopathological characteristics of various vocal fold lesions, which are vital vibratory structures in the...

Avaliação da Percepção do Envelhecimento Vocal em Idosos

Resumo: Este estudo objetiva avaliar a voz de um grupo de idosos relacionando a qualidade vocal e seu grau de alteração com o impacto causado em relação à vida particular, profissi...

Vocal Cord Palsy Post Chemoradiation in Head and Neck Cancer: Challenges After Cure

Abstract Chemoradiotherapy plays an important role in treatment of head and neck cancer. Though it enables cure, it is also associated with range of side effects. Vocal c...

Email:
Password:

Email:

A NOVEL APPROACH FOR SINGER IDENTIFICATION AND VOCAL RANGE ESTIMATION USING HYBRID DEEP LEARNING MODEL

Related Results