Single sensor singer/music separation using a source/filter model of the singer voice

Separating the singer's voice from polyphonic music signals has many useful applications, such as demixing/remixing, desoloing, or audio indexing. In Benaroya's work on single-sensor blind source separation, the signals are modeled by Gaussian mixture models (GMMs) in which each state is characterized by a spectral shape. The separation itself is then performed by adaptive Wiener filtering. However, to fit general signals well, the number of states in the vocal model would have to equal the number of notes multiplied by the number of vowels (or canonical vocal tract shapes) that the singer uses. Therefore, to separate a singer's voice from background music, we propose a source/filter model for the singer signal, while keeping the models used by Benaroya for the background music signal. Assuming that only one singer is present, we separate the desired part from the rest by first estimating the sung melody using the source part of our model and then re-estimating the model parameters. This research is partly supported by the European Commission under contract FP6-027026-K-SPACE and by the French AII (Quaero project).
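As an illustration of the Wiener filtering step mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes that the power spectral densities of the voice and of the background music are already known for one time frame; in the approach described above they would instead come from the fitted models. The function name `wiener_separate`, the `eps` parameter, and the toy values are all hypothetical.

```python
def wiener_separate(mixture, voice_psd, music_psd, eps=1e-12):
    """Split one frame of complex mixture coefficients into voice and music.

    mixture   : complex spectrum of the mixture, one value per frequency bin
    voice_psd : assumed power spectral density of the voice, per bin
    music_psd : assumed power spectral density of the music, per bin
    eps       : small constant that avoids division by zero (an assumption)
    """
    voice, music = [], []
    for x, sv, sm in zip(mixture, voice_psd, music_psd):
        # Wiener gain: fraction of the bin's power attributed to the voice.
        gain = sv / (sv + sm + eps)
        voice.append(gain * x)
        # The remainder of the bin goes to the music estimate.
        music.append((1.0 - gain) * x)
    return voice, music


# Toy frame with three frequency bins (made-up values).
frame = [1 + 1j, 2 - 1j, 0.5j]
voice_psd = [4.0, 1.0, 0.0]
music_psd = [1.0, 1.0, 2.0]

voice_hat, music_hat = wiener_separate(frame, voice_psd, music_psd)
```

By construction the two estimates sum back to the mixture in every bin. The filter is adaptive in the sense that the gains change from frame to frame as the assumed spectral densities change.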

Acoustical Society of America (ASA)

Jean-Louis Durrieu, Bertrand David, Gaël Richard

The Journal of the Acoustical Society of America

2008
