
Single sensor singer/music separation using a source/filter model of the singer voice

Separating the singer's voice from polyphonic music signals has many useful applications, such as demixing/remixing, de-soloing, and audio indexing. In Benaroya's work on single-sensor blind source separation, the signals are modeled by Gaussian mixture models (GMMs) in which each state is characterized by a spectral shape. The separation itself is then performed by adaptive Wiener filtering. However, to fit general signals well, the number of states in the vocal model should equal the number of notes multiplied by the number of vowels (or canonical vocal tract shapes) that the singer uses. To separate a singer's voice from the background music, we therefore propose a source/filter model for the singer signal, while keeping Benaroya's models for the background music signal. Assuming that only one singer is present, we separate the voice from the rest by first estimating the sung melody using the source part of our model and then re-estimating the model parameters. This research is partly supported by the European Commission under contract FP6-027026-K-SPACE and by the French AII (Quaero project).
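As a rough illustration of two ideas mentioned in the abstract, the NumPy sketch below builds one vocal spectral shape per (note, vocal tract shape) pair as the product of a source spectrum and a filter response, then applies a Wiener-style gain to a single mixture frame. It is not the authors' implementation: the function names, the toy spectra, and the assumption that the voice and music power spectra are already known for the frame are all hypothetical, and the GMM state selection and melody estimation steps are left out.

```python
import numpy as np

def source_filter_shapes(source_psds, filter_responses):
    """Combine every source (note) spectrum with every filter (vocal tract) response.

    source_psds:      (n_notes, n_freqs) power spectra, one per note
    filter_responses: (n_filters, n_freqs) squared-magnitude responses
    Returns an array of shape (n_notes, n_filters, n_freqs).
    """
    return source_psds[:, None, :] * filter_responses[None, :, :]

def wiener_separate(mix_spectrum, voice_psd, music_psd, eps=1e-12):
    """Split one mixture frame using the gain voice_psd / (voice_psd + music_psd)."""
    gain = voice_psd / (voice_psd + music_psd + eps)  # eps avoids division by zero
    voice = gain * mix_spectrum
    music = mix_spectrum - voice  # the two estimates sum back to the mixture
    return voice, music

# Toy data (assumed for illustration): 2 notes, 2 vocal tract shapes, 3 frequency bins.
source_psds = np.array([[4.0, 1.0, 0.0],
                        [0.0, 2.0, 2.0]])
filter_responses = np.array([[1.0, 0.5, 0.25],
                             [0.25, 0.5, 1.0]])
shapes = source_filter_shapes(source_psds, filter_responses)

# Pretend the first note and first vocal tract shape are active in this frame.
voice_psd = shapes[0, 0]
music_psd = np.array([4.0, 1.5, 1.0])
mix = np.array([2 + 2j, 4 + 0j, 0 + 1j])

voice, music = wiener_separate(mix, voice_psd, music_psd)
```

The factorization is the point of the sketch: the 2 x 2 = 4 spectral shapes come from only 2 source spectra and 2 filter responses, which is how a source/filter model can avoid storing one state per (note, vowel) pair. The gain is "adaptive" in the sense that the power spectra, and so the gain, change from frame to frame.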

Related Results

Dynamic stochastic modeling for inertial sensors
It is widely known that error models for inertial sensors have two components: the first is a deterministic component that is normally calibrated by the manufac...
Music and Mysticism
The word “mystic” has a common meaning in philosophical traditions like neo-Platonism and religions (Hindu, Jewish, Christian, and Muslim)—namely the elevation of a human being to ...
Owner Bound Music: A study of popular sheet music selling and music making in the New Zealand home 1840-1940
From 1840, when New Zealand became part of the British Empire, until 1940 when the nation celebrated its Centennial, the piano was the most dominant instrument in domestic...
Speech, communication, and neuroimaging in Parkinson's disease : characterisation and intervention outcomes
Most individuals with Parkinson's disease (PD) experience changes in speech, voice or communication. Speech changes often manifest as hypokinetic dysarthria, a m...
Embodying Voice: Singing Verdi, singing Wagner
Writers from diverse disciplines have rhapsodised over the impact of the operatic voice on the listener, while musicologists such as Abbate, Duncan, and Risi have explored...
If I Had Possession over Judgment Day: Augmenting Robert Johnson
augment vb [ɔːgˈmɛnt] 1. to make or become greater in number, amount, strength, etc.; increase 2. Music: to increase (a major or perfect interval) by a semitone (Collins English Dicti...
