Research Of Singing With Note-name Recognition Algorithm Based On End-to-End Model
With the rapid development of artificial intelligence, people are no longer seeking only to meet material needs; they are also pursuing happiness and spiritual fulfilment. Singing talent shows in particular have become popular, but many of their results are controversial. To address the problem of evaluating singing with note names, and to assess a singer's level in professional solfeggio and ear-training education, this paper uses artificial intelligence algorithms to recognize the sung note names. First, the singing audio is preprocessed with pre-emphasis, framing, and windowing to reduce the impact of noise, and the acoustic features of the singing sound are extracted. These features are organized into a form suitable for DCNN input, and a DCNN model is designed to obtain the pinyin form of the singing sound. The softmax layer adopts an end-to-end connectionist temporal classification (CTC) structure, which classifies the input singing speech, finds the corresponding phoneme sequences, and produces the output. The same CTC structure is used to optimize the recognition process, and the state of the obtained features is then output. Next, a singing pronunciation dictionary generates candidate word sequences based on the mapping between phonemes and notes. Finally, a decoder uses the acoustic model score to select the candidate note sequence with the highest score, which gives the note-name result for the singing sound. To further improve recognition performance, the paper introduces a new CTC-DCNN acoustic model. In this model, residual blocks pass input features to the output through shortcut connections, so that the multi-layer convolutional speech features are preserved as much as possible. The deep structure also improves the extraction and analysis of speech features. The improved CTC-DCNN acoustic model is obtained by optimizing the maxout function.
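The preprocessing stage named in the abstract (pre-emphasis, framing, and windowing) can be sketched as follows. This is a minimal illustration and not the paper's implementation: the pre-emphasis coefficient of 0.97, the 400-sample frame, the 160-sample hop, the Hamming window, and the 16 kHz test tone are all assumptions chosen for the example, since the abstract does not give these values.

```python
import math


def pre_emphasis(signal, alpha=0.97):
    """Apply y[n] = x[n] - alpha * x[n-1]; the first sample is kept unchanged.

    alpha=0.97 is an assumed, commonly used value.
    """
    if not signal:
        return []
    return [signal[0]] + [
        signal[n] - alpha * signal[n - 1] for n in range(1, len(signal))
    ]


def frame_signal(signal, frame_len, hop_len):
    """Split the signal into overlapping frames, dropping an incomplete tail."""
    if len(signal) < frame_len:
        return []
    n_frames = 1 + (len(signal) - frame_len) // hop_len
    return [signal[i * hop_len : i * hop_len + frame_len] for i in range(n_frames)]


def hamming_window(frame_len):
    """Return Hamming window coefficients (window type is an assumption)."""
    if frame_len == 1:
        return [1.0]
    return [
        0.54 - 0.46 * math.cos(2 * math.pi * n / (frame_len - 1))
        for n in range(frame_len)
    ]


def preprocess(signal, frame_len=400, hop_len=160, alpha=0.97):
    """Pre-emphasize, frame, and window a mono signal given as a list of floats."""
    emphasized = pre_emphasis(signal, alpha)
    window = hamming_window(frame_len)
    return [
        [sample * weight for sample, weight in zip(frame, window)]
        for frame in frame_signal(emphasized, frame_len, hop_len)
    ]


# Hypothetical input: one second of a 440 Hz tone sampled at 16 kHz.
sample_rate = 16000
tone = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]
frames = preprocess(tone)
```

Each windowed frame would then be passed to a feature extractor before reaching the acoustic model; the abstract does not say which acoustic features are used, so that step is left out here.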
The proposed algorithm obtains its information in a fair and equal manner, and no human takes part in any stage of the scoring process. The scoring results obtained in this way are therefore expected to be more objective.
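The CTC classification and dictionary lookup described in the abstract can be illustrated with a small sketch. It uses greedy (best-path) CTC decoding, which is a simplification and not necessarily the decoder used in the paper. The label set, the blank index, the per-frame probabilities, and the syllable-to-note dictionary are all hypothetical values invented for the example.

```python
BLANK = 0  # index reserved for the CTC blank label (assumption)

# Hypothetical label inventory; the paper's actual pinyin labels are not given.
LABELS = ["<blank>", "do", "re", "mi"]

# Hypothetical pronunciation dictionary mapping sung syllables to note names.
NOTE_DICT = {"do": "C", "re": "D", "mi": "E"}


def ctc_greedy_decode(frame_probs, blank=BLANK):
    """Pick the most likely label per frame, merge repeats, then drop blanks."""
    best_path = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    decoded = []
    previous = None
    for label in best_path:
        # A label is emitted only when it differs from the previous frame's
        # label and is not the blank symbol.
        if label != previous and label != blank:
            decoded.append(label)
        previous = label
    return decoded


def to_note_names(label_ids):
    """Map decoded label indices to note names through the dictionary."""
    return [NOTE_DICT[LABELS[i]] for i in label_ids]


# Made-up softmax outputs for eight frames over the four labels above.
frame_probs = [
    [0.10, 0.70, 0.10, 0.10],  # do
    [0.10, 0.60, 0.20, 0.10],  # do (repeat, merged)
    [0.80, 0.10, 0.05, 0.05],  # blank separates two sung "do"s
    [0.10, 0.70, 0.10, 0.10],  # do
    [0.10, 0.10, 0.70, 0.10],  # re
    [0.10, 0.10, 0.60, 0.20],  # re (repeat, merged)
    [0.70, 0.10, 0.10, 0.10],  # blank
    [0.05, 0.05, 0.10, 0.80],  # mi
]

label_ids = ctc_greedy_decode(frame_probs)
notes = to_note_names(label_ids)
```

The blank frame between the two runs of "do" is what lets CTC distinguish a repeated note from one sustained note. A decoder that scores several candidate note sequences, as the abstract describes, would replace the per-frame argmax with a search over candidates.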
Science Research Society