Javascript must be enabled to continue!

Research Of Singing With Note-name Recognition Algorithm Based On End-to-End Model

With the rapid development of artificial intelligence, and people are not only seeking material needs, but also pursuing happiness and spiritual aspirations. Especially some singing talent shows appear, but there are many controversial results. In order to solve the problem of singing with note-name evaluation and evaluate the singer's singing level in the professional music education of solfeggio and ear training, the paper uses artificial intelligence algorithms to recognize the note-name. Firstly, preprocess the singing sound data, including pre emphasis, framing, and windowing to reduce the impact of noise, and extract the acoustic features of the singing sound, the acoustic speech features are organized into a form suitable for DCNN input and a DCNN model is designed to obtain the pinyin form of the singing sound, the softmax layer adopts an end-to-end CTC structure, which classifies the input singing speech, finds corresponding phoneme sequences, and obtains the output results. The end-to-end CTC structure is used to classify and optimize the recognition process, and finally the state of the obtained features is output. And then singing pronunciation dictionary generates candidate word sequences based on the mapping relationship between phonemes and notes. Finally, based on the acoustic model score, the candidate note sequence with the highest score is obtained through decoder processing, and obtained the singing with note-name result of the singing sound. In order to further improve recognition performance, the paper introduces a new CTC-DCNN acoustic model. In this model, residual blocks can transfer input features to the output part through shortcuts, allowing the multi-layer convolutional speech features to be preserved as much as possible. At the same time, the deep structure can also better achieve the extraction and analysis of speech features. A new and improved CTC-DCNN acoustic model is obtained by optimizing the maxout function. The algorithm proposed in this paper is fair and equal in obtaining information, and no one participates in the entire process of obtaining scores. It is believed that the scoring results obtained in this way should be more objective.

Science Research Society

Bintao Deng, Shimin Wang, Xiaodan Ren, Haiyang Ni, Ning Wang, Ming Zhao

Journal of Electrical Systems

2024

Title: Research Of Singing With Note-name Recognition Algorithm Based On End-to-End Model

Description:

With the rapid development of artificial intelligence, and people are not only seeking material needs, but also pursuing happiness and spiritual aspirations.

Especially some singing talent shows appear, but there are many controversial results.

In order to solve the problem of singing with note-name evaluation and evaluate the singer's singing level in the professional music education of solfeggio and ear training, the paper uses artificial intelligence algorithms to recognize the note-name.

Firstly, preprocess the singing sound data, including pre emphasis, framing, and windowing to reduce the impact of noise, and extract the acoustic features of the singing sound, the acoustic speech features are organized into a form suitable for DCNN input and a DCNN model is designed to obtain the pinyin form of the singing sound, the softmax layer adopts an end-to-end CTC structure, which classifies the input singing speech, finds corresponding phoneme sequences, and obtains the output results.

The end-to-end CTC structure is used to classify and optimize the recognition process, and finally the state of the obtained features is output.

And then singing pronunciation dictionary generates candidate word sequences based on the mapping relationship between phonemes and notes.

Finally, based on the acoustic model score, the candidate note sequence with the highest score is obtained through decoder processing, and obtained the singing with note-name result of the singing sound.

In order to further improve recognition performance, the paper introduces a new CTC-DCNN acoustic model.

In this model, residual blocks can transfer input features to the output part through shortcuts, allowing the multi-layer convolutional speech features to be preserved as much as possible.

At the same time, the deep structure can also better achieve the extraction and analysis of speech features.

A new and improved CTC-DCNN acoustic model is obtained by optimizing the maxout function.

The algorithm proposed in this paper is fair and equal in obtaining information, and no one participates in the entire process of obtaining scores.

It is believed that the scoring results obtained in this way should be more objective.

Back

Related Results

Ary Scheffer, een Nederlandse Fransman

AbstractAry Scheffer (1795-1858) is so generally included in the French School (Note 2)- unsurprisingly, since his career was confined almost entirely to Paris - that the fact that...

Singing voice range profile: New objective evaluation methods for voice change after thyroidectomy

AbstractBackgroundAfter surgery in the thyroid region, patients may present with phonation or singing difficulty, even within their vocal range. We designed a novel voice evaluatio...

Co-singing in Families Living with Dementia1

The incidence of dementia is increasing rapidly, and a growing number of persons with dementia live in their private homes, even in stages of severe dementia. Therefore, persons wi...

Pieter Saenredam: zijn boekenbezit en zijn relatie met de landmeter Pieter Wils

AbstractAn earlier article on Saenredam's construction drawings (Note, 1 ) left open the question of how he obtained his knowledge of perspective. His teacher Frans de Grebber (Not...

Singing and Well-Being Indicators

Previous studies have indicated that there are positive effects of music and singing on well-being in adults. The aim of our study was to examine the associations between singing c...

Sang og syngning i skolen

Songs and singing in school: Danish school singing between tangible and intangible cultural heritage For centuries, singing has played a vital role in elementary schools in Denmar...

Een serie tekeningen van Johannes Stradanus met scènes uit het leven van de Heilige Giovanni Gualberto

AbstractAmong the extensive collection of pen sketches by Johannes Stradanus (Bruges 1523-Florence 1605) in the Cooper-Hewitt Museum of Design and the Pierpont Morgan Library in Ne...

Recurrence dynamics and nonlinear system analysis of choral singing

Abstract This study investigates the interplay of cardiac, respiratory, and vocal activity during choral singing using recurrence quantification analysis (RQA) to c...

Email:
Password:

Email: