Javascript must be enabled to continue!
Research on End-to-end Voiceprint Recognition Model Based on Convolutional Neural Network
View through CrossRef
Speech signal is a time-varying signal, which is greatly affected by individual and environment. In order to improve the end-to-end voice print recognition rate, it is necessary to preprocess the original speech signal to some extent. An end-to-end voiceprint recognition algorithm based on convolutional neural network is proposed. In this algorithm, the convolution and down-sampling of convolutional neural network are used to preprocess the speech signals in end-to-end voiceprint recognition. The one-dimensional and two-dimensional convolution operations were established to extract the characteristic parameters of Meier frequency cepstrum coefficient from the preprocessed signals, and the classical universal background model was used to model the recognition model of voice print. In this study, the principle of end-to-end voiceprint recognition was firstly analyzed, and the process of end-to-end voice print recognition, end-to-end voice print recognition features and Res-FD-CNN network structure were studied. Then the convolutional neural network recognition model was constructed, and the data were preprocessed to form the convolutional layer in frequency domain and the algorithm was tested.
Title: Research on End-to-end Voiceprint Recognition Model Based on Convolutional Neural Network
Description:
Speech signal is a time-varying signal, which is greatly affected by individual and environment.
In order to improve the end-to-end voice print recognition rate, it is necessary to preprocess the original speech signal to some extent.
An end-to-end voiceprint recognition algorithm based on convolutional neural network is proposed.
In this algorithm, the convolution and down-sampling of convolutional neural network are used to preprocess the speech signals in end-to-end voiceprint recognition.
The one-dimensional and two-dimensional convolution operations were established to extract the characteristic parameters of Meier frequency cepstrum coefficient from the preprocessed signals, and the classical universal background model was used to model the recognition model of voice print.
In this study, the principle of end-to-end voiceprint recognition was firstly analyzed, and the process of end-to-end voice print recognition, end-to-end voice print recognition features and Res-FD-CNN network structure were studied.
Then the convolutional neural network recognition model was constructed, and the data were preprocessed to form the convolutional layer in frequency domain and the algorithm was tested.
Related Results
Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
The convolutional neural network (CNN) has made great strides in the area of voiceprint recognition; but it needs a huge number of data samples to train a deep neural network. In p...
Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
Novel Fault Diagnosis Method for Rolling Bearing Based on Voiceprint Recognition With FasterNet‐CAM
Novel Fault Diagnosis Method for Rolling Bearing Based on Voiceprint Recognition With FasterNet‐CAM
ABSTRACT
Contact measuring tools are not suitable in some specific working environments, such as high temperature or chemical metallurgical equipment, when non‐co...
Voiceprint recognition based on BP Neural Network and CNN
Voiceprint recognition based on BP Neural Network and CNN
Abstract
At present, speech recognition has become a key technology of human-computer interaction, which can be used in semantic recognition and speaker identificati...
Depth-aware salient object segmentation
Depth-aware salient object segmentation
Object segmentation is an important task which is widely employed in many computer vision applications such as object detection, tracking, recognition, and ret...
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
Multimodal Emotion Recognition and Human Computer Interaction for AI-Driven Mental Health Support (Preprint)
BACKGROUND
Mental health has become one of the most urgent global health issues of the twenty-first century. The World Health Organization (WHO) reports tha...
Method for Constructing Neural Network Means for Recognizing Scenes of Political Extremism in Graphic Materials of Online Social Networks
Method for Constructing Neural Network Means for Recognizing Scenes of Political Extremism in Graphic Materials of Online Social Networks
Countering the spread of calls for political extremism through graphic content on online social networks is becoming an increasingly pressing problem that requires the development ...
Analog Convolutional Operator Circuit for Low-Power Mixed-Signal CNN Processing Chip
Analog Convolutional Operator Circuit for Low-Power Mixed-Signal CNN Processing Chip
In this paper, we propose a compact and low-power mixed-signal approach to implementing convolutional operators that are often responsible for most of the chip area and power consu...

