Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Spectral estimation of speech by mel‐generalized cepstral analysis

View through CrossRef
AbstractThis paper proposes a spectral estimation method of speech based on mel‐generalized cepstral representation. The spectral estimation based on generalized cepstral representation is a method that unifies the cepstral method and the linear prediction method. By this method, the spectral model can be varied continuously from the all‐pole model to the exponential model.This paper also discusses the following: the method of spectral estimation, where the mel‐generalized cepstrum is used instead of the generalized cepstrum; the uniqueness of the solution for the minimization problem; the solution method; and the convergence of the solution. The proposed method has the same features as those of the method based on the generalized cepstrum where the spectral model can be varied continuously and the stability of the obtained transfer function is guaranteed. Another feature is that characteristics similar to the human auditory sensation, i.e., the high resolution in the low‐frequency range and the coarse resolution in the high‐frequency range, are obtained since the mel‐generalized cepstrum is used as the parameter, which is the generalized cepstrum defined on the mel‐frequency axis. Finally, the features and the effectiveness of the proposed method is demonstrated by analysis examples of the synthesized signals and natural speech.
Title: Spectral estimation of speech by mel‐generalized cepstral analysis
Description:
AbstractThis paper proposes a spectral estimation method of speech based on mel‐generalized cepstral representation.
The spectral estimation based on generalized cepstral representation is a method that unifies the cepstral method and the linear prediction method.
By this method, the spectral model can be varied continuously from the all‐pole model to the exponential model.
This paper also discusses the following: the method of spectral estimation, where the mel‐generalized cepstrum is used instead of the generalized cepstrum; the uniqueness of the solution for the minimization problem; the solution method; and the convergence of the solution.
The proposed method has the same features as those of the method based on the generalized cepstrum where the spectral model can be varied continuously and the stability of the obtained transfer function is guaranteed.
Another feature is that characteristics similar to the human auditory sensation, i.
e.
, the high resolution in the low‐frequency range and the coarse resolution in the high‐frequency range, are obtained since the mel‐generalized cepstrum is used as the parameter, which is the generalized cepstrum defined on the mel‐frequency axis.
Finally, the features and the effectiveness of the proposed method is demonstrated by analysis examples of the synthesized signals and natural speech.

Related Results

Melatonin receptor genes (mel‐1a, mel‐1b, mel‐1c) are differentially expressed in the avian germ line
Melatonin receptor genes (mel‐1a, mel‐1b, mel‐1c) are differentially expressed in the avian germ line
AbstractThe presence of melatonin receptor transcripts (mel‐1a, mel‐1b and mel‐1c) was investigated in primordial germ cells (PGCs), immature and mature oocytes, and sperm of Japan...
Adaptive cepstral analysis—adaptive filtering based on cepstral representation
Adaptive cepstral analysis—adaptive filtering based on cepstral representation
AbstractThe unbiased estimation of the log spectrum is a method which is a stricter formulation of the cepstral method from the viewpoint of the spectral estimation. In the unbiase...
Expression of melatonin receptor transcripts (mel-1a, mel-1b and mel-1c) in Japanese quail oocytes and eggs
Expression of melatonin receptor transcripts (mel-1a, mel-1b and mel-1c) in Japanese quail oocytes and eggs
Cloning and sequencing of the cDNA for avian melatonin (MEL) receptors have made it possible to investigate the expression of these receptors in different animal tissues and organs...
Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recogntion
Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recogntion
Abstract The performance of speaker recognition is very well in a clean dataset or without mismatch between training and test set. However, the performance is degraded with...
Fusion of Cochleogram and Mel Spectrogram Features for Deep Learning Based Speaker Recognition
Fusion of Cochleogram and Mel Spectrogram Features for Deep Learning Based Speaker Recognition
Abstract Speaker recognition has crucial application in forensic science, financial areas, access control, surveillance and law enforcement. The performance of speaker reco...
Acoustic and Linguistic Features of Impromptu Speech and Their Association With Anxiety: Validation Study (Preprint)
Acoustic and Linguistic Features of Impromptu Speech and Their Association With Anxiety: Validation Study (Preprint)
BACKGROUND The measurement and monitoring of generalized anxiety disorder requires frequent interaction with psychiatrists or psychologists. Access to menta...
Cepstral analysis with nonuniform spectral weighting for spectral envelope extraction
Cepstral analysis with nonuniform spectral weighting for spectral envelope extraction
AbstractThis paper proposes a method for extraction of the spectral envelope by a spectral evaluation criterion with nonuniform weighting on a log magnitude scale. In the tradition...

Back to Top