改进共振峰提取的语音端点检测

Improved Speech Endpoint Detection Based on Formant

摘要: 语音端点检测是语音信号预处理的重要一步,其准确度对语音合成和语音识别系统的性能起着决定性的作用.根据共振峰谐波能量特征,提出一种采用图像处理技术处理语谱图的语音端点检测算法.首先去除了语谱图中的周期性干扰,然后进行滤噪与分割,最后利用高斯一阶差分滤波器提取共振峰和获取语音端点.实验结果表明,在不同信噪比的白噪声和多种突发性噪声环境下,与其他算法相比,该算法效果更好.

Abstract: Speech endpoint detection is an important step in speech preprocessing.Its accuracy has an important impact on the speech synthesis and recognition systems.Based on the feature of formant-consonance energy,a scheme of speech endpoint detection is presented by taking advantage of image processing technology.This algorithm firstly gets rid of the periodic interference.Then it segments the spectrogram image after de-noising.Finally it derives the feature of formant and speech endpoint by using first order Gaussian differential filter.The experimental results indicate that it can get good quantitative results in the experiments within the environment of different SNR white noise and outburst noise.