Computer Engineering and Applications, 2012, Vol. 48, Issue 34: 144-147, 4. DOI: 10.3778/j.issn.1002-8331.1206-0119
Speech endpoint detection based on frequency domain and time domain analyses
Wang Kunchi 1, Yuan Yan 1, Wang Jianqiang 1, Zhang Yusheng 1, Yang Yongjie 1
Author information
- 1. School of Electronics and Information, Nantong University, Nantong, Jiangsu 226019, China
Abstract
Voice activity is first detected in the frequency domain from the spectral energy of the harmonics of the fundamental. Because harmonics appear only in the spectra of musical (periodic) tones, this criterion rejects many kinds of noise, which makes the detection sensitive and accurate. In the time domain, each pitch period is then located with a cross-correlation function, using the voice-activity interval and the fundamental frequency obtained from the frequency-domain detection, so the boundaries of voiced sounds are found precisely. A second-order difference enhances the high-frequency components of the signal, and the cross-correlation function is used to track the energy of unvoiced sounds. Experiments show that the algorithm is reliable and accurate.
Keywords
harmonic / cross-correlation function / Teager energy operator
Classification
Information technology and security science
Cite this article
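Wang Kunchi, Yuan Yan, Wang Jianqiang, Zhang Yusheng, Yang Yongjie. Speech endpoint detection based on frequency domain and time domain analyses [J]. Computer Engineering and Applications, 2012, 48(34): 144-147, 4.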
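Illustrative sketch
The abstract names two time-domain tools: a cross-correlation function for locating pitch periods and a second-order difference for emphasizing high frequencies. The record gives no implementation details, so the following is only a generic sketch of those two textbook operations, not the authors' algorithm. The function names, the 8 kHz sampling rate, the synthetic test frame, and the lag search range are all assumptions made for illustration.

```python
import math

def second_difference(x):
    """Second-order difference y[n] = x[n+1] - 2*x[n] + x[n-1].

    Its gain rises with frequency, so it emphasizes high-frequency content.
    """
    return [x[n + 1] - 2.0 * x[n] + x[n - 1] for n in range(1, len(x) - 1)]

def normalized_xcorr(x, lag):
    """Normalized cross-correlation of x with itself shifted by `lag` (lag >= 1)."""
    a, b = x[:-lag], x[lag:]
    num = sum(p * q for p, q in zip(a, b))
    den = math.sqrt(sum(p * p for p in a) * sum(q * q for q in b))
    return num / den if den > 0.0 else 0.0

def estimate_pitch_lag(x, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the highest correlation score."""
    return max(range(min_lag, max_lag + 1), key=lambda lag: normalized_xcorr(x, lag))

# Hypothetical voiced frame: 200 Hz fundamental plus two harmonics at 8 kHz.
FS = 8000
frame = [
    math.sin(2 * math.pi * 200 * n / FS)
    + 0.5 * math.sin(2 * math.pi * 400 * n / FS)
    + 0.25 * math.sin(2 * math.pi * 600 * n / FS)
    for n in range(400)
]

lag = estimate_pitch_lag(frame, 20, 70)  # search roughly 114 Hz to 400 Hz
f0 = FS / lag                            # pitch estimate in Hz
```

In this sketch the search range is deliberately narrower than two pitch periods. A perfectly periodic frame correlates equally well at every multiple of its period, so a wider range would need an extra rule for choosing the shortest matching lag. In the abstract's scheme, the fundamental frequency obtained in the frequency domain could supply that constraint.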