华中科技大学学报(自然科学版)Issue(11):78-82,5.DOI:10.13245/j.hust.151115
基于子带保留似然比的鲁棒语音激活检测算法
Sub-band reserved likelihood ratio-based robust voice activity detection
摘要
Abstract
To improve voice activity detection (VAD) accuracy in low signal noise ratio environments , a robust VAD algorithm based on sub‐band reserved likelihood ratios (LRs) was proposed ,aiming at alleviating the false alarm problem in detection of non‐speech signal of the statistical model likelihood ratios test (LRT )‐based method .Reserved factor was employed in likelihood ratio decision rule and determined by speech feature strength of sub‐bands ,which were divided global non‐uniformly and lo‐cal uniformly on the basis of human auditory sensing charateristic .The feature was extracted from sub‐band signal correspond to the frequency range ,in which the LRs exceeded certain threshold .The reserved LRs of frequency component were used for final decision .Experiment conducted on various noisy scenarios shows its better performance in comparison with LRT ,MO‐LRT (multiple observa‐tion likelihood ratio tests) ,etc .The clipping rate is lower and the false alarm problem caused by the virtual height of LRs in detection of non‐speech is alleviated ,and the VAD accuracy is increased by an average of 2% ~14% .关键词
语音处理/语音激活检测/统计模型/似然比/低信噪比Key words
speech processing/voice activity detector/statistical model/likelihood ratio/low signal noise ratio (SNR)分类
信息技术与安全科学引用本文复制引用
何伟俊,贺前华,刘杨..基于子带保留似然比的鲁棒语音激活检测算法[J].华中科技大学学报(自然科学版),2015,(11):78-82,5.基金项目
国家自然科学基金资助项目(61571192);广州市科技条件专项基金资助项目(2060503). ()