Computer Applications and Software (计算机应用与软件), Issue (4): 133-136, 159, 5. DOI: 10.3969/j.issn.1000-386x.2015.04.032
Spontaneous Oral Speaking Audio Segmentation Algorithm Based on Adaptive Threshold and Pitch Detection
Abstract
We present an adaptive method for computing the audio energy threshold, intended to remove the interference of silent and noisy segments in spontaneous oral speaking audio and thereby improve the speech recognition rate and decoding efficiency. Targeting real-time automatic oral speaking evaluation, we design an energy threshold adaptive coefficient. Based on this coefficient, the method dynamically computes and assigns an energy threshold to each individual examination recording of every examinee, avoiding the detection errors caused by threshold selection and hard threshold decisions. A pitch detection step follows the adaptive-threshold segmentation to judge whether each resulting segment is noise, so that the pure speech components are finally separated. Experimental results show that the proposed algorithm segments audio effectively and is quite robust.
Key words
Spontaneous oral speaking evaluation / Adaptivity / Audio segmentation / Pitch detection
Classification
Information technology and security science
Cite this article
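Liao Wei (廖伟), Yuan Zongheng (袁纵横). Spontaneous oral speaking audio segmentation algorithm based on adaptive threshold and pitch detection [J]. Computer Applications and Software, 2015, (4): 133-136, 159, 5.
Funding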
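Joint Science and Technology Fund of the Guizhou Provincial Department of Science and Technology and Guizhou Institute for Nationalities (贵州民族学院) (黔科合J 字 LKM[2011]10号); Guizhou Provincial Department of Science and Technology project ()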
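
Illustrative sketch
The abstract describes a two-stage pipeline: segmentation with an energy threshold adapted to each recording, followed by pitch detection to reject segments that are only noise. The abstract does not give the formula for the energy threshold adaptive coefficient, so the sketch below substitutes a simple assumed rule (noise floor plus a fraction of the recording's energy range) and a plain normalized-autocorrelation pitch check. All function names, the parameters coeff, fmin, fmax and min_corr, and the synthetic test signal are hypothetical and are not taken from the paper.

```python
import math
import random

def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]

def adaptive_threshold(energies, coeff=0.1):
    """ASSUMED rule, not the paper's: floor + coeff * (max - floor),
    computed separately for each recording."""
    floor = min(energies)
    return floor + coeff * (max(energies) - floor)

def segment(energies, threshold):
    """Return [start, end) frame-index pairs whose energy exceeds threshold."""
    segments, start = [], None
    for i, e in enumerate(energies):
        if e > threshold and start is None:
            start = i
        elif e <= threshold and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(energies)))
    return segments

def has_pitch(frame, sample_rate, fmin=80.0, fmax=400.0, min_corr=0.5):
    """True if the normalized autocorrelation has a strong peak at a lag
    corresponding to a fundamental between fmin and fmax (assumed range)."""
    n = len(frame)
    mean = sum(frame) / n
    x = [v - mean for v in frame]
    r0 = sum(v * v for v in x)
    lo = int(sample_rate / fmax)
    hi = min(int(sample_rate / fmin), n - 1)
    if r0 == 0 or lo > hi:
        return False
    best = max(
        sum(x[i] * x[i + lag] for i in range(n - lag)) / r0
        for lag in range(lo, hi + 1)
    )
    return best >= min_corr

def speech_segments(samples, sample_rate, frame_len=200, coeff=0.1):
    """Energy segmentation, then keep segments where at least half
    of the frames show a pitch."""
    energies = frame_energies(samples, frame_len)
    threshold = adaptive_threshold(energies, coeff)
    kept = []
    for start, end in segment(energies, threshold):
        frames = [samples[i * frame_len:(i + 1) * frame_len]
                  for i in range(start, end)]
        voiced = sum(has_pitch(f, sample_rate) for f in frames)
        if voiced * 2 >= len(frames):
            kept.append((start, end))
    return kept

def demo_signal(sample_rate=8000):
    """Synthetic test input: quiet, 200 Hz tone, quiet, noise burst, quiet
    (half a second each). The tone stands in for voiced speech."""
    rng = random.Random(0)
    n = sample_rate // 2
    def quiet():
        return [rng.uniform(-0.001, 0.001) for _ in range(n)]
    tone = [0.5 * math.sin(2 * math.pi * 200 * i / sample_rate) for i in range(n)]
    burst = [rng.uniform(-0.5, 0.5) for _ in range(n)]
    return quiet() + tone + quiet() + burst + quiet()
```

On the synthetic signal, speech_segments(demo_signal(), 8000) returns [(20, 40)]: the energy stage finds both the tone (frames 20 to 40) and the noise burst (frames 60 to 80), and the pitch stage discards the burst because it has no periodic structure. A real system would need overlapping windowed frames and a more careful pitch tracker; this only shows how the two stages fit together.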