数据采集与处理2019,Vol.34Issue(2):281-287,7.DOI:10.16337/j.1004-9037.2019.02.010
电视剧语音识别中的半监督自动语音分割算法
Semi-supervised Automatic Speech Segmentation for TV-drama Speech Recognition
摘要
Abstract
To deal with the speech segmentation of TV-drama which has large coherent text transcriptions but no time-stamps, an automatic semi-supervised speech segmentation algorithm is proposed in the paper.Firstly, the original text transcriptions are used to build a biased language model, then the model is applied to the TV-drama speech recognition in a semi-supervised way, and finally, the resulting automatic speech decoding hypothesis are well combined with the traditional segmentation methods to improve the performances of speech segmentation. These traditional methods are usually based on the distance metric, model classification and the phone recognizers. Experimental results on the British TV-drama"Doctor Who"database demonstrate that, the proposed approach can achieve significant performance improvement over traditional baseline algorithms. Meanwhile, the proposed approach allows high quality segmentation and the associated transcription alignments for the large coherent TV-drama speech recordings.关键词
语音识别/半监督/语音标注Key words
speech recognition/semi-supervised/speech transcription分类
信息技术与安全科学引用本文复制引用
龙艳花,茅红伟,叶宏..电视剧语音识别中的半监督自动语音分割算法[J].数据采集与处理,2019,34(2):281-287,7.基金项目
上海市青年科技英才扬帆计划(14YF1409300)资助项目 (14YF1409300)
国家自然科学基金(61701306)资助项目 (61701306)