Semi-supervised Automatic Speech Segmentation for TV-drama Speech Recognition

Long Yanhua (龙艳花), Mao Hongwei (茅红伟), Ye Hong (叶宏)

Journal of Data Acquisition and Processing (数据采集与处理), 2019, Vol. 34, Issue 2: 281-287 (7 pages). DOI: 10.16337/j.1004-9037.2019.02.010

Semi-supervised Automatic Speech Segmentation for TV-drama Speech Recognition

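Long Yanhua¹, Mao Hongwei¹, Ye Hong¹

Author information

1. College of Information, Mechanical and Electrical Engineering, Shanghai Normal University, Shanghai 200234, China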

Abstract

To segment TV-drama speech that comes with large, coherent text transcriptions but no time stamps, this paper proposes an automatic semi-supervised speech segmentation algorithm. First, the original text transcriptions are used to build a biased language model. The model is then applied to TV-drama speech recognition in a semi-supervised way. Finally, the resulting automatic decoding hypotheses are combined with traditional segmentation methods, which are usually based on distance metrics, model classification, or phone recognizers, to improve segmentation performance. Experimental results on the British TV-drama "Doctor Who" database demonstrate that the proposed approach achieves a significant performance improvement over traditional baseline algorithms. The approach also yields high-quality segmentation and the associated transcription alignments for large, coherent TV-drama speech recordings.
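The abstract gives no implementation details, so the following is only a minimal sketch of one idea it mentions: using decoding hypotheses to attach time stamps to an untimed transcript. It assumes the recognizer returns words with (start, end) times and uses Python's standard `difflib.SequenceMatcher` for the word alignment, which is a stand-in and not the authors' method. The function names, the example sentence, and the times are all hypothetical.

```python
from difflib import SequenceMatcher


def transfer_timestamps(transcript_words, hyp_words, hyp_times):
    """Copy (start, end) times from matching hypothesis words onto transcript words.

    Transcript words with no matching hypothesis word are left as None.
    """
    aligned = [None] * len(transcript_words)
    matcher = SequenceMatcher(a=transcript_words, b=hyp_words, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            aligned[block.a + k] = hyp_times[block.b + k]
    return aligned


def segment_bounds(aligned, first, last):
    """Return (start, end) for transcript words first..last, using anchored words only."""
    times = [t for t in aligned[first:last + 1] if t is not None]
    if not times:
        return None  # no anchored word in this span
    return (times[0][0], times[-1][1])


# Hypothetical data: an untimed transcript and an imperfect timed hypothesis.
transcript = "the doctor opened the blue box and stepped inside".split()
hypothesis = "the doctor opened a blue box and step inside".split()
hyp_times = [(0.0, 0.2), (0.2, 0.6), (0.6, 1.0), (1.0, 1.1), (1.1, 1.4),
             (1.4, 1.8), (1.8, 1.9), (1.9, 2.2), (2.2, 2.7)]

aligned = transfer_timestamps(transcript, hypothesis, hyp_times)
first_segment = segment_bounds(aligned, 0, 5)   # "the doctor ... blue box"
second_segment = segment_bounds(aligned, 6, 8)  # "and stepped inside"
```

Misrecognized words ("a" for "the", "step" for "stepped") stay unanchored, and each segment takes its boundaries from the nearest correctly matched words inside it. A real system would also need to handle spans with no anchored words at all.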

Key words

speech recognition / semi-supervised / speech transcription

Classification

Information Technology and Security Science

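Cite this article

Long Yanhua, Mao Hongwei, Ye Hong. Semi-supervised automatic speech segmentation for TV-drama speech recognition [J]. Journal of Data Acquisition and Processing, 2019, 34(2): 281-287.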

Funding

Supported by the Shanghai Sailing Program (14YF1409300)

Supported by the National Natural Science Foundation of China (61701306)

Journal of Data Acquisition and Processing

Indexing: OA, Peking University Core Journals, CSCD, CSTPCD

ISSN: 1004-9037
