首页|期刊导航|数据采集与处理|基于声学分段模型的无监督语音样例检测

基于声学分段模型的无监督语音样例检测

李勃昊张连海郑永军

数据采集与处理2016，Vol.31Issue(2)：407-414,8.

数据采集与处理2016，Vol.31Issue(2)：407-414,8.DOI:10.16337/j.1004-9037.2016.02.023

基于声学分段模型的无监督语音样例检测

Unsupervised Query-by-Example Spoken Term Detection Based on Acoustic Segment Models

李勃昊 ¹张连海 ¹郑永军¹

作者信息

1. 解放军信息工程大学信息系统工程学院，郑州，450001
折叠

摘要

Abstract

A study of acoustic segment models (ASM s) for unsupervised query‐by‐example spoken term detec‐tion is presented .Firsty ,a Gaussian mixture model(GMM) is trained without any transcription information to label speech frames with Gaussian posteriorgram .Hierarchical agglomerative clustering is used to decompose the posterior features into acoustically exhibiting segments .A label is assigned to each result segment by k‐means clustering ,then posteriorgram is faciltitated to train ASMs .In query matching phase ,Viterbi decode is proposed to represent query and test posteriorgrams as ASM sequences .Dynamic match lattice spotting based on minimum edit distance is used to locate possible occurrences of the query term .Experimental results show that the proposed method outperforms traditional GMM and ASMs tokenizers .

关键词

声学分段模型/语音样例检测/后验概率特征/无监督

Key words

acoustic segment models/query-by-example spoken term detection/posterior features/unsu-pervised

分类

信息技术与安全科学

引用本文复制引用

李勃昊,张连海,郑永军..基于声学分段模型的无监督语音样例检测[J].数据采集与处理,2016,31(2):407-414,8.

基金项目

国家自然科学基金（61175017）资助项目。（）

数据采集与处理

OA北大核心CSCDCSTPCD

ISSN：1004-9037

访问量0

下载量0

段落导航