首页|期刊导航|数据采集与处理|多说话人分离与目标说话人提取的研究现状与展望

多说话人分离与目标说话人提取的研究现状与展望

鲍长春杨雪

数据采集与处理2024，Vol.39Issue(5)：1044-1061,18.

数据采集与处理2024，Vol.39Issue(5)：1044-1061,18.DOI:10.16337/j.1004-9037.2024.05.002

多说话人分离与目标说话人提取的研究现状与展望

Research Situation and Prospects of Multi-speaker Separation and Target Speaker Extraction

鲍长春 ¹杨雪¹

作者信息

1. 北京工业大学信息科学技术学院语音与音频信息处理研究所,北京 100124
折叠

摘要

Abstract

As a cutting-edge technology in speech signal processing,speech separation has significant research value and broad application prospects.Typically,the signal captured by the microphones contains speech signals from multiple speakers,noise and reverberation.To improve the user experience and the performance of backend devices,it is necessary to perform speech separation.Speech separation originated from the well-known cocktail party problem.It aims to separate the speech signals from the mixed signal.In recent years,researchers have proposed a large number of speech separation methods,which have significantly improved separation performance.This paper systematically reviews and summarizes these methods.First,based on whether the auxiliary information of the target speaker is leveraged,speech separation is divided into two categories,i.e.,multi-speaker separation and target speaker extraction.Second,these methods are introduced in detail,following the progression from conventional approaches to deep learning-based techniques.Finally,the existing challenges in speech separation are discussed and prospective research in the future are highlighted.

关键词

语音分离/鸡尾酒会问题/多说话人分离/目标说话人提取/深度学习

Key words

speech separation/cocktail party problem/multi-speaker separation/target speaker extraction/deep learning

分类

信息技术与安全科学

引用本文复制引用

鲍长春,杨雪..多说话人分离与目标说话人提取的研究现状与展望[J].数据采集与处理,2024,39(5):1044-1061,18.

基金项目

国家自然科学基金(61831019). （61831019）

数据采集与处理

OA北大核心CSTPCD

ISSN：1004-9037

访问量0

下载量0

段落导航