计算机工程2025,Vol.51Issue(6):1-19,19.DOI:10.19678/j.issn.1000-3428.0069005
声景识音:数字化时代声学场景分类的探索与前沿
Soundscape Recognition:Explorations and Frontiers of Acoustic Scene Classification in the Digital Era
摘要
Abstract
Acoustic Scene Classification(ASC)aims to enable computers to simulate the human auditory system in the task of recognizing various acoustic environments,which is a challenging task in the field of computer audition.With rapid advancements in intelligent audio processing technologies and neural network learning algorithms,a series of new algorithms and technologies for ASC have emerged in recent years.To comprehensively present the technological development trajectory and evolution in this field,this review systematically examines both early work and recent developments in ASC,providing a thorough overview of the field.This review first describes application scenarios and the challenges encountered in ASC and then details the mainstream frameworks in ASC,with a focus on the application of deep learning algorithms in this domain.Subsequently,it systematically summarizes frontier explorations,extension tasks,and publicly available datasets in ASC and finally discusses the prospects for future development trends in ASC.关键词
声学场景分类/深度学习/音频分类/语音识别/数据增强Key words
Acoustic Scene Classification(ASC)/deep learning/audio classification/speech recognition/Data Augmentation(DA)分类
计算机与自动化引用本文复制引用
庞鑫,葛凤培,李艳玲..声景识音:数字化时代声学场景分类的探索与前沿[J].计算机工程,2025,51(6):1-19,19.基金项目
国家自然科学基金(12204062,62266033,61806103,61562068) (12204062,62266033,61806103,61562068)
无穷维哈密顿系统及其算法应用教育部重点实验室开放课题(2023KFZD03) (2023KFZD03)
内蒙古自治区自然科学基金(2022LHMS06001) (2022LHMS06001)
内蒙古师范大学基本科研业务费专项资金(2022JBQN106,2022JBQN111,2022JBTD016) (2022JBQN106,2022JBQN111,2022JBTD016)
内蒙古师范大学研究生创新基金(CXJJS23066). (CXJJS23066)