| 注册
首页|期刊导航|计算机工程|声景识音:数字化时代声学场景分类的探索与前沿

声景识音:数字化时代声学场景分类的探索与前沿

庞鑫 葛凤培 李艳玲

计算机工程2025,Vol.51Issue(6):1-19,19.
计算机工程2025,Vol.51Issue(6):1-19,19.DOI:10.19678/j.issn.1000-3428.0069005

声景识音:数字化时代声学场景分类的探索与前沿

Soundscape Recognition:Explorations and Frontiers of Acoustic Scene Classification in the Digital Era

庞鑫 1葛凤培 2李艳玲3

作者信息

  • 1. 内蒙古师范大学计算机科学技术学院,内蒙古呼和浩特 010022
  • 2. 北京邮电大学图书馆,北京 100876
  • 3. 内蒙古师范大学计算机科学技术学院,内蒙古呼和浩特 010022||内蒙古师范大学无穷维哈密顿系统及其算法应用教育部重点实验室,内蒙古呼和浩特 010022
  • 折叠

摘要

Abstract

Acoustic Scene Classification(ASC)aims to enable computers to simulate the human auditory system in the task of recognizing various acoustic environments,which is a challenging task in the field of computer audition.With rapid advancements in intelligent audio processing technologies and neural network learning algorithms,a series of new algorithms and technologies for ASC have emerged in recent years.To comprehensively present the technological development trajectory and evolution in this field,this review systematically examines both early work and recent developments in ASC,providing a thorough overview of the field.This review first describes application scenarios and the challenges encountered in ASC and then details the mainstream frameworks in ASC,with a focus on the application of deep learning algorithms in this domain.Subsequently,it systematically summarizes frontier explorations,extension tasks,and publicly available datasets in ASC and finally discusses the prospects for future development trends in ASC.

关键词

声学场景分类/深度学习/音频分类/语音识别/数据增强

Key words

Acoustic Scene Classification(ASC)/deep learning/audio classification/speech recognition/Data Augmentation(DA)

分类

计算机与自动化

引用本文复制引用

庞鑫,葛凤培,李艳玲..声景识音:数字化时代声学场景分类的探索与前沿[J].计算机工程,2025,51(6):1-19,19.

基金项目

国家自然科学基金(12204062,62266033,61806103,61562068) (12204062,62266033,61806103,61562068)

无穷维哈密顿系统及其算法应用教育部重点实验室开放课题(2023KFZD03) (2023KFZD03)

内蒙古自治区自然科学基金(2022LHMS06001) (2022LHMS06001)

内蒙古师范大学基本科研业务费专项资金(2022JBQN106,2022JBQN111,2022JBTD016) (2022JBQN106,2022JBQN111,2022JBTD016)

内蒙古师范大学研究生创新基金(CXJJS23066). (CXJJS23066)

计算机工程

OA北大核心

1000-3428

访问量0
|
下载量0
段落导航相关论文