Chinese General Practice, 2025, Vol. 28, Issue 29: 3638-3644, 3652, 8. DOI: 10.12114/j.issn.1007-9572.2025.0238
Exploration of Technologies and Methods for Constructing a Swallowing Acoustic Database
Abstract
Dysphagia is common among older adults and, if not properly managed, may lead to aspiration, malnutrition, and pulmonary infection. Acoustic-based assessment offers a non-invasive, practical, and remotely applicable approach, yet current research is limited by small sample sizes and a lack of standardized data protocols. This study recruited 650 older adults from 13 care institutions in Beijing and Shijiazhuang, of whom 635 completed valid audio tasks. A total of 7,922 high-quality recordings were collected, covering swallowing, coughing, and speech sounds. From each recording, 23 acoustic features spanning the time, frequency, energy, and nonlinear domains were extracted, yielding 182,206 feature data points (7,922 recordings × 23 features). Waveform, spectrogram, and time-frequency analyses confirmed significant differences across sound types, highlighting the discriminative value of the acoustic features. A standardized workflow for audio collection, processing, and feature extraction was developed, resulting in a comprehensive swallowing acoustic database. The database supports the identification of acoustic biomarkers, the development of AI-driven recognition models, and the advancement of remote dysphagia assessment, giving it both research value and broad application prospects.
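As a rough illustration of per-recording feature extraction, the sketch below computes one feature from each of three of the domains named in the abstract: zero-crossing rate (time), RMS amplitude (energy), and spectral centroid (frequency). The abstract does not list the 23 features or the tooling used, so the function name `extract_features`, the choice of features, the 16 kHz sample rate, and the synthetic test tone are all assumptions made for illustration and are not the authors' method.

```python
import numpy as np


def extract_features(signal: np.ndarray, sample_rate: int) -> dict:
    """Return three illustrative acoustic features for a mono clip.

    Hypothetical helper: these are NOT the 23 features used in the study.
    """
    signal = np.asarray(signal, dtype=np.float64)

    # Time domain: fraction of adjacent sample pairs whose sign differs.
    signs = np.signbit(signal)
    zcr = float(np.mean(signs[1:] != signs[:-1]))

    # Energy domain: root-mean-square amplitude.
    rms = float(np.sqrt(np.mean(signal ** 2)))

    # Frequency domain: magnitude-weighted mean frequency (spectral centroid).
    spectrum = np.abs(np.fft.rfft(signal))
    freqs = np.fft.rfftfreq(signal.size, d=1.0 / sample_rate)
    total = spectrum.sum()
    centroid = float((freqs * spectrum).sum() / total) if total > 0 else 0.0

    return {"zcr": zcr, "rms": rms, "spectral_centroid_hz": centroid}


# Synthetic stand-in for a recording: one second of a 440 Hz tone (assumed values).
sample_rate = 16_000
t = np.arange(sample_rate) / sample_rate
tone = 0.5 * np.sin(2 * np.pi * 440.0 * t)
features = extract_features(tone, sample_rate)

# Size of the feature table reported in the abstract: recordings x features.
n_recordings, n_features = 7_922, 23
total_points = n_recordings * n_features
```

Applied to every recording, a function of this kind produces one row per clip and one column per feature, which is how 7,922 recordings and 23 features give the 182,206 data points reported above.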
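Keywords
Deglutition disorders / Dysphagia / Aged / Swallowing sounds / Coughing sounds / Speech sounds / Acoustic features / Database
Classification
Medicine and Health
Cite this article
Li Dan, Liu Tao, Luo Wei, Song Hongdan, Shang Shaomei. Exploration of Technologies and Methods for Constructing a Swallowing Acoustic Database [J]. Chinese General Practice, 2025, 28(29): 3638-3644, 3652, 8.
Funding
Peking University Health Science Center "Medicine + X" Project (BMU2024YXXLHGG005)
National Key Research and Development Program of China (2020YFC2008800, 2020YFC2008801)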