智能科学与技术学报2026,Vol.8Issue(1):47-60,14.DOI:10.11959/j.issn.2096-6652.202601
基于多特征融合的行人过街意图推理方法
Pedestrian crossing intent inference method based on multi-feature fusion
摘要
Abstract
Accurately understanding and predicting pedestrian crossing intent is crucial for ensuring the safety of autono-mous vehicles.Existing approaches are often limited to visual motion cues such as pedestrian trajectories or body poses,while overlooking interactive signals like gestures and head orientations,making it difficult to capture key cues of pedestrian-vehicle interaction.To address these limitations,ARPCI(accurate reasoning for pedestrian crossing intent)was proposed,a multi-feature fusion framework designed for pedestrian intent inference.Specifically,a pedestrian feature module was developed that first focused on skeleton-based features to capture motion trends,and further leveraged Mo-bileNet to extract head pose features.Combined with YOLOv8n for gesture recognition,pedestrian-vehicle interaction signals were captured more comprehensively by the model.In addition,a scene encoding module and a self-vehicle fea-ture module were introduced to integrate contextual and ego-dynamic information,thereby enhancing adaptability to com-plex traffic environments and improving prediction accuracy.Extensive experiments on the widely used JAAD dataset show that the approach achieves an accuracy of 88%,surpassing several state-of-the-art counterparts.Moreover,the abla-tion studies provide further evidence of the effectiveness of the proposed input features.关键词
行人过街意图/多模态特征融合/交互信号/行驶安全Key words
pedestrian crossing intention/multi-feature fusion/interaction signal/traffic safety分类
信息技术与安全科学引用本文复制引用
尹守国,杜泉成,李灵犀,王晓,孙长银..基于多特征融合的行人过街意图推理方法[J].智能科学与技术学报,2026,8(1):47-60,14.基金项目
国家自然科学基金项目(No.62522601)The National Natural Science Foundation of China(No.62522601) (No.62522601)