数据采集与处理2025,Vol.40Issue(5):1139-1152,14.DOI:10.16337/j.1004-9037.2025.05.003
多智能体协同的开放域多模态三维模型识别算法
Recognition Algorithm for Multi-agent Collaborative Open-Domain Multimodal 3D Model
摘要
Abstract
To address the challenge of recognizing unlabeled 3D models in open-domain,this paper proposes a multi-agent collaborative algorithm for open-domain multimodal 3D model recognition.The algorithm employs a reinforcement learning framework to simulate human cognitive processes.Within this framework,a multi-agent system is utilized to extract and fuse multimodal information,which enables a comprehensive understanding of the feature space while leveraging the similarity of multimodal samples to enhance model training.Additionally,a progressive pseudo-label generation method is introduced in the reinforcement learning environment.It dynamically adjusts clustering constraints to generate reliable pseudo-labels for a subset of unlabeled data during training,mimicking human exploratory learning of unknown data.These mechanisms collectively update the network parameters based on environmental feedback rewards,effectively controlling the extent of exploratory learning and ensuring accurate learning for unknown categories.Experimental results show that the average recognition accuracy of the method proposed in this paper on the three-dimensional dataset OS-MN40 reaches 65.6%.After transferring the method to the image domain,the classification accuracy on the CIFAR10 dataset reaches 95.6%,which provdies a universal and efficient solution for the research of open-domain three-dimensional model recognition.关键词
新类发现/开放域/强化学习/深度聚类/多模态特征融合Key words
novel class discovery/open-domain/reinforcement learning/deep clustering/multimodal feature fusion分类
信息技术与安全科学引用本文复制引用
李锵,马秋阳,张宁,聂为之..多智能体协同的开放域多模态三维模型识别算法[J].数据采集与处理,2025,40(5):1139-1152,14.基金项目
国家自然科学基金(62272337,62072232) (62272337,62072232)
天津市自然科学基金(16JCZDJC31100) (16JCZDJC31100)
企业档案多模态信息智能管理大模型关键技术研究及应用(2024-X-001). (2024-X-001)