Optics and Precision Engineering (光学精密工程), 2026, Vol. 34, Issue 5: 830-846 (17 pages). DOI: 10.37188/OPE.20263405.0830
Sharp eyes spot the "novel" in open-vocabulary object detection
Abstract
To address the low detection accuracy of novel classes in open-world scenarios, caused primarily by weak foreground discrimination and strong bias toward base classes, an open-vocabulary object detection framework named Sharp Eyes Spot the "Novel" in Open-Vocabulary Object Detection (SSN-OVD) is proposed. First, a Foreground Feature Discrimination (FFD) module is introduced, in which a foreground estimator models potential novel-class regions and generates high-quality pseudo-labels, enabling more precise foreground-background separation and enhancing the discriminability of foreground features. Second, a Bidirectional Feature Alignment (BFA) module is designed to combine bidirectional cross-modal alignment with confidence calibration, thereby mitigating base-class bias during training and strengthening the model's ability to learn robust representations of novel classes. Third, a Contrastive Denoising Training (CDT) module is developed, which incorporates noisy visual-text pairs into the contrastive learning process to further improve feature discrimination and generalization for novel categories. Experimental results demonstrate that the proposed approach achieves state-of-the-art performance, yielding novel-class detection accuracies of 44.9% on COCO and 37.4% on the more challenging fine-grained LVIS dataset. These results indicate that the method effectively enhances novel-class detection in open-world environments.
Key words
object detection / open-vocabulary / foreground feature discrimination / bidirectional feature alignment / contrastive denoising training
Classification
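Information Technology and Safety Science
Cite this article
JIN You, ZHANG Ruonan, DENG Zhen, YANG Jun, LIU Libo. Sharp eyes spot the "novel" in open-vocabulary object detection[J]. Optics and Precision Engineering, 2026, 34(5): 830-846, 17.
Funding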
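Natural Science Foundation of Ningxia (No. 2024AAC02010, No. 2023AAC02010)
Ningxia Leading Talents Program for Science and Technology Innovation (No. 2022GKLRLX03)
Yinchuan Science and Technology Program (No. 2025RC09)
National Natural Science Foundation of China (No. 62262053, No. 62506179)
2024 Key Research and Development Program of Ningxia Hui Autonomous Region (Talent Introduction Special Project) (No. 2024BEH04026)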