Computer Technology and Development, 2024, Vol. 34, Issue 7: 24-30, 7. DOI: 10.20165/j.cnki.ISSN1673-629X.2024.0119
Text Guided Clothing Image Retrieval Based on Multi-granularity Matching
Abstract
Text-guided image retrieval combines a query image and a text condition into a single multimodal query. Existing methods improve performance by constructing more advanced fine-grained metric learning, but under imprecise text conditions this can cause the model to overfit the target image and make the retrieval results monotonous. To address this issue, we propose a text-guided clothing image retrieval method based on feature enhancement and multi-granularity matching. First, noise following a normal distribution is generated from the distribution of the target features, producing small intra-class jitter. Then, constraints are imposed on the enhanced features according to the fluctuations of the target features: the larger the fluctuations, the greater the penalty on the enhanced features, which yields a coarse-grained matching loss. Finally, we optimize the learning strategy by using dynamic weights that decay continuously over training iterations to unify the coarse-grained and fine-grained losses. The proposed method reduces the model's rejection of potential target images and improves the diversity of feature recognition. Extensive experiments on two publicly available clothing datasets, FashionIQ and Shoes, show that the proposed method improves recall and provides richer retrieval results.
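The three steps named in the abstract (Gaussian feature jitter, a fluctuation-weighted coarse-grained loss, and a decaying weight that merges the coarse and fine losses) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract gives no formulas, so the function names, the per-dimension standard deviation as the measure of "fluctuation", the squared-distance penalty, the exponential decay schedule, and the parameters `scale`, `w0` and `rate` are all hypothetical.

```python
import math
import random


def feature_std(targets):
    """Per-dimension (population) standard deviation of target feature vectors.

    Assumption: this stands in for the "fluctuation" of the target features.
    """
    n = len(targets)
    dims = len(targets[0])
    means = [sum(t[d] for t in targets) / n for d in range(dims)]
    return [
        math.sqrt(sum((t[d] - means[d]) ** 2 for t in targets) / n)
        for d in range(dims)
    ]


def augment(feature, std, scale=0.1, rng=random):
    """Add zero-mean Gaussian noise whose spread follows the target statistics.

    `scale` is a hypothetical knob that keeps the jitter small.
    """
    return [x + rng.gauss(0.0, scale * s) for x, s in zip(feature, std)]


def coarse_loss(augmented, target, std):
    """Squared distance to the target, weighted per dimension by the fluctuation.

    Larger fluctuation means a larger penalty (the exact form is an assumption).
    """
    return sum(
        s * (a - t) ** 2 for a, t, s in zip(augmented, target, std)
    ) / len(target)


def coarse_weight(step, total_steps, w0=1.0, rate=5.0):
    """Weight of the coarse-grained term; decays exponentially with training step.

    The exponential schedule is assumed; the abstract only says the weight decays.
    """
    return w0 * math.exp(-rate * step / total_steps)


def total_loss(fine, coarse, step, total_steps):
    """Combine a precomputed fine-grained loss with the weighted coarse-grained loss."""
    return fine + coarse_weight(step, total_steps) * coarse
```

In this sketch the coarse-grained term dominates early in training and fades as `step` approaches `total_steps`, so the fine-grained metric loss takes over later; the paper's actual schedule and loss forms may differ.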
Keywords
text guided/image retrieval/feature enhancement/multi-granularity matching/multi-modal fusion
Classification
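Information Technology and Security Science
Cite this article
Xiao Huaxing, Ma Lili, Chen Jinguang. Text Guided Clothing Image Retrieval Based on Multi-granularity Matching[J]. Computer Technology and Development, 2024, 34(7): 24-30, 7.
Funding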
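Natural Science Basic Research Program of Shaanxi Province (2023-JC-YB-568)
Scientific Research Program of the Shaanxi Provincial Department of Education (22JP028)
Shaanxi Computer Society & Xiangteng Company Fund Project (XT-QC-202309-119287)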