| 注册
首页|期刊导航|计算机技术与发展|基于双分支注意力机制的图像自动标注研究

基于双分支注意力机制的图像自动标注研究

张国有 崔永强

计算机技术与发展2024,Vol.34Issue(9):167-173,7.
计算机技术与发展2024,Vol.34Issue(9):167-173,7.DOI:10.20165/j.cnki.ISSN1673-629X.2024.0172

基于双分支注意力机制的图像自动标注研究

Research on Automatic Image Annotation Based on Dual-branch Attention Mechanism

张国有 1崔永强1

作者信息

  • 1. 太原科技大学 计算机科学与技术学院,山西 太原 030024
  • 折叠

摘要

Abstract

Automatic image annotation technology can transform low-level visual features of images into high-level semantic information understood by humans,enhancing the comprehensibility and searchability of images,and has important application value in the fields of image retrieval and classification.At present,automatic image annotation technology based on convolutional neural network models still faces problems such as shallow networks being unable to capture sufficient feature information,easily ignoring the interrelationships between labels,and difficulty in determining the number of labels during annotation.The proposed automatic image annotation method based on dual-branch attention mechanism first uses a dual-branch attention network to enhance the correlation between image features and labels,as well as learn the correlation between labels.Secondly,a multi scale feature extraction module is added to the spatial attention branch to extract multi scale features of the image,solving the problem of insufficient feature extraction in shallow networks.By fusing the outputs of the two branches again through the fusion module,the image features are further enhanced.Finally,the label quantity prediction module is used to predict the number of labels in the image to be annotated,further improving the accuracy of annotation.The proposed model was experimentally analyzed on three benchmark datasets,Corel 5K,ESP Game,and IAPR-TC-12.The experimental results showed that the proposed method can effectively solve the above problems and improve the effectiveness and ac-curacy of labeling.

关键词

图像自动标注/卷积神经网络/多尺度特征/注意力机制/特征融合

Key words

automatic image annotation/convolutional neural network/multi scale feature/attention mechanism/feature fusion

分类

信息技术与安全科学

引用本文复制引用

张国有,崔永强..基于双分支注意力机制的图像自动标注研究[J].计算机技术与发展,2024,34(9):167-173,7.

基金项目

山西省自然科学基金项目(202203021221145) (202203021221145)

国家自然科学基金项目(62072325) (62072325)

太原科技大学科技创新基金项目(20212039) (20212039)

计算机技术与发展

OACSTPCD

1673-629X

访问量0
|
下载量0
段落导航相关论文