Computer & Digital Engineering, 2024, Vol. 52, Issue (10): 3148-3152, 3171, 6. DOI: 10.3969/j.issn.1672-9722.2024.10.052
Bird Image Recognition Based on Multiscale Attention
阮涛 1, 郝智程 1
Author information
- 1. Institute of Applied Mathematics, Beijing Information Science and Technology University, Beijing 100010
Abstract
Bird images from different sub-categories look similar, while images of the same category show large intra-class variance due to complex backgrounds and varying poses. To address this problem, a convolutional neural network model based on multi-scale attention is proposed. Using a target module and a component module that require no learnable parameters, the model progressively shifts attention from the global image to the target and then to component images, forming a three-branch network that accepts multi-scale image inputs. In addition, a ranking loss is introduced to reduce background interference. On the CUB-200-2011 and NABirds datasets, the model achieves recognition accuracies of 87.21% and 85.96%, respectively. These results improve on the baseline model, which supports the effectiveness of the approach.
Keywords
bird image recognition / multiscale attention / ranking loss / convolutional neural networks
Classification
Information Technology and Security Science
Cite this article
阮涛, 郝智程. Bird Image Recognition Based on Multiscale Attention [J]. Computer & Digital Engineering, 2024, 52(10): 3148-3152, 3171, 6.