高技术通讯, 2025, Vol. 35, Issue (10): 1059-1068, 10. DOI: 10.3772/j.issn.1002-0470.2025.10.003
GCANet: a label text detection method for the visual Internet of Things
Abstract
To address the difficulty of detecting text on box labels in complex environments, a text detection method for the visual Internet of Things is proposed. A text detection network based on global context attention and coordinate attention (GCANet) is designed and introduced into the visual Internet of Things. First, an improved coordinate attention module is proposed, which avoids the loss of location information caused by two-dimensional global pooling by using two parallel one-dimensional pooling operations, one horizontal and one vertical. Then, a global context attention module is introduced to reduce the influence of complex backgrounds on text detection and to prevent dense or widely spaced text from being detected incorrectly. GCANet achieves F-measures of 87.4%, 86.9%, and 86.3% on the public datasets ICDAR2015, MSRA-TD500, and Total-Text, respectively. On the industrial label dataset Label-Text, its precision, recall, and F-measure reach 93.4%, 90.9%, and 92.1%, respectively. On the underground mine text dataset Mine-Text, its precision, recall, and F-measure reach 94.4%, 84.9%, and 89.9%, respectively. The experimental results show that the proposed text detection method for the visual Internet of Things performs well on both public and industrial datasets.
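The pooling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' improved module: it only contrasts a two-dimensional global average pool with two parallel one-dimensional average pools on a toy single-channel feature map. The function names `global_pool_2d` and `directional_pools` and the 4 x 6 map are assumptions made for illustration.

```python
def global_pool_2d(fmap):
    """Average over the whole H x W map: one scalar, position is lost."""
    h, w = len(fmap), len(fmap[0])
    return sum(sum(row) for row in fmap) / (h * w)


def directional_pools(fmap):
    """Two parallel 1D average pools (hypothetical helper).

    pool_h has length H: each row averaged along the width (horizontal pooling).
    pool_w has length W: each column averaged along the height (vertical pooling).
    """
    h, w = len(fmap), len(fmap[0])
    pool_h = [sum(row) / w for row in fmap]
    pool_w = [sum(fmap[i][j] for i in range(h)) / h for j in range(w)]
    return pool_h, pool_w


# Toy 4 x 6 feature map with a single active "text" pixel at row 1, column 3.
feat = [[0.0] * 6 for _ in range(4)]
feat[1][3] = 1.0

scalar = global_pool_2d(feat)            # a single number for the whole map
pool_h, pool_w = directional_pools(feat)

# The two 1D descriptors still reveal where the activation was.
row = pool_h.index(max(pool_h))
col = pool_w.index(max(pool_w))
print(row, col)  # 1 3
```

The 2D pool reduces the map to one value, so any pixel position would give the same result, whereas the row and column descriptors together still locate the activation. A full coordinate attention module would go on to turn these descriptors into attention weights, a step this sketch leaves out.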
Keywords
visual Internet of Things / text detection / coordinate attention module / global context attention module
Cite this article
孔二伟, 窦泽亚, 张亚邦, 贾运红, 王满利. GCANet: 面向视觉物联网的标签文本检测方法 [GCANet: a label text detection method for the visual Internet of Things] [J]. 高技术通讯, 2025, 35(10): 1059-1068, 10.
Funding
Supported by the National Natural Science Foundation of China (52074305) and the Henan Province Science and Technology Research Project (242102221006).