包装与食品机械2026,Vol.44Issue(1):1-12,12.DOI:10.3969/j.issn.1005-1295.2026.01.001
基于频谱增强与跨域注意力的药品包装文本检测方法
Medicine packaging text detection method based on spectral enhancement and cross-dimensional attention
摘要
Abstract
To address complex interference challenges in pharmaceutical packaging text detection within smart healthcare scenarios,such as surface reflections,dense tiny characters,and non-planar deformations,a detection method based on spectral enhancement and cross-domain attention,named ArcCRAFT,is proposed.A dynamic receptive field adaptation mechanism was designed,employing a gradient-guided dilation rate routing strategy to enhance the model's perception of multi-scale broken text.A spectral feature enhancer was constructed,utilizing discrete Fourier transform to map spatial features to the frequency domain,suppressing high-frequency reflective noise while strengthening text edge features.A cross-dimensional global attention mechanism was introduced,enabling parallel interaction of channel,spatial,and pixel-level features to improve the analysis capability for complex textures.Experimental results on the PharmaBox dataset demonstrate that the ArcCRAFT model achieves a precision of 91.1%,a recall of 91.0%,and an F1-score of 91.0%,outperforming advanced methods such as ISTD-DLA and PP-OCRv5-server.This research provides a highly robust technical solution for automated dispensing in smart pharmacies.关键词
药盒文本检测/感受野自适应/频域增强/医疗信息化Key words
pharmaceutical box text detection/receptive field adaptation/frequency domain enhancement/medical informatization分类
通用工业技术引用本文复制引用
陈永辉,王文胜,黄民..基于频谱增强与跨域注意力的药品包装文本检测方法[J].包装与食品机械,2026,44(1):1-12,12.基金项目
国家重点研发计划项目(2020YFB1713203) (2020YFB1713203)
北京市教育委员会科技计划项目(KM202411232023) (KM202411232023)
北京信息科技大学"青年骨干教师"支持计划项目(YBT202403) (YBT202403)