数据采集与处理2017,Vol.32Issue(1):119-125,7.DOI:10.16337/j.1004-9037.2017.01.014
基于数字结构特征的发票号码识别算法
Invoice Number Recognition Algorithm Based on Numerical Structure Characteristics
摘要
Abstract
Interference factors such as seal cover,invoice crease and so on,cause noise adhesion in number area of some invoice,which would seriously lead to the invoice number segmentation error.Aiming at this problem,a noise adhesion area repairing algorithm is proposed.At the same time,according to the font structure and characteristics of ordinary invoice number,invoice number recognition algorithm based on characteristics of digital structure is proposed.Firstly,define number structure features,including four kinds of fill area,two kinds of number of passing through the character,and four kinds of hollow area,which constitute a 10-dimensional feature vector of the number to be identified.Then,match the feature vector with the template features in the standard template library,by obtaining the Euclidean distance,and regard the corresponding number with the minimum Euclidean distances as the last recognition result.The proposed method and printed number recognition method based on the improved left and right contour features are compared.Experimental results indicate that the proposed identification algorithm has higher accuracy,faster recognition speed and stronger robustness to noise.关键词
发票号码识别/噪声粘连区域/数字结构特征Key words
invoice number recognition/noise adhesion area/numerical structure characteristics分类
信息技术与安全科学引用本文复制引用
崔文成,任磊,刘阳,邵虹..基于数字结构特征的发票号码识别算法[J].数据采集与处理,2017,32(1):119-125,7.基金项目
辽宁省自然科学基金(201202162)资助项目 (201202162)
辽宁省高等学校优秀人才支持计划(LJQ2013013)资助项目. (LJQ2013013)