Computer and Modernization, Issue (9): 50-56, 7. DOI: 10.3969/j.issn.1006-2475.2015.09.011
A Balance Algorithm Between Handwriting Error and User Effort
Abstract
To address the poor performance of current computer-assisted annotation and transcription of handwritten text documents, a new algorithm is proposed that predicts the error rate in a block of automatically recognized words and estimates how much user effort is required to correct a transcription to a user-defined error rate. First, the main problem in traditional error estimation methods is analyzed. Then, the error is estimated for a whole block of words to improve accuracy. Finally, the best-performing techniques from previous work are combined to form the proposed method. The method is integrated into an interactive approach to transcribing handwritten text documents that makes efficient use of user interactions through active and semi-supervised learning techniques. Transcription results, expressed as the trade-off between user effort and transcription accuracy, are reported for two real handwritten documents and demonstrate the effectiveness of the proposed algorithm.
Keywords
computer-assisted annotation / handwriting recognition / user effort / balance / text transcription / error estimation
Classification
Information Technology and Security Science
Cite this article
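SHANG Xuelian, LIANG Chuanjun. A Balance Algorithm Between Handwriting Error and User Effort [J]. Computer and Modernization, 2015, (9): 50-56, 7.
Funding
Natural Science Foundation of Xinjiang Uygur Autonomous Region (2013211A031)
Xinjiang Institute of Engineering Foundation (2014030415)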