| 注册
首页|期刊导航|网络与信息安全学报|适用于内容安全审核的图像文本识别质量评估方法

适用于内容安全审核的图像文本识别质量评估方法

张玉龙 梁天一 张茹 刘功申

网络与信息安全学报2026,Vol.12Issue(2):55-64,10.
网络与信息安全学报2026,Vol.12Issue(2):55-64,10.DOI:10.11959/j.issn.2096-109x.AQ25222

适用于内容安全审核的图像文本识别质量评估方法

Quality assessment method for image text recognition in content security review

张玉龙 1梁天一 2张茹 3刘功申4

作者信息

  • 1. 北京邮电大学网络空间安全学院,北京 100876||上海交通大学计算机学院(网络空间安全学院),上海 200240
  • 2. 华东师范大学计算机科学与技术学院,上海 200062
  • 3. 北京邮电大学网络空间安全学院,北京 100876
  • 4. 上海交通大学计算机学院(网络空间安全学院),上海 200240
  • 折叠

摘要

Abstract

A quality assessment method based on the consistency confidence(CC)mechanism was proposed to ad-dress the challenge of image text recognition quality assessment in content security auditing scenarios.Significant deficiencies were identified in existing evaluation methods.Traditional methods were found to rely on manual an-notation and be cost-prohibitive.The visual language model as judge(VLM-as-Judge)approach was observed to suffer from logical paradoxes and biases.Uncertainty quantification methods were shown to require access to inter-nal model parameters.With the proposed CC-based method,unsupervised quality assessment was achieved through multi-model consistency analysis.Edit distance was utilized to quantify the degree of consistency,and dynamic weighting with threshold gating mechanisms was designed to identify low-quality outputs.This method possesses three major advantages:black-box evaluation,complete unsupervisedness,and model-agnostic property.Experi-mental results demonstrated that the quality verification F1 score reached 49.9%~51.61%,which represented a 42.76%improvement over baseline methods.The best performance was exhibited in challenging scenarios such as adversarial text and complex scenes.Through model ensemble,low-cost open-source models were enabled to sur-pass high-cost closed-source models,with a performance improvement of 32 points.

关键词

图像文本识别/质量评估/内容安全/无监督

Key words

image text recognition/quality assessment/content security/unsupervision

分类

信息技术与安全科学

引用本文复制引用

张玉龙,梁天一,张茹,刘功申..适用于内容安全审核的图像文本识别质量评估方法[J].网络与信息安全学报,2026,12(2):55-64,10.

基金项目

国家自然科学基金联合重点项目(No.U21B2020) The Joint Funds of the National Natural Science Foundation of China(No.U21B2020) (No.U21B2020)

网络与信息安全学报

2096-109X

访问量0
|
下载量0
段落导航相关论文