China Railway Science (中国铁道科学), 2026, Vol. 47, Issue 2: 232-243 (12 pages). DOI: 10.3969/j.issn.1001-4632.2026.02.20
基于数据知识库的铁路客票敏感数据智能识别技术应用研究
Research on the Application of Intelligent Recognition Technology for Sensitive Railway Ticket Data Based on Data Knowledge Base
Abstract
To address the data security risks arising from the explosive growth of railway passenger transport data, where the core task is the intelligent identification and dynamic protection of sensitive information, an intelligent identification technology for sensitive railway passenger ticket data based on a data knowledge base is proposed. First, a three-level knowledge base of "laws and regulations-industry standards-enterprise norms" is constructed. Second, drawing on historical railway passenger ticket data, a multi-level intelligent identification algorithm for sensitive data is designed to identify sensitive information in multi-modal data efficiently and accurately. Finally, graph technology is introduced to construct a lineage graph of data assets and sensitive data, and sensitive information labels are propagated efficiently among related data nodes based on the topological relationships of data flow. The results show that, in structured data processing, the proposed technology identifies sensitive information at about 217 000 messages per second, almost twice the rate of the traditional solution. In unstructured data processing, injecting domain knowledge graphs raises the F1 value of sensitive entity recognition to 91.24% and reduces the context misjudgment rate to 5.88%. For multimedia images, the accuracy of text extraction and sensitive information recognition reaches 93.71%. The technology can significantly improve the accuracy and processing efficiency of sensitive data identification in railway passenger tickets.
Key words
Sensitive data/Knowledge base/Railway ticket/Intelligent recognition/Label propagation/Lineage graph
Classification
Traffic engineering
Cite this article
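郝晓培, 阎志远, 张军锋, 李雯, 刘相坤, 石瑞君. Research on the Application of Intelligent Recognition Technology for Sensitive Railway Ticket Data Based on Data Knowledge Base [J]. China Railway Science, 2026, 47(2): 232-243.
Funding
Academy Fund Project of China Academy of Railway Sciences Corporation Limited (2024YJ228)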
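Illustrative sketch
The abstract states that sensitive information labels are propagated among related data nodes along the topology of data flow in a lineage graph, but it does not give the algorithm. The following is only a minimal sketch of one plausible reading, assuming the lineage graph is a directed graph whose edges point from a source data asset to the asset derived from it, and that a derived asset inherits every label of its upstream assets. The function name `propagate_labels`, the node names, and the label `ID_NUMBER` are hypothetical and do not come from the paper.

```python
from collections import deque


def propagate_labels(edges, seed_labels):
    """Propagate sensitivity labels downstream along lineage edges.

    edges: iterable of (source, target) pairs; data flows source -> target.
    seed_labels: dict mapping node -> set of labels found by direct identification.
    Returns a dict mapping each labelled or reached node to its set of labels.
    """
    # Build an adjacency list of downstream neighbours.
    downstream = {}
    for src, dst in edges:
        downstream.setdefault(src, []).append(dst)

    # Copy the seeds so the caller's sets are not mutated.
    labels = {node: set(tags) for node, tags in seed_labels.items()}
    queue = deque(labels)
    while queue:
        node = queue.popleft()
        for child in downstream.get(node, []):
            before = len(labels.setdefault(child, set()))
            labels[child] |= labels[node]
            # Revisit a child only when it gained a label, so cycles terminate.
            if len(labels[child]) > before:
                queue.append(child)
    return labels


# Hypothetical lineage: an ID-number column feeds a dimension table,
# which in turn feeds a daily report; a train-number column feeds the report too.
edges = [
    ("orders.id_number", "dw.passenger_dim"),
    ("dw.passenger_dim", "report.daily_summary"),
    ("orders.train_no", "report.daily_summary"),
]
seeds = {"orders.id_number": {"ID_NUMBER"}}
result = propagate_labels(edges, seeds)
```

In this sketch only the seed nodes need to be scanned directly; every asset reachable from them inherits the label through the graph, which is one way the propagation described in the abstract could avoid re-identifying derived data.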