计算机应用研究2024,Vol.41Issue(1):217-221,5.DOI:10.19734/j.issn.1001-3695.2023.05.0212
基于词性标注的启发式在线日志解析方法
Heuristic online log parsing method based on part-of-speech tagging
摘要
Abstract
To solve the problems of low parsing accuracy and poor generalization caused by the insufficient distinguishing abi-lity of log feature representations for logs used in existing heuristic log parsing methods,this paper proposed PosParser,a heuris-tic online log parsing method.The method used function token sequence(FTS)derived from the concept of trigger words as fea-ture representations,and consisted of the two-stage detection method for solving the problem of complex logs that were prone to over-parsing,and the post-processing for dealing with variable-length parameter logs.PosParser achieved an average parsing ac-curacy of 0.952 on 16 real-life log datasets.The results demonstrate that FTS has adequate distinguishing ability for logs and PosParser is effective and robust.关键词
日志分析/日志解析/触发词提取/词性标注/系统运维Key words
log analysis/log parsing/trigger word extraction/part-of-speech tagging/system maintenance分类
信息技术与安全科学引用本文复制引用
蒋金钊,傅媛媛,徐建..基于词性标注的启发式在线日志解析方法[J].计算机应用研究,2024,41(1):217-221,5.基金项目
国防基础科研计划国防科技重点实验室稳定支持项目(WDZC20225250405) (WDZC20225250405)
国家自然科学基金资助项目(61872186) (61872186)