Computer Engineering (计算机工程), 2019, Vol. 45, Issue 3: 175-181, 187 (8 pages). DOI: 10.19678/j.issn.1000-3428.0052686
基于数学文本和表达式转换的融合检索模型
Integration Retrieval Model Based on Transformation of Mathematical Text and Expression
Abstract
In Mathematical Information Retrieval (MIR), both queries and retrieval results are mainly mathematical expressions, so the semantics of the mathematical text in documents is ignored. To address this, a mathematical expression retrieval model that incorporates mathematical text features is proposed. Mathematical text is extracted by traversing Chinese scientific and technical documents. A mathematical dictionary is used to map the mathematical text to LaTeX mathematical expressions, which are then converted into binary tree structures. On this basis, a mathematical expression index is constructed and a matching algorithm is designed to support retrieval over both mathematical text and expressions. Experiments show that the method improves the retrieval performance of the mathematical retrieval system.
Key words
Mathematical Information Retrieval (MIR) / mathematical text / mathematical expression / dictionary / index
Classification
Information Technology and Security Science
Cite this article
ZHANG Qianqian (张倩倩), TIAN Xuedong (田学东), YANG Fang (杨芳), LI Xinfu (李新福). Integration Retrieval Model Based on Transformation of Mathematical Text and Expression [J]. Computer Engineering, 2019, 45(3): 175-181, 187.
Funding
National Natural Science Foundation of China (61375075)
Key Project of Science and Technology Research in Higher Education Institutions of Hebei Province, Hebei Provincial Department of Education (ZD2017208, ZD2017209)
Hebei University "One Province, One University" Project
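The pipeline summarized in the abstract (dictionary lookup from mathematical text to LaTeX, conversion to a binary tree, then indexing) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the four-entry dictionary MATH_DICT, the operator precedences, the use of a shunting-yard style parser, and the exact-match index keyed on the preorder traversal, which stands in for the paper's matching algorithm.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Hypothetical dictionary: Chinese mathematical phrases -> LaTeX operators.
MATH_DICT = {"加上": "+", "减去": "-", "乘以": r"\times", "除以": r"\div"}

# Assumed binary-operator precedences (higher binds tighter).
PRECEDENCE = {"+": 1, "-": 1, r"\times": 2, r"\div": 2}


@dataclass
class Node:
    value: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def text_to_latex(text):
    """Replace each dictionary phrase with its LaTeX operator, longest phrase first."""
    for phrase in sorted(MATH_DICT, key=len, reverse=True):
        text = text.replace(phrase, f" {MATH_DICT[phrase]} ")
    return " ".join(text.split())


def tokenize(latex):
    """Split into LaTeX commands, alphanumeric operands, and +, -, parentheses."""
    return re.findall(r"\\[A-Za-z]+|[A-Za-z0-9]+|[+\-()]", latex)


def to_tree(tokens):
    """Build a binary expression tree from infix tokens (shunting-yard style)."""
    operands, operators = [], []

    def reduce():
        op = operators.pop()
        right = operands.pop()
        left = operands.pop()
        operands.append(Node(op, left, right))

    for tok in tokens:
        if tok in PRECEDENCE:
            # Pop operators of equal or higher precedence (left-associative).
            while (operators and operators[-1] != "("
                   and PRECEDENCE[operators[-1]] >= PRECEDENCE[tok]):
                reduce()
            operators.append(tok)
        elif tok == "(":
            operators.append(tok)
        elif tok == ")":
            while operators[-1] != "(":
                reduce()
            operators.pop()  # discard the matching "("
        else:
            operands.append(Node(tok))
    while operators:
        reduce()
    return operands[0]


def preorder(node):
    """Preorder traversal, used here as a simple canonical key for a tree."""
    if node is None:
        return []
    return [node.value] + preorder(node.left) + preorder(node.right)


def build_index(docs):
    """Map each expression's preorder key to the ids of documents containing it."""
    index = {}
    for doc_id, text in docs.items():
        key = " ".join(preorder(to_tree(tokenize(text_to_latex(text)))))
        index.setdefault(key, []).append(doc_id)
    return index
```

Under these assumptions, a text query such as "a加上b" and an expression query "a + b" reduce to the same key, so one index lookup serves both kinds of query. A practical system would need a far larger dictionary and a similarity-based match rather than exact key equality.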