| 注册
首页|期刊导航|计算机技术与发展|基于路径权重的XML文档相似度仿真研究

基于路径权重的XML文档相似度仿真研究

赵艳妮 郭华磊 马军生

计算机技术与发展2016,Vol.26Issue(9):197-200,4.
计算机技术与发展2016,Vol.26Issue(9):197-200,4.DOI:10.3969/j.issn.1673-629X.2016.09.044

基于路径权重的XML文档相似度仿真研究

Simulation Research of XML Document Similarity Based on Path Weighting

赵艳妮 1郭华磊 2马军生3

作者信息

  • 1. 陕西职业技术学院 计算机科学系,陕西 西安 710100
  • 2. 西安理工大学 自动化与信息工程学院,陕西 西安 710048
  • 3. 西安通信学院 信息服务系,陕西 西安 710106
  • 折叠

摘要

Abstract

In order to realize the rapid and accurate retrieval of the XML document information,a tree similarity algorithm based on path weight is proposed. It considers the tree node information similarity and structural similarity,and the information is arranged in each level of the tree in accordance with the degree of importance by object rules of primary and secondary information organization,making the de-gree of importance for tree node information weakened from up to down. According to the characteristics that the node with closer dis-tance from the root node represents the more important information,and the lowest level of the information has minimal importance,the path weight is calculated automatically in accordance with the tree node in XML document tree level,which overcomes the disadvantage of equally distribution or manual setting for tree node information weigh in the traditional XML document,and solves the similarity calcu-lation of XML document tree,and realizes the fast matching of XML query tree and document. Simulation shows that the algorithm is im-proved in query efficiency,precision and recall.

关键词

相似度/路径权重/查询树/文档树

Key words

similarity/path weight/query tree/document tree

分类

信息技术与安全科学

引用本文复制引用

赵艳妮,郭华磊,马军生..基于路径权重的XML文档相似度仿真研究[J].计算机技术与发展,2016,26(9):197-200,4.

基金项目

国家自然科学基金资助项目(61272284) (61272284)

陕西省自然科学基金(2014JM8354) (2014JM8354)

陕西省教育重点实验室科技项目(13JS083) (13JS083)

计算机技术与发展

OACSTPCD

1673-629X

访问量1
|
下载量0
段落导航相关论文