Computer Technology and Development (计算机技术与发展), 2016, Vol. 26, Issue 9: 197-200 (4 pages). DOI: 10.3969/j.issn.1673-629X.2016.09.044
Simulation Research of XML Document Similarity Based on Path Weighting
Abstract
To enable fast and accurate retrieval of XML document information, a tree similarity algorithm based on path weight is proposed. The algorithm considers both the similarity of tree node information and the structural similarity of the trees. Following the rule of organizing an object's primary information before its secondary information, information is arranged across the levels of the tree by importance, so that the importance of node information decreases from top to bottom. Because nodes closer to the root represent more important information and the lowest level carries the least, the path weight is calculated automatically from each node's level in the XML document tree. This overcomes the drawback of traditional approaches, in which node information weights are distributed equally or set manually. It also supports similarity calculation between XML document trees and fast matching of an XML query tree against documents. Simulation shows that the algorithm improves query efficiency, precision, and recall.
Keywords
similarity/path weight/query tree/document tree
Classification
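Information Technology and Security Science
Cite this article
Zhao Yanni (赵艳妮), Guo Hualei (郭华磊), Ma Junsheng (马军生). Simulation Research of XML Document Similarity Based on Path Weighting [J]. Computer Technology and Development, 2016, 26(9): 197-200.
Funding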
National Natural Science Foundation of China (61272284)
Natural Science Foundation of Shaanxi Province (2014JM8354)
Shaanxi Provincial Education Key Laboratory Science and Technology Project (13JS083)
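
Illustrative sketch
The abstract describes path weights that are derived automatically from a node's level, with weights shrinking as depth increases. The record does not give the paper's formula, so the following is only a minimal sketch under stated assumptions: the decay `2 ** -depth`, the normalization, and the function names `paths_with_depth`, `level_weights`, and `similarity` are all hypothetical choices made for illustration. It also compares element tag paths only and ignores node text, which the paper's algorithm reportedly takes into account.

```python
import xml.etree.ElementTree as ET


def paths_with_depth(root):
    """Map every root-to-node tag path to its depth (the root has depth 1).

    Repeated sibling tags collapse into one path, a simplification.
    """
    found = {}
    stack = [(root, "/" + root.tag, 1)]
    while stack:
        node, path, depth = stack.pop()
        found[path] = depth
        for child in node:
            stack.append((child, path + "/" + child.tag, depth + 1))
    return found


def level_weights(paths):
    """ASSUMED weighting: 2**-depth per path, normalized to sum to 1."""
    raw = {p: 2.0 ** -d for p, d in paths.items()}
    total = sum(raw.values())
    return {p: w / total for p, w in raw.items()}


def similarity(query_xml, doc_xml):
    """Weighted share of query-tree paths that also occur in the document tree."""
    query_paths = paths_with_depth(ET.fromstring(query_xml))
    doc_paths = paths_with_depth(ET.fromstring(doc_xml))
    weights = level_weights(query_paths)
    return sum(w for p, w in weights.items() if p in doc_paths)


query = "<book><title/><author><name/></author></book>"
doc_a = "<book><title/><author><name/></author><year/></book>"
doc_b = "<book><title/><publisher/></book>"

print(similarity(query, doc_a))  # every query path is present in doc_a
print(similarity(query, doc_b))  # the author subtree is missing in doc_b
```

Under this assumed weighting, a mismatch near the root lowers the score more than a mismatch at a leaf, which mirrors the abstract's claim that upper levels carry the more important information.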