Journal of Guilin University of Electronic Technology, 2017, Vol. 37, Issue 2: 111-115, 5.
Query design and optimization of XML document content based on dual index structure
Abstract
To address the time-consuming retrieval, slow response, and excessive resource consumption encountered when querying large XML documents, a dual index structure based on the B-tree is designed, and a query method built on this structure is proposed to locate the target content quickly. A path-based inverted index structure is adopted, which effectively reduces the time spent on content retrieval by comparing Dewey encodings. In addition, the contents of the XML document are divided into data units through word segmentation, and a PathGuide index database is established from the logical relationships between these data units. This index database effectively avoids needless access to nodes that are irrelevant to the query content. Several sets of comparative experiments indicate that the proposed method and its optimization are clearly superior in query efficiency.
Key words
extensible markup language/content query/data unit/inverted index/dual index structure
Classification
Information Technology and Security Science
Cite this article
Shou Zhaoyu (首照宇), Sun Ying (孙颖), Zhang Tong (张彤), Zhao Hui (赵晖). Query design and optimization of XML document content based on dual index structure [J]. Journal of Guilin University of Electronic Technology, 2017, 37(2): 111-115, 5.
Funding
National Natural Science Foundation of China (61362021, 61661017)
Guangxi Science and Technology Innovation Capacity and Conditions Construction Program (桂科能1598025-21)
Guangxi Natural Science Foundation (2013GXNSFDA019030, 2014GXNSFDA118035, 2016GXNSFAA380149)
Foundation of the Key Laboratory of Cognitive Radio, Ministry of Education (CRKL150103, 2011KF11)
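Illustrative sketch
The abstract names a path-based inverted index and Dewey-code comparison but gives no data structures, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes that each element receives a Dewey code formed from its sibling positions starting at the root, that whitespace splitting can stand in for the word segmentation step, and that a plain Python dictionary can stand in for the B-tree. The function names `build_path_inverted_index`, `is_ancestor`, and `query` are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def build_path_inverted_index(xml_text):
    """Map (label path, term) to the Dewey codes of elements whose own text contains the term."""
    root = ET.fromstring(xml_text)
    index = defaultdict(list)

    def visit(elem, dewey, path):
        # Assumption: whitespace splitting stands in for word segmentation.
        # set() keeps each element at most once per (path, term) key.
        for term in set((elem.text or "").lower().split()):
            index[(path, term)].append(dewey)
        # A child's Dewey code is the parent's code plus its 1-based sibling position.
        for position, child in enumerate(elem, start=1):
            visit(child, dewey + (position,), path + "/" + child.tag)

    visit(root, (1,), "/" + root.tag)
    return index


def is_ancestor(a, b):
    """Dewey code a denotes a proper ancestor of b when a is a proper prefix of b."""
    return len(a) < len(b) and b[:len(a)] == a


def query(index, path, term):
    """Return Dewey codes for a term under one label path, without visiting other paths."""
    return index.get((path, term.lower()), [])


doc = """<library>
  <book><title>XML Indexing</title><author>Li</author></book>
  <book><title>Query Optimization</title><author>Wang</author></book>
  <paper><title>XML Query</title></paper>
</library>"""

index = build_path_inverted_index(doc)
hits = query(index, "/library/book/title", "xml")
print(hits)
# Structural relationships are decided from the codes alone, with no tree traversal.
print(is_ancestor((1, 1), hits[0]))
```

Because the lookup key includes the label path, a query for a term under `/library/book/title` never touches entries recorded under `/library/paper/title`, which is the kind of irrelevant-node access the abstract says the index is meant to avoid.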