Computer & Digital Engineering (计算机与数字工程), 2019, Vol. 47, Issue 5: 1208-1211, 1239 (5 pages). DOI: 10.3969/j.issn.1672-9722.2019.05.038
Research on Lucene Full Text Search Sorting Algorithm Based on Web (基于Web的Lucene全文搜索排序算法的研究)
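Sha Yangyang 1, Wu Chen 1
Author information
- 1. School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang 212000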
Abstract
As computer and network technology matures, enterprises accumulate large volumes of electronic information. To let enterprises retrieve the information they need efficiently and accurately, search engine technology must keep developing, and the sorting algorithm used in text retrieval is a major factor in search engine quality. The original Lucene search engine uses a sorting algorithm based on the vector space model, but this algorithm has many shortcomings in natural-language semantic understanding. This paper analyzes Lucene's architecture and sorting algorithm and compares it with the classic DirectHit and PageRank algorithms. On that basis, it proposes a new Vector-PageRank algorithm that optimizes the original algorithm and addresses its shortcomings, and it designs and implements an enterprise search engine system based on the new algorithm. The experimental results show that the optimized Lucene sorting algorithm is more accurate and better matches what users care about.
Keywords
Lucene / vector space model (VSM) / similarity / Vector-PageRank algorithm
Classification
Information Technology and Security Science
Cite this article
Sha Yangyang, Wu Chen. Research on Lucene Full Text Search Sorting Algorithm Based on Web [J]. Computer & Digital Engineering, 2019, 47(5): 1208-1211, 1239, 5.