Computer Engineering (计算机工程), 2011, Vol. 37, Issue 16: 39-41, 3. DOI: 10.3969/j.issn.1000-3428.2011.16.013
Design and Implementation of Search Engine Based on Lucene (基于Lucene的搜索引擎设计与实现)
Abstract
The number of File Transfer Protocol (FTP) resources on the China Education and Research Network (CERNET) is very large, which makes individual resources difficult to find. To address this problem, a high-performance FTP search engine is designed based on EdtFTPJ and Lucene. Struts 1.2 is used to implement the Model View Controller (MVC) architecture. The data acquisition module uses a finite state machine based on regular expressions to extract information, and the index module uses an inverted index. Word segmentation uses a dictionary-based maximum matching algorithm for Chinese text. Experimental results indicate that the proposed scheme improves query efficiency while preserving the accuracy of the retrieval results.
Keywords
File Transfer Protocol (FTP) search engine / Lucene framework / Model View Controller (MVC) / finite state automata / inverted index
Classification
Information Technology and Security Science
Cite this article
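赵珂 (Zhao Ke), 逯鹏 (Lu Peng), 李永强 (Li Yongqiang). Design and Implementation of Search Engine Based on Lucene [J]. Computer Engineering (计算机工程), 2011, 37(16): 39-41, 3.
Funding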
National Natural Science Foundation of China (60841004, 60971110)
Zhengzhou University Innovative Experiment Fund (2009cxsy100)