华侨大学学报(自然科学版)2017,Vol.38Issue(3):408-413,6.DOI:10.11830/ISSN.1000-5013.201703022
采用相关反馈和文档相似度的维吾尔语检索词加权方法
Uyghur Retrieval Word Weighting Scheme Using Relevance Feedback and Document Similarity
摘要
Abstract
For the issue that the effective retrieval of Uyghur web documents, a Uyghur retrieval word weighting scheme based on the relevance feedback and document similarity is proposed.First of all, the Uyghur documents are pre-processed to obtain the corresponding stem set.Then, the initial search is executed when the user input a number of retrieval words, and it extracts the top N documents based on local relevance feedback.Follow, the TF-IDF algorithm is used to compute the frequency similarity between retrieval word and feedback documents.At the same time, the cosine distance is used to compute the similarity between documents, so as to make twice weighted for retrieval words.Finally, it performs document retrieval according to the weight of retrieval words.Experimental results show that the proposed method can accurately retrieve the documents required by the user, and can sort them in the front.关键词
维吾尔语/文档检索/检索词加权/相关反馈/文档相似度Key words
Uygur/document retrieval/weighted retrieval words/relevance feedback/document similarity分类
信息技术与安全科学引用本文复制引用
于丽,亚森·艾则孜..采用相关反馈和文档相似度的维吾尔语检索词加权方法[J].华侨大学学报(自然科学版),2017,38(3):408-413,6.基金项目
新疆维吾尔自治区自然科学基金资助项目(2015211A016) (2015211A016)