首页|期刊导航|华侨大学学报（自然科学版）|采用相关反馈和文档相似度的维吾尔语检索词加权方法

采用相关反馈和文档相似度的维吾尔语检索词加权方法

于丽亚森·艾则孜

华侨大学学报（自然科学版）2017，Vol.38Issue(3)：408-413,6.

华侨大学学报（自然科学版）2017，Vol.38Issue(3)：408-413,6.DOI:10.11830/ISSN.1000-5013.201703022

采用相关反馈和文档相似度的维吾尔语检索词加权方法

Uyghur Retrieval Word Weighting Scheme Using Relevance Feedback and Document Similarity

于丽 ¹亚森·艾则孜¹

作者信息

1. 新疆警察学院信息安全工程系, 新疆乌鲁木齐 830011
折叠

摘要

Abstract

For the issue that the effective retrieval of Uyghur web documents, a Uyghur retrieval word weighting scheme based on the relevance feedback and document similarity is proposed.First of all, the Uyghur documents are pre-processed to obtain the corresponding stem set.Then, the initial search is executed when the user input a number of retrieval words, and it extracts the top N documents based on local relevance feedback.Follow, the TF-IDF algorithm is used to compute the frequency similarity between retrieval word and feedback documents.At the same time, the cosine distance is used to compute the similarity between documents, so as to make twice weighted for retrieval words.Finally, it performs document retrieval according to the weight of retrieval words.Experimental results show that the proposed method can accurately retrieve the documents required by the user, and can sort them in the front.

关键词

维吾尔语/文档检索/检索词加权/相关反馈/文档相似度

Key words

Uygur/document retrieval/weighted retrieval words/relevance feedback/document similarity

分类

信息技术与安全科学

引用本文复制引用

于丽,亚森·艾则孜..采用相关反馈和文档相似度的维吾尔语检索词加权方法[J].华侨大学学报（自然科学版）,2017,38(3):408-413,6.

基金项目

新疆维吾尔自治区自然科学基金资助项目(2015211A016) （2015211A016）

华侨大学学报（自然科学版）

OA北大核心CSTPCD

ISSN：1000-5013

访问量0

下载量0

段落导航