常州大学学报(自然科学版)2013,Vol.25Issue(3):71-75,5.DOI:10.3969/j.issn.2095-0411.2013.03.018
新浪微博搜索排序方法研究
Research of Searching and Sorting Method of Sina Microblogging
摘要
Abstract
A searching and sorting method for Chinese microblog called Weibo is presented in this paper,based on the vector space model and latent semantic analysis.APIs,provided by the Sina microblogging public platform,are applied to obtain test data.Weibo posts using vector space model as matrix of " ndex -term content" are presented,and then a latent semantic analysis process on this matrix is performed.The relevance between Weibo contents and query was turned into the similarity between the Weibo content vector and query vector,which was calculated by the cosine value between Weibo content vector and inquiring vector decomposed by SVD.The treatment on the Weibo content and query was simplified as the operation for the vectors in the low-dimensional vector space.A sorting list of Weibo posts will be obtained according to their relevance to the query rather than the simple string-matching and post time descending order approach,which is widely used in many microblogging platforms.The experiment results indicate that the approach is able to retrieve the relevant posts in the top-ranked list.关键词
微博/向量空间模型/潜在语义分析/搜索排序Key words
Weibo/ vector space model/ latent aemantic analysis/ search ranking分类
信息技术与安全科学引用本文复制引用
叶施仁,严水歌,杨长春..新浪微博搜索排序方法研究[J].常州大学学报(自然科学版),2013,25(3):71-75,5.基金项目
国家自然科学基金项目(61003163) (61003163)
江苏省科技厅项目(BZ2010021) (BZ2010021)