计算机工程2019,Vol.45Issue(3):1-6,6.DOI:10.19678/j.issn.1000-3428.0050119
一种分布式用户浏览点击模型算法
A Distributed User Browse Click Model Algorithm
摘要
Abstract
A distributed User Browse Click Model (UBM) algorithm is proposed to quickly mine user behavior from massive search click logs.The validation parameter E derived from the original UBM algorithm is only related to the ranking position of the search results and the click position of the previous document, and is very stable.Based on this characteristic, the EM iteration solution is transformed into a distributed UBM algorithm which estimates the test degree by sampling to solve the attraction degree.Results of simulation on Spark data platform show that compared with the original UBM algorithm, the proposed algorithm can solve the serious data skew problem in click log, and has higher efficiency.关键词
点击日志/点击模型/用户浏览点击模型算法/搜索引擎/Spark平台Key words
click log/click model/User Browse Click Model (UBM) algorithm/search engine/Spark platform分类
信息技术与安全科学引用本文复制引用
张浩盛伦,李翀,柯勇,张士波..一种分布式用户浏览点击模型算法[J].计算机工程,2019,45(3):1-6,6.基金项目
中国科学院信息化专项"中国科学院信息化评估"(Y647021189). (Y647021189)