计算机工程与应用2020,Vol.56Issue(1):180-184,5.DOI:10.3778/j.issn.1002-8331.1906-0192
改进词向量模型的用户画像研究
Research on User Portrait of Improved Word Vector Model
摘要
Abstract
User portrait technology can bring great commercial value to enterprises. For the user’s historical query words, the word vector can be used to obtain the expression of the query word at the semantic level, but the word vector model that generates the word vector for the same word is the same, which makes the model unable to deal with the polysemy of a word. Therefore, this paper uses the LDA topic model to assign topics to each query word, so that the query word and its topic are put together in the neural network model to learn the topical word vector. Finally, the random forest classification algorithm is used to classify the basic attributes of users and build the user portrait. The experimental results show that the classification accuracy of this model is higher than that of the word vector model.关键词
用户画像/词向量/LDA主题模型/随机森林Key words
user portrait/word vector/LDA topic model/random forest分类
信息技术与安全科学引用本文复制引用
陈泽宇,黄勃..改进词向量模型的用户画像研究[J].计算机工程与应用,2020,56(1):180-184,5.基金项目
国家自然科学青年基金(No.61603242) (No.61603242)
江西省经济犯罪侦查与防控技术协同创新中心开放基金(No.JXJZXTCX-030). (No.JXJZXTCX-030)