| 注册
首页|期刊导航|计算机工程与应用|改进词向量模型的用户画像研究

改进词向量模型的用户画像研究

陈泽宇 黄勃

计算机工程与应用2020,Vol.56Issue(1):180-184,5.
计算机工程与应用2020,Vol.56Issue(1):180-184,5.DOI:10.3778/j.issn.1002-8331.1906-0192

改进词向量模型的用户画像研究

Research on User Portrait of Improved Word Vector Model

陈泽宇 1黄勃1

作者信息

  • 1. 上海工程技术大学 电子电气工程学院,上海 201620
  • 折叠

摘要

Abstract

User portrait technology can bring great commercial value to enterprises. For the user’s historical query words, the word vector can be used to obtain the expression of the query word at the semantic level, but the word vector model that generates the word vector for the same word is the same, which makes the model unable to deal with the polysemy of a word. Therefore, this paper uses the LDA topic model to assign topics to each query word, so that the query word and its topic are put together in the neural network model to learn the topical word vector. Finally, the random forest classification algorithm is used to classify the basic attributes of users and build the user portrait. The experimental results show that the classification accuracy of this model is higher than that of the word vector model.

关键词

用户画像/词向量/LDA主题模型/随机森林

Key words

user portrait/word vector/LDA topic model/random forest

分类

信息技术与安全科学

引用本文复制引用

陈泽宇,黄勃..改进词向量模型的用户画像研究[J].计算机工程与应用,2020,56(1):180-184,5.

基金项目

国家自然科学青年基金(No.61603242) (No.61603242)

江西省经济犯罪侦查与防控技术协同创新中心开放基金(No.JXJZXTCX-030). (No.JXJZXTCX-030)

计算机工程与应用

OA北大核心CSCDCSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文