| 注册
首页|期刊导航|重庆文理学院学报:自然科学版|基于信息增益的LDA模型的短文本分类

基于信息增益的LDA模型的短文本分类

沈竞

重庆文理学院学报:自然科学版2011,Vol.30Issue(6):64-66,3.
重庆文理学院学报:自然科学版2011,Vol.30Issue(6):64-66,3.

基于信息增益的LDA模型的短文本分类

The classification of LDA model essay based on information gain

沈竞1

作者信息

  • 1. 解放军后勤工程学院图书馆,重庆沙坪坝401311
  • 折叠

摘要

Abstract

In this paper the classification of short essay was improved based on LDA.The information gain of the essay with LDA classification method was put forward.Using the information gain calculation to calculate the text classification vocabulary contribution,to improve "function word" weight,and to filter out "the function word",at last the passage of the filtered was in the LDA theme modeling,and the center vector method was used to establish the text category model.The experimental results prove that with the reducing of function word ratio,classification performance is distinctly improved in the method.

关键词

信息增益/LDA模型/文本分类

Key words

information gain/LDA model/text classification

分类

信息技术与安全科学

引用本文复制引用

沈竞..基于信息增益的LDA模型的短文本分类[J].重庆文理学院学报:自然科学版,2011,30(6):64-66,3.

重庆文理学院学报:自然科学版

OACHSSCD

1673-8012

访问量0
|
下载量0
段落导航相关论文