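Journal of Chongqing University of Arts and Sciences (Natural Science Edition), 2011, Vol. 30, Issue (6): 64-66, 3.
Short Text Classification with an LDA Model Based on Information Gain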
The classification of LDA model essay based on information gain
Shen Jing (沈竞) 1
Author information
- 1. Library of Logistical Engineering University of PLA, Shapingba, Chongqing 401311
Abstract
This paper improves short text classification based on LDA by proposing an LDA classification method that uses information gain. Information gain is used to calculate each vocabulary term's contribution to text classification, to raise the weight of "function words", and to filter out "the function word"; the filtered text is then modeled with LDA topic modeling, and the center vector method is used to establish the text category model. The experimental results show that, as the proportion of function words is reduced, the classification performance of the method improves distinctly.
Keywords
information gain / LDA model / text classification
Classification
Information technology and security science
Cite this article
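Shen Jing. Short text classification with an LDA model based on information gain [J]. Journal of Chongqing University of Arts and Sciences (Natural Science Edition), 2011, 30(6): 64-66, 3.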
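
The information-gain filtering step described in the abstract can be illustrated with a minimal Python sketch. This is not the author's implementation: the abstract gives neither the exact weighting scheme nor a threshold, so the document-presence formulation of information gain, the `threshold` parameter, the rule of dropping low-gain terms, and the toy corpus below are all assumptions made for illustration. The LDA topic modeling and center vector steps are omitted; they would follow on the filtered documents.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(docs, labels, term):
    """IG(t) = H(C) - [P(t) H(C|t) + P(not t) H(C|not t)].

    Presence of a term is measured per document (assumed formulation).
    """
    with_t = [y for d, y in zip(docs, labels) if term in d]
    without_t = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = (len(with_t) / n) * entropy(with_t) \
        + (len(without_t) / n) * entropy(without_t)
    return entropy(labels) - conditional


def filter_docs(docs, labels, threshold):
    """Drop every term whose information gain is below `threshold`.

    `threshold` is a hypothetical parameter, not a value from the paper.
    """
    vocab = {t for d in docs for t in d}
    kept = {t for t in vocab if information_gain(docs, labels, t) >= threshold}
    return [[t for t in d if t in kept] for d in docs], kept


# Toy corpus (hypothetical): "the" occurs in every document, so it carries
# no information about the class and is removed by the filter.
docs = [
    ["goal", "match", "the"],
    ["team", "goal", "the"],
    ["stock", "market", "the"],
    ["market", "price", "the"],
]
labels = ["sports", "sports", "finance", "finance"]

filtered, kept = filter_docs(docs, labels, threshold=0.1)
```

In this sketch a term that appears equally often in every class has zero gain, which is why it is the first to be filtered; the filtered token lists would then be passed to an LDA implementation of the reader's choice.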