Acta Electronica Sinica (电子学报), 2016, Vol. 44, Issue 12: 3036-3043, 8. DOI: 10.3969/j.issn.0372-2112.2016.12.032
基于主题模型的(Aspect,Rating)摘要生成方法研究
(Aspect, Rating) Summarization Based on Topic Model
Abstract
This paper proposes a topic model, TMPP (Topic Model based on Phrase Parameter), which extracts aspects and their associated ratings for the entities evaluated in online reviews. TMPP has three characteristics: (1) it assumes that a review is represented as a bag of phrases; (2) it extends the document-topic parameter of standard LDA to a set of (aspect, rating) pairs; and (3) it incorporates prior knowledge. We describe the physical meaning of each TMPP parameter, the generative process of TMPP, and the representation of the prior knowledge. Furthermore, we present the reason for and the advantage of incorporating aspect clusters into TMPP, and give the mechanism for obtaining (aspect, rating) pairs by extracting aspects and their associated ratings from online product reviews. We conduct extensive experiments on a very large real-life dataset from taobao.com and compare TMPP with existing baseline models; the results show that TMPP can produce high-quality (aspect, rating) summaries when each review has an overall rating.
Key words
topic model / (aspect, rating) summarization / bag-of-phrase / topic model based on phrase parameter (TMPP)
Classification
Information Technology and Security Science
Cite this article
吕品, 汪鑫, 罗宜元, 计春雷. 基于主题模型的(Aspect,Rating)摘要生成方法研究[J]. 电子学报, 2016, 44(12): 3036-3043, 8.
Funding
National Natural Science Foundation of China, Young Scientists Fund (No. 61402280); Shanghai Dianji University, Computer Science and Technology Priority Discipline (No. 16YSXK04); Shanghai Dianji University Scientific Research Program ()