Journal of Data Acquisition and Processing (数据采集与处理), 2017, Vol. 32, Issue 3: 523-532 (10 pages). DOI: 10.16337/j.1004-9037.2017.03.011
基于滑动窗口的微博时间线摘要算法
Microblog Timeline Summarization Algorithm Based on Sliding Window
Abstract
Timeline summarization is the natural language processing task of summarizing a topic and its development over time. Existing algorithms mostly generate summaries of long texts such as news articles, and seldom address timeline summaries of short texts such as microblogs. Here, we propose a microblog timeline summarization algorithm based on sliding window (MTSW), which simultaneously incorporates content coverage, temporal distribution, and influence to evaluate candidate timeline summaries. In the algorithm, representative terms are selected according to term intensity and entropy to represent microblog features. We build a comprehensive indicator for evaluating a timeline summary that combines the three criteria above (content coverage, temporal distribution, and influence), and then use a sliding window to generate the microblog timeline summary. Experiments on real-world event datasets verify the effectiveness of the proposed method.
Key words
microblog summary / timeline summary / short text summary / event evolution
Classification
Information Technology and Security Science
Cite this article
XU Wei, ZHAO Bin, JI Genlin. Microblog timeline summarization algorithm based on sliding window [J]. Journal of Data Acquisition and Processing, 2017, 32(3): 523-532 (10 pages).
Funding
Supported by the Natural Science Foundation of Jiangsu Higher Education Institutions (13KJB520014).