| 注册
首页|期刊导航|计算机工程与应用|基于句类特征的作者写作风格分类研究

基于句类特征的作者写作风格分类研究

张运良 朱礼军 乔晓东 张全

计算机工程与应用2009,Vol.45Issue(22):129-131,223,4.
计算机工程与应用2009,Vol.45Issue(22):129-131,223,4.DOI:10.3778/j.issn.1002-8331.2009.22.042

基于句类特征的作者写作风格分类研究

Research on text authorship categorization based on sentence category features

张运良 1朱礼军 1乔晓东 1张全2

作者信息

  • 1. 中国科学技术信息研究所,北京100038
  • 2. 中国科学院声学研究所,北京100080
  • 折叠

摘要

Abstract

Tbere is a lot of difference in the composition style of different authors and the difference can be discovered by features of word,sentence pattern,rhetoric etc.In this paper,sentence category features are adopted for text categorization and author recognition.This paper uses sentence category vector space model,sentence category features,mixed sentence categories dimensionality roduction,ite weighting method,KNN algorithm and integration decision method to build an authorship classifier.The performance of the authorship classifier is acceptable and can be improved by bigger knowledge bese,HNC techniques and machine learning algorithm.

关键词

文本分类/作者写作风格/句类/向量空间模型/概念层次网络(HNC)理论/自然语言理解

Key words

text classification/authorship/sentence category/Vector Space Model( VSM )/Hierarchical Network of Concepts( HNC )theory/nature language processing

分类

计算机与自动化

引用本文复制引用

张运良,朱礼军,乔晓东,张全..基于句类特征的作者写作风格分类研究[J].计算机工程与应用,2009,45(22):129-131,223,4.

基金项目

国家重点基础研究发展规划(973)(the National Grand Fundamental Research 973 Program of China under Grant No.2004CB318104):国家"十一五"科技支撑计划项目资助(the National project of Scientific and Technical Supporting Programs Funded by Ministry of Science&Technology of China During the 11th Five-year Plan.No.2006BAH03B03). (973)

计算机工程与应用

OA北大核心CSCDCSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文