| 注册
首页|期刊导航|计算机技术与发展|融合知识图谱和ESA方法的网络新词识别

融合知识图谱和ESA方法的网络新词识别

刘申凯 周霁婷 朱永华 高洪皓

计算机技术与发展2019,Vol.29Issue(3):12-17,6.
计算机技术与发展2019,Vol.29Issue(3):12-17,6.DOI:10.3969/j.issn.1673-629X.2019.03.003

融合知识图谱和ESA方法的网络新词识别

Network New Word Recognition Based on Fusion of Knowledge Graph and ESA

刘申凯 1周霁婷 1朱永华 2高洪皓2

作者信息

  • 1. 上海大学, 上海 200072
  • 2. 上海大学 计算机工程与科学学院, 上海 200444
  • 折叠

摘要

Abstract

With the rapid development of the Internet, the use of Weibo, WeChat and other text forms is gradually increasing. The analysis and understanding of such texts has posed new challenges in the field of natural language processing, especially in the field of network neologism recognition and semantic understanding. In order to overcome the shortcomings of traditional methods that cannot identify network neologism and their semantics, we propose a new method of network neologism recognition by combining knowledge map and explicit semantic analysis methods, which segments the original text with the coarse-grained phrase to preserve the logical relationship between the words. After using the semantic expression phrase of the Baidu knowledge map Schema, the ESA method is used to gradually decompose the remaining texts and extract the phrase encyclopedia information into the core semantic vocabulary, supplementing the unrecognized part of the Schema. Experiment shows that compared with the existing neologism recognition algorithms, the proposed algorithm requires only a small amount of corpus, which reduces the cost of manual rules formulation and improves the recognition of network neologism and the accuracy of word comprehension.

关键词

语义识别/语义相关度/新词识别/知识图谱/显性语义分析

Key words

semantic recognition/semantic relevance/neologism recognition/knowledge graph/explicit semantic analysis

分类

信息技术与安全科学

引用本文复制引用

刘申凯,周霁婷,朱永华,高洪皓..融合知识图谱和ESA方法的网络新词识别[J].计算机技术与发展,2019,29(3):12-17,6.

基金项目

国家重点研发计划专项课题(2017YFD0400101) (2017YFD0400101)

计算机技术与发展

OACSTPCD

1673-629X

访问量3
|
下载量0
段落导航相关论文