首页|期刊导航|计算机应用与软件|插入语分类抽取研究方法探讨

插入语分类抽取研究方法探讨

周宥良狄萍贡正仙周国栋

计算机应用与软件2011，Vol.28Issue(4)：33-36,4.

插入语分类抽取研究方法探讨

PROBING RESEARCH METHOD OF CLASSIFIED EXTRACTION OF PARENTHESES

周宥良 ¹狄萍 ¹贡正仙 ¹周国栋¹

作者信息

1. 苏州大学计算机科学与技术学院,江苏,苏州,215006
折叠

摘要

Abstract

The effect of translations of named entities and terms on many application systems such as NLP and machine translation attracts more and more attention.However, these translations are hard to attain sufficient information from current bilingual dictionary.In this paper we propose a method to automatically acquire high quality phrase translation pairs of the named entities from web corpora, and explore for the first time the automatic complementary way for the lost part of the bilingual corpora.The method utilises the features of bilingual translation pairs in web pages, uses a statistical discriminative model and combines with multiple recognising features to automatically mine ternary bilingual phrases translation pairs in web stations.Experimental results show that the use of the model can effectively deal with bilingual translation pairs of the named entities with high accuracy of 95.6％.

关键词

自然语言处理/中文信息处理/双语翻译对/命名实体短语/对齐补充

Key words

Natural language processing (NLP)/ Chinese information processing Bilingual translation pairs/ Named entities phrase/Complement of bilingual alignment

引用本文复制引用

周宥良,狄萍,贡正仙,周国栋..插入语分类抽取研究方法探讨[J].计算机应用与软件,2011,28(4):33-36,4.

基金项目

国家自然科学基金资助项目(60673041). （60673041）

计算机应用与软件

OA北大核心CSCDCSTPCD

ISSN：1000-386X

访问量0

下载量0

段落导航