计算机应用与软件2011,Vol.28Issue(4):33-36,4.
插入语分类抽取研究方法探讨
PROBING RESEARCH METHOD OF CLASSIFIED EXTRACTION OF PARENTHESES
摘要
Abstract
The effect of translations of named entities and terms on many application systems such as NLP and machine translation attracts more and more attention.However, these translations are hard to attain sufficient information from current bilingual dictionary.In this paper we propose a method to automatically acquire high quality phrase translation pairs of the named entities from web corpora, and explore for the first time the automatic complementary way for the lost part of the bilingual corpora.The method utilises the features of bilingual translation pairs in web pages, uses a statistical discriminative model and combines with multiple recognising features to automatically mine ternary bilingual phrases translation pairs in web stations.Experimental results show that the use of the model can effectively deal with bilingual translation pairs of the named entities with high accuracy of 95.6%.关键词
自然语言处理/中文信息处理/双语翻译对/命名实体短语/对齐补充Key words
Natural language processing (NLP)/ Chinese information processing Bilingual translation pairs/ Named entities phrase/Complement of bilingual alignment引用本文复制引用
周宥良,狄萍,贡正仙,周国栋..插入语分类抽取研究方法探讨[J].计算机应用与软件,2011,28(4):33-36,4.基金项目
国家自然科学基金资助项目(60673041). (60673041)