高技术通讯(英文版)2005,Vol.11Issue(4):359-363,5.
Two-stage approach to full Chinese parsing
Two-stage approach to full Chinese parsing
摘要
Abstract
Natural language parsing is a task of great importance and extreme difficulty. In this paper, we present a full Chinese parsing system based on a two-stage approach. Rather than identifying all phrases by a uniform model, we utilize a divide and conquer strategy. We propose an effective and fast method based on Markov model to identify the base phrases. Then we make the first attempt to extend one of the best English parsing models i.e. the head-driven model to recognize Chinese complex phrases. Our two-stage approach is superior to the uniform approach in two aspects. First, it creates synergy between the Markov model and the head-driven model. Second, it reduces the complexity of full Chinese parsing and makes the parsing system space and time efficient. We evaluate our approach in PARSEVAL measures on the open test set, the parsing system performances at 87.53% precision, 87.95% recall.关键词
natural language processing systems/parsing/markov model/pattern recognitionKey words
natural language processing systems/parsing/markov model/pattern recognition分类
信息技术与安全科学引用本文复制引用
Cao Hailong,Zhao Tiejun,Yang Muyun,Li Sheng..Two-stage approach to full Chinese parsing[J].高技术通讯(英文版),2005,11(4):359-363,5.基金项目
Supported by the High Technology Research and Development Program of China (No. 2004AA117010-09) and National Natural Science Foundation of China (No. 60302021). (No. 2004AA117010-09)