Computer Engineering and Applications (计算机工程与应用), 2011, Vol. 47, Issue (12): 137-140, 4. DOI: 10.3778/j.issn.1002-8331.2011.12.039
Polyphone disambiguation based on tree-guided TBL
Abstract
Polyphone disambiguation is the core issue of grapheme-to-phoneme conversion in Mandarin Text-To-Speech (TTS) systems. This paper selects 33 key polyphones and 24 key polyphonic words, which are the most ambiguous and most frequently used, as study objects, and builds a polyphone corpus averaging 5,000 sentences per polyphone. Furthermore, a hybrid algorithm called Tree-Guided Transformation-Based Learning (TGTBL), which combines a decision tree with Transformation-Based error-driven Learning (TBL), is proposed to resolve polyphonic ambiguity. It generates TBL templates automatically, thereby avoiding the manual summarization of templates, which is time-consuming and laborious in conventional TBL. Comparative experiments show that, for the task of polyphone disambiguation, templates generated automatically by the decision tree achieve performance comparable to manually summarized templates, and the average precision of TGTBL reaches 90.36%, significantly higher than that of the decision tree.
Key words
polyphone disambiguation / grapheme-to-phoneme / decision tree / Transformation-Based error-driven Learning (TBL)
Classification
Information technology and security science
Cite this article
Liu Fangzhou (刘方舟), Zhou You (周游). Polyphone disambiguation based on tree-guided TBL [J]. Computer Engineering and Applications, 2011, 47(12): 137-140, 4.
Funding
Hunan Provincial Science and Technology Plan Project (No. 2010FJ4131)
Scientific Research Project of the Hunan Provincial Department of Education (No. 10C0955)
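Illustrative sketch

The abstract describes TGTBL only at a high level, so the following Python sketch is a toy illustration of the general idea rather than the authors' implementation. It assumes that a TBL template is the set of context features tested along a root-to-leaf path of a decision tree, which is one plausible reading of "tree-guided". The feature names `prev` and `next`, the hand-written `TOY_TREE`, the labels `xing2`/`hang2`, and the seven training contexts for the polyphone 行 are all hypothetical. The learner itself is a plain greedy TBL loop: start from the most frequent pronunciation, then repeatedly add the rule with the largest net error reduction.

```python
from collections import Counter

# Hypothetical toy tree: internal nodes test a context feature, leaves are labels.
TOY_TREE = {"feature": "prev", "children": {
    "银": "hang2",
    "^": {"feature": "next", "children": {"业": "hang2", "走": "xing2"}},
}}

def tree_paths(tree, prefix=()):
    """Yield the tuple of feature names tested on each root-to-leaf path."""
    if not isinstance(tree, dict):          # leaf reached
        if prefix:
            yield prefix
        return
    for child in tree["children"].values():
        yield from tree_paths(child, prefix + (tree["feature"],))

def templates_from_tree(tree):
    """Assumed TGTBL step: one template per distinct path, order preserved."""
    return list(dict.fromkeys(tree_paths(tree)))

def apply_rule(rule, sample, label):
    """Rule = (template, values, from_label, to_label)."""
    template, values, from_label, to_label = rule
    if label == from_label and tuple(sample[f] for f in template) == values:
        return to_label
    return label

def learn_tbl(samples, gold, templates):
    """Greedy error-driven learning of an ordered rule list."""
    baseline = Counter(gold).most_common(1)[0][0]
    current = [baseline] * len(samples)
    rules = []
    while True:
        # Instantiate candidate rules from the templates at each current error.
        candidates = {}
        for s, cur, g in zip(samples, current, gold):
            if cur != g:
                for t in templates:
                    candidates[(t, tuple(s[f] for f in t), cur, g)] = None
        best_rule, best_gain = None, 0
        for rule in candidates:
            gain = 0
            for s, cur, g in zip(samples, current, gold):
                new = apply_rule(rule, s, cur)
                if new != cur:
                    gain += (new == g) - (cur == g)   # +1 fixed, -1 broken
            if gain > best_gain:
                best_rule, best_gain = rule, gain
        if best_rule is None:                # no rule reduces the error count
            return baseline, rules
        rules.append(best_rule)
        current = [apply_rule(best_rule, s, c) for s, c in zip(samples, current)]

def predict(sample, baseline, rules):
    label = baseline
    for rule in rules:                       # rules are applied in learned order
        label = apply_rule(rule, sample, label)
    return label

# Hypothetical contexts for 行 ("^" and "$" mark sentence boundaries).
samples = [
    {"prev": "银", "next": "$"}, {"prev": "^", "next": "走"},
    {"prev": "^", "next": "业"}, {"prev": "不", "next": "$"},
    {"prev": "旅", "next": "$"}, {"prev": "银", "next": "$"},
    {"prev": "^", "next": "人"},
]
gold = ["hang2", "xing2", "hang2", "xing2", "xing2", "hang2", "xing2"]

templates = templates_from_tree(TOY_TREE)
baseline, rules = learn_tbl(samples, gold, templates)
for rule in rules:
    print(rule)
```

In a real system the tree would be induced from the corpus rather than written by hand, and the feature set would be far richer than two neighbouring characters; the sketch only shows how tree paths could stand in for manually written templates.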