计算机科学与探索2017,Vol.11Issue(10):1591-1598,8.DOI:10.3778/j.issn.1673-9418.1609028
基于代码模式的软件问答文档检索优化方法
Refine Software Q&A Document Search Results Based on Code Pattern
摘要
Abstract
Developers often need to search related software Q&A documents in Q&A website. In the search results, the Q&A documents which contain good code snippets (usage examples) are preferred. However, how to metric those code snippets in document is still a big challenge. To address this issue, this paper proposes an approach for refining software Q&A document search results based on code pattern. Firstly, code snippets are extracted from each document in the search results. Then, the common code patterns are mined and used to measure the quality of those code snippets. Finally, the documents with high quality are recommended and ranked at the top of the search results. In the experiments, this paper carries out some evaluations with 10 real problems that software developers meet in practice. Compared to the search results of StackOverflow, the proposed approach has an increment of 40% at NDCG@5.关键词
代码模式/软件问答文档/文档检索Key words
code pattern/software Q&A document/document search分类
信息技术与安全科学引用本文复制引用
华晨彦,邹艳珍,朱子骁,谢冰..基于代码模式的软件问答文档检索优化方法[J].计算机科学与探索,2017,11(10):1591-1598,8.基金项目
The National Key Research and Development Program of China under Grant No. 2016YFB1000804 (国家重点研发计划) (国家重点研发计划)
the Na- tional Science Fund for Distinguished Young Scholars of China under Grant No. 61525201 (国家杰出青年科学基金). (国家杰出青年科学基金)