| 注册
首页|期刊导航|计算机科学与探索|基于代码模式的软件问答文档检索优化方法

基于代码模式的软件问答文档检索优化方法

华晨彦 邹艳珍 朱子骁 谢冰

计算机科学与探索2017,Vol.11Issue(10):1591-1598,8.
计算机科学与探索2017,Vol.11Issue(10):1591-1598,8.DOI:10.3778/j.issn.1673-9418.1609028

基于代码模式的软件问答文档检索优化方法

Refine Software Q&A Document Search Results Based on Code Pattern

华晨彦 1邹艳珍 2朱子骁 3谢冰1

作者信息

  • 1. 北京大学 信息科学技术学院,北京 100871
  • 2. 高可信软件技术教育部重点实验室,北京 100871
  • 3. 北京大学(天津滨海)新一代信息技术研究院,天津 300450
  • 折叠

摘要

Abstract

Developers often need to search related software Q&A documents in Q&A website. In the search results, the Q&A documents which contain good code snippets (usage examples) are preferred. However, how to metric those code snippets in document is still a big challenge. To address this issue, this paper proposes an approach for refining software Q&A document search results based on code pattern. Firstly, code snippets are extracted from each document in the search results. Then, the common code patterns are mined and used to measure the quality of those code snippets. Finally, the documents with high quality are recommended and ranked at the top of the search results. In the experiments, this paper carries out some evaluations with 10 real problems that software developers meet in practice. Compared to the search results of StackOverflow, the proposed approach has an increment of 40% at NDCG@5.

关键词

代码模式/软件问答文档/文档检索

Key words

code pattern/software Q&A document/document search

分类

信息技术与安全科学

引用本文复制引用

华晨彦,邹艳珍,朱子骁,谢冰..基于代码模式的软件问答文档检索优化方法[J].计算机科学与探索,2017,11(10):1591-1598,8.

基金项目

The National Key Research and Development Program of China under Grant No. 2016YFB1000804 (国家重点研发计划) (国家重点研发计划)

the Na- tional Science Fund for Distinguished Young Scholars of China under Grant No. 61525201 (国家杰出青年科学基金). (国家杰出青年科学基金)

计算机科学与探索

OA北大核心CSCDCSTPCD

1673-9418

访问量0
|
下载量0
段落导航相关论文