| 注册
首页|期刊导航|华南理工大学学报(自然科学版)|基于双向文本扩展的信息检索重排方法

基于双向文本扩展的信息检索重排方法

涂新辉 郭聪 宗宇航

华南理工大学学报(自然科学版)2025,Vol.53Issue(9):59-67,9.
华南理工大学学报(自然科学版)2025,Vol.53Issue(9):59-67,9.DOI:10.12141/j.issn.1000-565X.240499

基于双向文本扩展的信息检索重排方法

Information Retrieval Re-Ranking Method Based on Bidirectional Text Expansion

涂新辉 1郭聪 1宗宇航1

作者信息

  • 1. 华中师范大学 计算机学院,湖北 武汉 430079
  • 折叠

摘要

Abstract

With the rapid development of large language models(LLMs),remarkable progress has been made in both text matching and text expansion technologies in information retrieval.As two important methods for enhancing text representation,query expansion and document expansion have been widely applied in modern information retrieval systems.Currently,mainstream text expansion methods primarily rely on large language models.However,there are obvious differences in language diversity and style between the text generated by these models and the text created manually.These differences may affect the accuracy of calculating the query-document relevance,ultimately leading to a decline in the performance of the entire information retrieval system.To address this issue,this paper proposed an information retrieval re-ranking method based on bidirectional text expansion(BTE-IRRM).First,zero-shot prompting was used to enable the large language model to generate pseudo-queries for documents and pseudo-documents for queries.Then,the semantic similarity between these pseudo-queries and pseudo-documents was calculated.Finally,the similarity scores of the original query-document and the semantic similarity scores of the pseudo-query-pseudo-document were weighted and fused to obtain the final document ranking result.To validate the effectiveness of the proposed method,experiments were conducted on two public datasets(DL19 and DL20).Experimental results demonstrate that compared with the existing baseline methods,the BTE-IRRM method has achieved significant improvements in multiple evaluation indicators.Therefore,the bidirectional text expansion method proposed in this paper can further enhance the relevance matching between queries and documents,thereby improving the performance of the entire information retrieval system.

关键词

信息检索/大语言模型/查询扩展/文档扩展

Key words

information retrieval/large language model/query expansion/document expansion

分类

信息技术与安全科学

引用本文复制引用

涂新辉,郭聪,宗宇航..基于双向文本扩展的信息检索重排方法[J].华南理工大学学报(自然科学版),2025,53(9):59-67,9.

基金项目

国家自然科学基金项目(62472192)Supported by the National Natural Science Foundation of China(62472192) (62472192)

华南理工大学学报(自然科学版)

OA北大核心

1000-565X

访问量0
|
下载量0
段落导航相关论文