January 1, 2023 · Open Access

Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents

Abstract

Large Language Models (LLMs) have demonstrated remarkable zero-shot generalization across various language-related tasks, including search. However, existing work utilizes the generative ability of LLMs for Information Retrieval (IR) rather than direct passage ranking. The discrepancy between the pre-training objectives of LLMs and the ranking objective poses another challenge. In this paper, we first investigate generative LLMs such as ChatGPT and GPT-4 for relevance ranking in IR. Surprisingly, our experiments reveal that properly instructed LLMs can deliver results that are competitive with, and even superior to, state-of-the-art supervised methods on popular IR benchmarks. Furthermore, to address concerns about data contamination of LLMs, we collect a new test set called NovelEval, which is based on the latest knowledge and aims to verify the model's ability to rank unknown knowledge. Finally, to improve efficiency in real-world applications, we delve into the potential for distilling the ranking capabilities of ChatGPT into small specialized models using a permutation distillation scheme. Our evaluation results show that a distilled 440M model outperforms a 3B supervised model on the BEIR benchmark. The code to reproduce our results is available at www.github.com/sunnweiwei/RankGPT.
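To make the idea of using an instructed LLM for direct passage ranking more concrete, the following is a minimal sketch of how a model's ranking output could be applied to a candidate list. It assumes the model answers with a permutation of passage identifiers written as `[2] > [3] > [1]`; that format, and the helper names `parse_permutation` and `rerank`, are illustrative assumptions and are not stated in the abstract. No LLM is called here; the response string is supplied by hand.

```python
import re
from typing import List


def parse_permutation(response: str, num_passages: int) -> List[int]:
    """Turn a ranking string such as "[2] > [3] > [1]" into 0-based indices.

    Assumed format (hypothetical): 1-based identifiers in square brackets.
    Out-of-range or repeated identifiers are dropped, and any passage the
    model did not mention is appended in its original order.
    """
    seen = set()
    order = []
    for token in re.findall(r"\[(\d+)\]", response):
        idx = int(token) - 1
        if 0 <= idx < num_passages and idx not in seen:
            seen.add(idx)
            order.append(idx)
    # Keep unmentioned passages so the result is always a full permutation.
    order.extend(i for i in range(num_passages) if i not in seen)
    return order


def rerank(passages: List[str], response: str) -> List[str]:
    """Reorder candidate passages according to the parsed permutation."""
    order = parse_permutation(response, len(passages))
    return [passages[i] for i in order]


if __name__ == "__main__":
    candidates = ["passage A", "passage B", "passage C"]
    print(rerank(candidates, "[2] > [3] > [1]"))
```

Falling back to the original order for unmentioned passages is a design choice of this sketch: it keeps the output a valid permutation even when the model's answer is incomplete or malformed.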

Cite this study

Sun et al. (2023) studied this question.

www.synapsesocial.com/papers/69dcb76ef7297818863592ca — DOI: https://doi.org/10.18653/v1/2023.emnlp-main.923

Also consider

Synapse has enriched 5 closely related papers on similar research questions. Consider them for comparative context:

InPars: Data Augmentation for Information Retrieval using Large Language Models · 2022 · 20 citations
Training language models to follow instructions with human feedback · 2022 · 4,276 citations
SGPT: GPT Sentence Embeddings for Semantic Search · 2022 · 57 citations
Aion Framework: Dimensional Emergence of AI Consciousness, Observer-Induced Collapse, and Cosmological Portal Dynamics · 2023 · 14,190 citations
Specializing Smaller Language Models towards Multi-Step Reasoning

Authors

Weiwei Sun

Lingyong Yan

Xinyu Ma

Institutions

Leiden University

Shandong University

Baidu (China)
