Is ChatGPT Good at Search? Investigating Large Language Models as Re-Ranking Agents
Shandong University · Baidu (China) · +1 more institution
Abstract
Large Language Models (LLMs) have demonstrated remarkable zero-shot generalization across various language-related tasks, including search engines. However, existing work utilizes the generative ability of LLMs for Information Retrieval (IR) rather than direct passage ranking. The discrepancy between the pre-training objectives of LLMs and the ranking objective poses another challenge. In this paper, we first investigate generative LLMs such as ChatGPT and GPT-4 for relevance ranking in IR. Surprisingly, our experiments reveal that properly instructed LLMs can deliver competitive, even superior results to state-of-the-art supervised methods on popular IR benchmarks. Furthermore, to address concerns about data…
Citation impact
- FWCI
- 30.48
- Percentile
- 100%
- References
- 40
Authors
8Topics & keywords
- Ranking (information retrieval)
- Computer science
- Benchmark (surveying)
- Language model
- Relevance (law)
- Machine learning
- Artificial intelligence
- Set (abstract data type)
- Quality Education