"In-Context Learning" or: How I learned to stop worrying and love "Applied Information Retrieval"

Cited by: 0
Authors
Parry, Andrew [1 ]
Ganguly, Debasis [1 ]
Chandra, Manish [1 ]
Affiliations
[1] Univ Glasgow, Glasgow, Lanark, Scotland
Keywords
Large Language Models; In-Context Learning; Ranking Models; Query Performance Prediction;
DOI
10.1145/3626772.3657842
CLC Number
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the increasing capabilities of large language models (LLMs), in-context learning (ICL) has evolved as a new paradigm for natural language processing (NLP): instead of fine-tuning the parameters of an LLM on labeled examples specific to a downstream task, a small number of such examples are appended to a prompt instruction to control the decoder's generation process. ICL is thus conceptually similar to a non-parametric approach, such as k-NN, where the prediction for each instance essentially depends on the local topology, i.e., on a localised set of similar instances and their labels (called few-shot examples). This suggests that a test instance in ICL is analogous to a query in IR, and that the similar examples retrieved for ICL from a training set relate to a set of documents retrieved from a collection in IR. While standard unsupervised ranking models can be used to retrieve these few-shot examples from a training set, the effectiveness of the examples can potentially be improved by redefining the notion of relevance specific to its utility for the downstream task, i.e., considering an example to be relevant if including it in the prompt instruction leads to a correct prediction. With this task-specific notion of relevance, it is possible to train a supervised ranking model (e.g., a bi-encoder or cross-encoder), which potentially learns to select the few-shot examples optimally. We believe that recent advances in neural rankers can find a use case in this task of optimally choosing examples for more effective downstream ICL predictions.
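To make the IR analogy in the abstract concrete, here is a minimal sketch of ICL example selection as retrieval: the test instance is treated as a query, and the top-k most similar labeled training instances are appended to the prompt as few-shot examples. A toy bag-of-words cosine similarity stands in for a real ranking model (e.g., BM25 or a dense bi-encoder); all data and function names below are illustrative, not from the paper.

```python
# Sketch: few-shot example selection as retrieval (test instance = query,
# training pool = collection). Toy similarity; not the authors' implementation.
from collections import Counter
import math

def bow(text: str) -> Counter:
    """Bag-of-words vector for a text."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Labelled training pool: the "collection" in the IR analogy.
train = [
    ("the film was a delight", "positive"),
    ("utterly boring plot", "negative"),
    ("a joyless, tedious watch", "negative"),
    ("great performances throughout", "positive"),
]

def select_examples(query: str, k: int = 2):
    """Retrieve the top-k training examples most similar to the test instance."""
    q = bow(query)
    return sorted(train, key=lambda ex: cosine(q, bow(ex[0])), reverse=True)[:k]

def build_prompt(test_input: str, k: int = 2) -> str:
    """Append the retrieved few-shot examples to a prompt instruction."""
    shots = "\n".join(f"Review: {x}\nSentiment: {y}"
                      for x, y in select_examples(test_input, k))
    return (f"Classify the sentiment of the review.\n{shots}\n"
            f"Review: {test_input}\nSentiment:")

print(build_prompt("a tedious and boring film"))
```

Swapping the toy similarity for an unsupervised ranker such as BM25, or a dense bi-encoder, changes only `select_examples`; the prompt construction stays the same.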
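The task-specific notion of relevance can be sketched the same way: an example is labelled relevant for a given training instance if including it in the prompt leads to a correct prediction. The resulting (query, example, relevance) triples could then train a supervised ranker such as a bi-encoder or cross-encoder. In the sketch below, `llm_predict` is a hypothetical callable standing in for an LLM inference call; it is not part of any real API.

```python
# Sketch: utility-based relevance labels for training an example ranker.
# llm_predict(prompt) -> str is a hypothetical LLM call, assumed here.

def utility_labels(llm_predict, instance, gold_label, candidates):
    """Return ((query, example), relevance) pairs for one training instance."""
    triples = []
    for ex_text, ex_label in candidates:
        prompt = (f"Classify the sentiment of the review.\n"
                  f"Review: {ex_text}\nSentiment: {ex_label}\n"
                  f"Review: {instance}\nSentiment:")
        prediction = llm_predict(prompt)
        # Relevant (1) iff the one-shot prompt yields the correct answer.
        triples.append(((instance, ex_text), int(prediction == gold_label)))
    return triples
```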
Pages: 14-25
Page count: 12