The Efficiency of Question-Asking Strategies in a Real-World Visual Search Task

被引：0

作者：

Testoni, Alberto ^{[1
,7
]}

Bernardi, Raffaella ^{[2
,3
]}

Ruggeri, Azzurra ^{[4
,5
,6
]}

机构：

[1] Univ Amsterdam, Inst Log Language & Computat ILLC, Amsterdam, Netherlands

[2] Univ Trento, Ctr Mind Brain Sci CIMeC, Trento, Italy

[3] Univ Trento, Dept Informat Engn & Comp Sci DISI, Trento, Italy

[4] Max Planck Inst Human Dev, MPRG ISearch, Berlin, Germany

[5] Tech Univ Munich, Sch Social Sci & Technol, Munich, Germany

[6] Cent European Univ, Dept Cognit Sci, Vienna, Austria

[7] Univ Amsterdam, Inst Log Language & Computat ILLC, Sci Pk 107, NL-1098 XG Amsterdam, Netherlands

来源：

COGNITIVE SCIENCE | 2023年 / 47卷 / 12期

基金：

欧洲研究理事会;

关键词：

Visual search; Information search; 20-Questions game; Question asking; Expected information gain; INFORMATION SEARCH; EYE-MOVEMENTS; PROBABILITY; EXPERIENCE; SELECTION; PATTERNS;

D O I：

10.1111/cogs.13396

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

In recent years, a multitude of datasets of human-human conversations has been released for the main purpose of training conversational agents based on data-hungry artificial neural networks. In this paper, we argue that datasets of this sort represent a useful and underexplored source to validate, complement, and enhance cognitive studies on human behavior and language use. We present a method that leverages the recent development of powerful computational models to obtain the fine-grained annotation required to apply metrics and techniques from Cognitive Science to large datasets. Previous work in Cognitive Science has investigated the question-asking strategies of human participants by employing different variants of the so-called 20-question-game setting and proposing several evaluation methods. In our work, we focus on GuessWhat, a task proposed within the Computer Vision and Natural Language Processing communities that is similar in structure to the 20-question-game setting. Crucially, the GuessWhat dataset contains tens of thousands of dialogues based on real-world images, making it a suitable setting to investigate the question-asking strategies of human players on a large scale and in a natural setting. Our results demonstrate the effectiveness of computational tools to automatically code how the hypothesis space changes throughout the dialogue in complex visual scenes. On the one hand, we confirm findings from previous work on smaller and more controlled settings. On the other hand, our analyses allow us to highlight the presence of "uninformative" questions (in terms of Expected Information Gain) at specific rounds of the dialogue. We hypothesize that these questions fulfill pragmatic constraints that are exploited by human players to solve visual tasks in complex scenes successfully. Our work illustrates a method that brings together efforts and findings from different disciplines to gain a better understanding of human question-asking strategies on large-scale datasets, while at the same time posing new questions about the development of conversational systems.

引用

页数：29

共 50 条

[31] Neural Evidence for Distracter Suppression during Visual Search in Real-World Scenes
Seidl, Katharina N.
Peelen, Marius V.
Kastner, Sabine
JOURNAL OF NEUROSCIENCE, 2012, 32 (34): : 11812 - 11819
[32] The Effect of Teacher Feedback to Students' Question-asking in Large-sized Engineering Classes: A Perspective of Instructional Effectiveness and Efficiency
Jin, Sung-Hee
Shin, Soobong
ASIA-PACIFIC EDUCATION RESEARCHER, 2012, 21 (03): : 497 - 506
[33] Asking Questions Like Educational Experts: Automatically Generating Question-Answer Pairs on Real-World Examination Data
Qu, Fanyi
Jia, Xin
Wu, Yunfang
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2583 - 2593
[34] The dark side of visual awareness in sport: Inattentional blindness in a real-world basketball task
Philip Furley
Daniel Memmert
Christian Heller
Attention, Perception, & Psychophysics, 2010, 72 : 1327 - 1337
[35] The dark side of visual awareness in sport: Inattentional blindness in a real-world basketball task
Furley, Philip
Memmert, Daniel
Heller, Christian
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (05) : 1327 - 1337
[36] PHYSICISTS AND POLITICS - STRATEGIES FOR THE REAL-WORLD
HAMMER, B
PHYSICS TODAY, 1995, 48 (05) : 63 - 64
[37] VERTICAL RESTRAINTS, EFFICIENCY, AND THE REAL-WORLD
BURNS, JW
FORDHAM LAW REVIEW, 1993, 62 (03) : 597 - 651
[38] Real-World Task Context: Meanings and Roles
Brown, Jill P.
LINES OF INQUIRY IN MATHEMATICAL MODELLING RESEARCH IN EDUCATION, 2019, : 53 - 81
[39] Can templates-for-rejection suppress real-world affective objects in visual search?
Brown, Chris R. H.
Derakshan, Nazanin
PSYCHONOMIC BULLETIN & REVIEW, 2024, 31 (04) : 1843 - 1855
[40] Not So Fast: Autistic traits and Anxious Apprehension in Real-World Visual Search Scenarios
N. C. C. Russell
S. G. Luke
R. A. Lundwall
M. South
Journal of Autism and Developmental Disorders, 2019, 49 : 1795 - 1806

← 1 2 3 4 5 →