Specification overfitting in artificial intelligence

被引:0
|
作者
Roth, Benjamin [1 ,2 ]
de Araujo, Pedro Henrique Luz [1 ,3 ]
Xia, Yuxi [1 ,3 ]
Kaltenbrunner, Saskia [4 ]
Korab, Christoph [4 ]
机构
[1] Univ Vienna, Fac Comp Sci, Vienna, Austria
[2] Univ Vienna, Fac Philol & Cultural Studies, Vienna, Austria
[3] Univ Vienna, UniVie Doctoral Sch Comp Sci, Vienna, Austria
[4] Univ Vienna, Dept Innovat & Digitalisat Law, Vienna, Austria
关键词
Specification; Overfitting; Fairness; Robustness; Regulation; Artificial intelligence;
D O I
10.1007/s10462-024-11040-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning (ML) and artificial intelligence (AI) approaches are often criticized for their inherent bias and for their lack of control, accountability, and transparency. Consequently, regulatory bodies struggle with containing this technology's potential negative side effects. High-level requirements such as fairness and robustness need to be formalized into concrete specification metrics, imperfect proxies that capture isolated aspects of the underlying requirements. Given possible trade-offs between different metrics and their vulnerability to over-optimization, integrating specification metrics in system development processes is not trivial. This paper defines specification overfitting, a scenario where systems focus excessively on specified metrics to the detriment of high-level requirements and task performance. We present an extensive literature survey to categorize how researchers propose, measure, and optimize specification metrics in several AI fields (e.g., natural language processing, computer vision, reinforcement learning). Using a keyword-based search on papers from major AI conferences and journals between 2018 and mid-2023, we identify and analyze 74 papers that propose or optimize specification metrics. We find that although most papers implicitly address specification overfitting (e.g., by reporting more than one specification metric), they rarely discuss which role specification metrics should play in system development or explicitly define the scope and assumptions behind metric formulations.
引用
收藏
页数:37
相关论文
共 50 条
  • [31] Artificial intelligence
    Conan, Alastair
    TLS-THE TIMES LITERARY SUPPLEMENT, 2023, (6252): : 6 - 6
  • [32] ARTIFICIAL INTELLIGENCE
    Tortajada, Patricia Escribano
    REVISTA DE DERECHO CIVIL, 2023, 10 (02): : 1 - 2
  • [33] Artificial Intelligence?
    Kelly, Rob G.
    ELECTROCHEMICAL SOCIETY INTERFACE, 2024, 33 (02): : 3 - 3
  • [34] Artificial Intelligence
    Pontieri-Lewis, Vittoria
    JOURNAL OF WOUND OSTOMY AND CONTINENCE NURSING, 2024, 51 (06) : 437 - 437
  • [35] Artificial intelligence
    Prosper, HB
    ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH, 2001, 583 : 335 - 337
  • [36] ARTIFICIAL INTELLIGENCE
    Ratnayake D.
    Thomas L.F.
    ITNOW, 2023, 65 (04) : 43
  • [37] Artificial intelligence
    Helmut Malleck
    Gerhard Friedrich
    e & i Elektrotechnik und Informationstechnik, 2005, 122 (7-8) : 225 - 226
  • [38] Artificial intelligence
    Brierley, Rob
    LANCET GASTROENTEROLOGY & HEPATOLOGY, 2018, 3 (01): : 14 - 14
  • [39] Artificial Intelligence
    PALUMBO, S. T. E. F. A. N. O.
    S&F-SCIENZAEFILOSOFIA IT, 2022, (27) : 397 - 403
  • [40] Artificial intelligence
    Rudall, BH
    ROBOTICA, 1996, 14 : 507 - 507