The Missing Science of Knowledge Curation (Improving incentives for large-scale knowledge curation)

Cited by: 1
Author
Paritosh, Praveen [1]
Affiliation
[1] Google, Mountain View, CA 94043 USA
DOI
10.1145/3184558.3191551
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Dictionaries, encyclopedias, knowledge graphs, annotated corpora, library classification systems and world maps are all examples of human-curated knowledge resources that have been highly valuable to science as well as amortized across multiple large-scale systems in practice. Many of these were started and built even before a crowdsourcing research community existed. While the last decade has seen unprecedented growth in research and practice in building crowdsourcing systems to do increasingly complex tasks at scale, many of these resources are still woefully incomplete, lacking coverage in languages and subject matter domains. Moreover, many knowledge resources needed to fill other semantic gaps for artificial intelligence systems simply don't exist or aren't being built. Why? I argue that we don't have the right incentives, and that in order to improve the incentives, we have some fundamental scientific questions to answer. While building a large knowledge resource, we have little more than intuitions when it comes to estimating the reusability, maintainability, and long-term value of the effort. These make it difficult to fund or manage such projects, often requiring herculean personalities or fortunate businesses. Building or expanding a resource is often not seen as "sexy," which results in lack of resources to answer those questions in any principled manner. These problems begin to outline a new science of curation, making progress on which could help improve the discussion around and funding for building sorely needed knowledge resources.
Pages: 1105-1106
Page count: 2