SYNTHETIC DATA AND THE FUTURE OF AI

被引:0
|
作者
Lee, Peter [1 ,2 ]
机构
[1] Univ Calif Davis, Sch Law, Ctr Innovat Law & Soc, Law, Davis, CA 95616 USA
[2] Univ Calif Davis, Sch Law, Ctr Innovat Law & Soc, Davis, CA 95616 USA
关键词
INTELLECTUAL PROPERTY; TRADE SECRETS; INNOVATION; COPYRIGHT; INDUSTRY; HEALTH; RIGHTS; BIAS; FIRM; LAW;
D O I
暂无
中图分类号
D9 [法律]; DF [法律];
学科分类号
0301 ;
摘要
The future of artificial intelligence (AI) is synthetic. Several of the most prominent technical and legal challenges of AI derivefrom the need to amass huge amounts of real-world data to train machine learning (ML) models. Collecting such real- world data can be highly difficult and can threaten privacy, introduce bias in automated decision making, and infringe copyrights on a massive scale. This Article explores the emergence of a seemingly paradoxical technical creation that can mitigate-though not completely eliminate-these concerns: synthetic data. Increasingly, data scientists are using simulated driving environments, fabricated medical records, fake images, and other forms of synthetic data to train ML models. Artificial data, in other words, is training artificial intelligence. Synthetic data offers a host of technical and legal benefits; it promises to radically decrease the cost of obtaining data, sidestep privacy issues, reduce automated discrimination, and avoid copyright infringement. Alongside such promises, however, synthetic data offers perils as well. Deficiencies in the development and deployment of synthetic data can exacerbate the dangers of AI and cause significant social harm. In light of the enormous value and importance of synthetic data, this Article sketches the contours of an innovation ecosystem to promote its robust and responsible development. It identifies three objectives that should guide legal and policy measures shaping the creation of synthetic data: provisioning, disclosure, and democratization. Ideally, such an ecosystem should incentivize the generation of high-quality synthetic data, encourage disclosure of both synthetic data and processes for generating it, and promote multiple sources of innovation. This Article then examines a suite of "innovation mechanisms" that can advance these objectives, ranging from open source production to proprietary approaches based on patents, trade secrets, and copyrights. Throughout, it suggests policy and doctrinal reforms to enhance innovation, transparency, and democratic access to synthetic data. Just as AI will have enormous implications for law, legal regimes can play a central role in shaping the future of AI.
引用
收藏
页码:1 / 74
页数:74
相关论文
共 50 条
  • [21] The future of AI in dermatology and of dermatology with AI
    Ganascia, J. -G.
    ANNALES DE DERMATOLOGIE ET DE VENEREOLOGIE, 2020, 147 (05): : 331 - 333
  • [22] Synthetic Data Generation System for AI-Based Diabetic Foot Diagnosis
    Hyun J.
    Lee Y.
    Son H.M.
    Lee S.H.
    Pham V.
    Park J.U.
    Chung T.-M.
    SN Computer Science, 2021, 2 (5)
  • [23] Scenarios Engineering for Trustworthy AI: Domain Adaptation Approach for Reidentification With Synthetic Data
    Li, Xuan
    Wang, Xiao
    Deng, Fang
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (11): : 6901 - 6910
  • [24] Explainable AI for Medical Data: Current Methods, Limitations, and Future Directions
    Hossain, Md Imran
    Zamzmi, Ghada
    Mouton, Peter r.
    Salekin, Md Sirajus
    Sun, Yu
    Goldgof, Dmitry
    ACM COMPUTING SURVEYS, 2025, 57 (06)
  • [25] Enabling AI in Future Wireless Networks: A Data Life Cycle Perspective
    Nguyen, Dinh C.
    Cheng, Peng
    Ding, Ming
    Lopez-Perez, David
    Pathirana, Pubudu N.
    Li, Jun
    Seneviratne, Aruna
    Li, Yonghui
    Poor, H. Vincent
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (01): : 553 - 595
  • [26] Synthetic data for face recognition: Current state and future prospects
    Boutros, Fadi
    Struc, Vitomir
    Fierrez, Julian
    Damer, Naser
    IMAGE AND VISION COMPUTING, 2023, 135
  • [27] Synthetic data & the future of Women's Health: A synergistic relationship
    Delanerolle, Gayathri
    Phiri, Peter
    Cavalini, Heitor
    Benfield, David
    Shetty, Ashish
    Bouchareb, Yassine
    Shi, Jian Qing
    Zemkoho, Alain
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 179
  • [28] Deep Air A Smart City AI Synthetic Data Digital Twin Solving the Scalability Data Problems
    Almirall, Esteve
    Callegaro, Davide
    Bruins, Peter
    Santamaria, Mar
    Martinez, Pablo
    Cortes, Ulises
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2022, 356 : 83 - 86
  • [29] AI vs "AI": Synthetic Minds or Speech Acts
    Lewis, Peter R.
    Marsh, Stephen
    Pitt, Jeremy
    IEEE TECHNOLOGY AND SOCIETY MAGAZINE, 2021, 40 (02) : 6 - 13
  • [30] Introducing the future of AI
    Hendler, J
    IEEE INTELLIGENT SYSTEMS, 2006, 21 (03) : 2 - 4