SYNTHETIC DATA AND THE FUTURE OF AI

被引:0
|
作者
Lee, Peter [1 ,2 ]
机构
[1] Univ Calif Davis, Sch Law, Ctr Innovat Law & Soc, Law, Davis, CA 95616 USA
[2] Univ Calif Davis, Sch Law, Ctr Innovat Law & Soc, Davis, CA 95616 USA
关键词
INTELLECTUAL PROPERTY; TRADE SECRETS; INNOVATION; COPYRIGHT; INDUSTRY; HEALTH; RIGHTS; BIAS; FIRM; LAW;
D O I
暂无
中图分类号
D9 [法律]; DF [法律];
学科分类号
0301 ;
摘要
The future of artificial intelligence (AI) is synthetic. Several of the most prominent technical and legal challenges of AI derivefrom the need to amass huge amounts of real-world data to train machine learning (ML) models. Collecting such real- world data can be highly difficult and can threaten privacy, introduce bias in automated decision making, and infringe copyrights on a massive scale. This Article explores the emergence of a seemingly paradoxical technical creation that can mitigate-though not completely eliminate-these concerns: synthetic data. Increasingly, data scientists are using simulated driving environments, fabricated medical records, fake images, and other forms of synthetic data to train ML models. Artificial data, in other words, is training artificial intelligence. Synthetic data offers a host of technical and legal benefits; it promises to radically decrease the cost of obtaining data, sidestep privacy issues, reduce automated discrimination, and avoid copyright infringement. Alongside such promises, however, synthetic data offers perils as well. Deficiencies in the development and deployment of synthetic data can exacerbate the dangers of AI and cause significant social harm. In light of the enormous value and importance of synthetic data, this Article sketches the contours of an innovation ecosystem to promote its robust and responsible development. It identifies three objectives that should guide legal and policy measures shaping the creation of synthetic data: provisioning, disclosure, and democratization. Ideally, such an ecosystem should incentivize the generation of high-quality synthetic data, encourage disclosure of both synthetic data and processes for generating it, and promote multiple sources of innovation. This Article then examines a suite of "innovation mechanisms" that can advance these objectives, ranging from open source production to proprietary approaches based on patents, trade secrets, and copyrights. Throughout, it suggests policy and doctrinal reforms to enhance innovation, transparency, and democratic access to synthetic data. Just as AI will have enormous implications for law, legal regimes can play a central role in shaping the future of AI.
引用
收藏
页码:1 / 74
页数:74
相关论文
共 50 条
  • [31] Welcome to the AI future?
    Nature Astronomy, 2023, 7 : 1 - 1
  • [32] The history and future of AI
    Russell, Stuart
    OXFORD REVIEW OF ECONOMIC POLICY, 2021, 37 (03) : 509 - 520
  • [33] Prospecting the future with AI
    Miguel Castillo, Jose
    Cortes, Conchi
    Gonzalez, Julian
    Benito, Armando
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2009, 1 (02): : 1 - 5
  • [34] AI and the future of sex
    Herrera, Leo
    MIT TECHNOLOGY REVIEW, 2024, 127 (05): : 32 - 33
  • [35] Introducing the future of AI
    University of Maryland, United States
    IEEE Intell. Syst., 2006, 3 (2-4):
  • [36] Future robotics, AI
    Panda, Keshab
    Control Engineering, 2019, 66 (02):
  • [37] The future of AI is the market
    Ross M. Darrow
    Journal of Revenue and Pricing Management, 2021, 20 : 381 - 386
  • [38] PROLOG AND THE FUTURE OF AI
    KRAFT, L
    COOPER, JR
    DR DOBBS JOURNAL, 1987, 12 (02): : 134 - &
  • [39] The Future of AI: AI's 10 To Watch
    Subrahmanian, V. S.
    IEEE INTELLIGENT SYSTEMS, 2020, 35 (06) : 3 - 6
  • [40] The future of AI - A manifesto
    McCarthy, J
    AI MAGAZINE, 2005, 26 (04) : 39 - 39