Towards provably efficient quantum algorithms for large-scale machine-learning models

被引:17
|
作者
Liu, Junyu [1 ,2 ,3 ,4 ,5 ,6 ]
Liu, Minzhao [7 ,8 ]
Liu, Jin-Peng [9 ,10 ,11 ]
Ye, Ziyu [2 ]
Wang, Yunfei [12 ]
Alexeev, Yuri [2 ,3 ,8 ]
Eisert, Jens [13 ]
Jiang, Liang [1 ,3 ]
机构
[1] Univ Chicago, Pritzker Sch Mol Engn, Chicago, IL 60637 USA
[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[3] Chicago Quantum Exchange, Chicago, IL 60637 USA
[4] Univ Chicago, Kadanoff Ctr Theoret Phys, Chicago, IL 60637 USA
[5] qBraid Co, Chicago, IL 60615 USA
[6] SeQure, Chicago, IL 60615 USA
[7] Univ Chicago, Dept Phys, Chicago, IL 60637 USA
[8] Argonne Natl Lab, Computat Sci Div, Lemont, IL 60439 USA
[9] Univ Calif Berkeley, Simons Inst Theory Comp, Berkeley, CA 94720 USA
[10] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[11] MIT, Ctr Theoret Phys, Cambridge, MA 02139 USA
[12] Brandeis Univ, Martin A Fisher Sch Phys, Waltham, MA 02453 USA
[13] Free Univ Berlin, Dahlem Ctr Complex Quantum Syst, D-14195 Berlin, Germany
关键词
D O I
10.1038/s41467-023-43957-x
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Large machine learning models are revolutionary technologies of artificial intelligence whose bottlenecks include huge computational expenses, power, and time used both in the pre-training and fine-tuning process. In this work, we show that fault-tolerant quantum computing could possibly provide provably efficient resolutions for generic (stochastic) gradient descent algorithms, scaling as O(T-2 x polylog(n)), where n is the size of the models and T is the number of iterations in the training, as long as the models are both sufficiently dissipative and sparse, with small learning rates. Based on earlier efficient quantum algorithms for dissipative differential equations, we find and prove that similar algorithms work for (stochastic) gradient descent, the primary algorithm for machine learning. In practice, we benchmark instances of large machine learning models from 7 million to 103 million parameters. We find that, in the context of sparse training, a quantum enhancement is possible at the early stage of learning after model pruning, motivating a sparse parameter download and re-upload scheme. Our work shows solidly that fault-tolerant quantum algorithms could potentially contribute to most state-of-the-art, large-scale machine-learning problems.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Large-scale machine-learning molecular dynamics simulation of primary radiation damage in tungsten
    Liu, Jiahui
    Byggmaestar, Jesper
    Fan, Zheyong
    Qian, Ping
    Su, Yanjing
    PHYSICAL REVIEW B, 2023, 108 (05)
  • [22] Large-Scale Atomislic Simulations of Cleavage in BCC Fe using Machine-Learning Potential
    Suzudo T.
    Ebihara K.-I.
    Tsuru T.
    Mori H.
    Zairyo/Journal of the Society of Materials Science, Japan, 2024, 73 (02) : 129 - 135
  • [23] Validating large-scale quantum machine learning: efficient simulation of quantum support vector machines using tensor networks
    Chen, Kuan-Cheng
    Li, Tai-Yue
    Wang, Yun-Yuan
    See, Simon
    Wang, Chun-Chieh
    Wille, Robert
    Chen, Nan-Yow
    Yang, An-Cheng
    Lin, Chun-Yu
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2025, 6 (01):
  • [24] Provably efficient machine learning for quantum many-body problems
    Huang, Hsin-Yuan
    Kueng, Richard
    Torlai, Giacomo
    Albert, Victor V.
    Preskill, John
    SCIENCE, 2022, 377 (6613) : 1397 - +
  • [25] A Survey on Large-Scale Machine Learning
    Wang, Meng
    Fu, Weijie
    He, Xiangnan
    Hao, Shijie
    Wu, Xindong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (06) : 2574 - 2594
  • [26] Large-scale comparison of machine learning algorithms for target prediction of natural products
    Liang, Lu
    Liu, Ye
    Kang, Bo
    Wang, Ru
    Sun, Meng-Yu
    Wu, Qi
    Meng, Xiang-Fei
    Lin, Jian-Ping
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [27] Efficient algorithms for dualizing large-scale hypergraphs
    Murakami, Keisuke
    Uno, Takeaki
    DISCRETE APPLIED MATHEMATICS, 2014, 170 : 83 - 94
  • [28] Efficient algorithms for large-scale asteroid discovery
    Kubica, Jeremy
    Denneau, Larry, Jr.
    Moore, Andrew
    Jedicke, Robert
    Connolly, Andrew
    ASTRONOMICAL DATA ANALYSIS SOFTWARE AND SYSTEMS XVI, 2007, 376 : 395 - +
  • [29] Efficient algorithms for large-scale temporal aggregation
    Moon, B
    Fernando, I
    Lopez, V
    Immanuel, V
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (03) : 744 - 759
  • [30] Efficient mining algorithms for large-scale graphs
    Kishimoto, Yasunari
    Shiokawa, Hiroaki
    Fujiwara, Yasuhiro
    Onizuka, Makoto
    NTT Technical Review, 2013, 11 (12):