Towards provably efficient quantum algorithms for large-scale machine-learning models

被引：17

作者：

Liu, Junyu ^{[1
,2
,3
,4
,5
,6
]}

Liu, Minzhao ^{[7
,8
]}

Liu, Jin-Peng ^{[9
,10
,11
]}

Ye, Ziyu ^{[2
]}

Wang, Yunfei ^{[12
]}

Alexeev, Yuri ^{[2
,3
,8
]}

Eisert, Jens ^{[13
]}

Jiang, Liang ^{[1
,3
]}

机构：

[1] Univ Chicago, Pritzker Sch Mol Engn, Chicago, IL 60637 USA

[2] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA

[3] Chicago Quantum Exchange, Chicago, IL 60637 USA

[4] Univ Chicago, Kadanoff Ctr Theoret Phys, Chicago, IL 60637 USA

[5] qBraid Co, Chicago, IL 60615 USA

[6] SeQure, Chicago, IL 60615 USA

[7] Univ Chicago, Dept Phys, Chicago, IL 60637 USA

[8] Argonne Natl Lab, Computat Sci Div, Lemont, IL 60439 USA

[9] Univ Calif Berkeley, Simons Inst Theory Comp, Berkeley, CA 94720 USA

[10] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA

[11] MIT, Ctr Theoret Phys, Cambridge, MA 02139 USA

[12] Brandeis Univ, Martin A Fisher Sch Phys, Waltham, MA 02453 USA

[13] Free Univ Berlin, Dahlem Ctr Complex Quantum Syst, D-14195 Berlin, Germany

来源：

NATURE COMMUNICATIONS | 2024年 / 15卷 / 01期

关键词：

D O I：

10.1038/s41467-023-43957-x

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Large machine learning models are revolutionary technologies of artificial intelligence whose bottlenecks include huge computational expenses, power, and time used both in the pre-training and fine-tuning process. In this work, we show that fault-tolerant quantum computing could possibly provide provably efficient resolutions for generic (stochastic) gradient descent algorithms, scaling as O(T-2 x polylog(n)), where n is the size of the models and T is the number of iterations in the training, as long as the models are both sufficiently dissipative and sparse, with small learning rates. Based on earlier efficient quantum algorithms for dissipative differential equations, we find and prove that similar algorithms work for (stochastic) gradient descent, the primary algorithm for machine learning. In practice, we benchmark instances of large machine learning models from 7 million to 103 million parameters. We find that, in the context of sparse training, a quantum enhancement is possible at the early stage of learning after model pruning, motivating a sparse parameter download and re-upload scheme. Our work shows solidly that fault-tolerant quantum algorithms could potentially contribute to most state-of-the-art, large-scale machine-learning problems.

引用

页数：6

共 50 条

[41] High Per Parameter: A Large-Scale Study of Hyperparameter Tuning for Machine Learning Algorithms
Sipper, Moshe
ALGORITHMS, 2022, 15 (09)
[42] An Efficient Approach to Recognize Hand Gestures Using Machine-Learning Algorithms
Wahid, Md Ferdous
Tafreshi, Reza
Al-Sowaidi, Mubarak
Langari, Reza
2018 IEEE 4TH MIDDLE EAST CONFERENCE ON BIOMEDICAL ENGINEERING (MECBME), 2018, : 171 - 176
[43] Large-scale kernel extreme learning machine
Deng, Wan-Yu
Zheng, Qing-Hua
Chen, Lin
Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (11): : 2235 - 2246
[44] Machine learning for large-scale MOF screening
Coupry, Damien
Groot, Laurens
Addicoat, Matthew
Heine, Thomas
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
[45] Robust Large-Scale Machine Learning in the Cloud
Rendle, Steffen
Fetterly, Dennis
Shekita, Eugene J.
Su, Bor-yiing
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1125 - 1134
[46] Large-scale Machine Learning over Graphs
Yang, Yiming
PROCEEDINGS OF THE 2018 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'18), 2018, : 9 - 9
[47] Large-Scale Machine Learning and Neuroimaging in Psychiatry
Thompson, Paul
BIOLOGICAL PSYCHIATRY, 2018, 83 (09) : S51 - S51
[48] Coding for Large-Scale Distributed Machine Learning
Xiao, Ming
Skoglund, Mikael
ENTROPY, 2022, 24 (09)
[49] Resource Elasticity for Large-Scale Machine Learning
Huang, Botong
Boehm, Matthias
Tian, Yuanyuan
Reinwald, Berthold
Tatikonda, Shirish
Reiss, Frederick R.
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 137 - 152
[50] Efficient Algorithms for Large-Scale Local Triangle Counting
Becchetti, Luca
Boldi, Paolo
Castillo, Carlos
Gionis, Aristides
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)

← 1 2 3 4 5 →