STRUCTURED QUASI-NEWTON METHODS FOR OPTIMIZATION WITH ORTHOGONALITY CONSTRAINTS
被引:22
|
作者:
Hu, Jiang
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R ChinaPeking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
Hu, Jiang
[1
]
Jiang, Bo
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Normal Univ, Sch Math Sci, Key Lab NSLSCS Jiangsu Prov, Nanjing 210023, Jiangsu, Peoples R ChinaPeking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
Jiang, Bo
[2
]
Lin, Lin
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USAPeking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
Lin, Lin
[3
]
Wen, Zaiwen
论文数: 0引用数: 0
h-index: 0
机构:
Peking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R ChinaPeking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
Wen, Zaiwen
[1
]
Yuan, Ya-Xiang
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Inst Computat Math & Sci Engn Comp, Beijing, Peoples R ChinaPeking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
Yuan, Ya-Xiang
[4
]
机构:
[1] Peking Univ, Beijing Int Ctr Math Res, Beijing, Peoples R China
[2] Nanjing Normal Univ, Sch Math Sci, Key Lab NSLSCS Jiangsu Prov, Nanjing 210023, Jiangsu, Peoples R China
[3] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[4] Chinese Acad Sci, Inst Computat Math & Sci Engn Comp, Beijing, Peoples R China
optimization with orthogonality constraints;
structured quasi-Newton method;
limited-memory Nystrom approximation;
Hartree-Fock total energy minimization;
BFGS METHOD;
CONVERGENCE;
D O I:
10.1137/18M121112X
中图分类号:
O29 [应用数学];
学科分类号:
070104 ;
摘要:
In this paper, we study structured quasi-Newton methods for optimization problems with orthogonality constraints. Note that the Riemannian Hessian of the objective function requires both the Euclidean Hessian and the Euclidean gradient. In particular, we are interested in applications that the Euclidean Hessian itself consists of a computational cheap part and a significantly expensive part. Our basic idea is to keep these parts of lower computational costs but substitute those parts of higher computational costs by the limited-memory quasi-Newton update. More specifically, the part related to the Euclidean gradient and the cheaper parts in the Euclidean Hessian are preserved. The initial quasi-Newton matrix is further constructed from a limited-memory Nystrom approximation to the expensive part. Consequently, our subproblems approximate the original objective function in the Euclidean space and preserve the orthogonality constraints without performing the so-called vector transports. When the subproblems are solved to sufficient accuracy, both global and local q-superlinear convergence can be established under mild conditions. Preliminary numerical experiments on the linear eigenvalue problem and the electronic structure calculation show the effectiveness of our method compared with the state-of-art algorithms.