Quantifying the information lost in optimal covariance matrix cleaning

Authors
Bongiorno, Christian [1 ]
Lamrani, Lamia [1 ]
Affiliations
[1] Univ Paris Saclay, Lab Math & Informat Complex & Syst, 9 Rue Joliot Curie, F-91192 Gif Sur Yvette, France
Keywords
Random matrix theory; Covariance matrix estimation; Genetic regressor programming; High-dimension statistics; Information theory; DIVERGENCE;
DOI
10.1016/j.physa.2024.130225
Chinese Library Classification
O4 [Physics];
Subject Classification Code
0702;
Abstract
Obtaining an accurate estimate of the underlying covariance matrix from finite sample data is challenging due to the noise induced by the finite sample size. In recent years, sophisticated covariance-cleaning techniques based on random matrix theory have been proposed to address this issue. Most of these methods aim to achieve an optimal covariance matrix estimator by minimizing the Frobenius norm distance as a measure of the discrepancy between the true covariance matrix and the estimator. However, this practice offers limited interpretability in terms of information theory. To better understand this relationship, we focus on the Kullback-Leibler divergence to quantify the information lost by the estimator. Our analysis centers on rotationally invariant estimators, which are state-of-the-art in random matrix theory, and we derive an analytical expression for their Kullback-Leibler divergence. Due to the intricate nature of the calculations, we use genetic programming regressors paired with human intuition. Ultimately, using this approach, we formulate a conjecture, validated through extensive simulations, showing that the Frobenius distance corresponds to a first-order expansion term of the Kullback-Leibler divergence, thus establishing a better-defined link between the two measures.
Pages: 9
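As a rough illustration of the abstract's central claim, the sketch below (not taken from the paper; the dimension, perturbation size, and variable names such as Sigma and Xi are illustrative assumptions) numerically compares the Gaussian Kullback-Leibler divergence between a true covariance matrix and a nearby estimator with its leading-order term, a precision-weighted squared Frobenius distance that reduces to the plain squared Frobenius distance when the true covariance is the identity.

```python
# Minimal numerical sketch (not the authors' code): for zero-mean Gaussians,
# the Kullback-Leibler divergence between the true covariance Sigma and a
# nearby estimator Xi is, to leading order, a precision-weighted squared
# Frobenius distance.  Dimension, perturbation size, and names are assumptions.
import numpy as np

rng = np.random.default_rng(0)
p = 50

# Random well-conditioned "true" covariance matrix Sigma.
A = rng.standard_normal((p, 4 * p))
Sigma = A @ A.T / (4 * p)

# Small symmetric perturbation; Xi plays the role of a cleaned estimator.
eps = 1e-3
D = rng.standard_normal((p, p))
Xi = Sigma + eps * (D + D.T) / 2

def kl_gaussian(Sigma, Xi):
    """KL( N(0, Sigma) || N(0, Xi) ) for zero-mean Gaussians."""
    M = np.linalg.solve(Xi, Sigma)          # Xi^{-1} Sigma
    _, logdet = np.linalg.slogdet(M)
    return 0.5 * (np.trace(M) - Sigma.shape[0] - logdet)

# Leading-order term: (1/4) * || Sigma^{-1/2} (Xi - Sigma) Sigma^{-1/2} ||_F^2,
# i.e. the plain squared Frobenius distance when Sigma is the identity.
L = np.linalg.cholesky(Sigma)
Linv = np.linalg.inv(L)
W = Linv @ (Xi - Sigma) @ Linv.T
leading_term = 0.25 * np.linalg.norm(W, "fro") ** 2

print(kl_gaussian(Sigma, Xi), leading_term)  # agree up to higher-order terms
```

For a small perturbation the two printed values agree up to higher-order corrections, which is consistent with the abstract's conjecture that the Frobenius distance captures the leading expansion term of the Kullback-Leibler divergence; the paper's precise expansion for rotationally invariant estimators is more specific than this generic check.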