Effective Data-Aware Covariance Estimator From Compressed Data

被引:3
|
作者
Chen, Xixian [1 ]
Yang, Haiqin [2 ,3 ]
Zhao, Shenglin [1 ]
Lyu, Michael R. [4 ,5 ]
King, Irwin [4 ,5 ]
机构
[1] Tencent, Youtu Lab, Shenzhen 518057, Peoples R China
[2] Meitu, Hong Kong, Peoples R China
[3] Hang Seng Univ Hong Kong, Dept Comp, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
[5] Chinese Univ Hong Kong, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
关键词
Covariance matrices; Sparse matrices; Silicon; Estimation; Distributed databases; Learning systems; Dimensionality reduction; Covariance estimation; dimension reduction; randomized algorithms; unsupervised learning; ALGORITHMS; EXPRESSION; MATRICES;
D O I
10.1109/TNNLS.2019.2929106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Estimating covariance matrix from massive high-dimensional and distributed data is significant for various real-world applications. In this paper, we propose a data-aware weighted sampling-based covariance matrix estimator, namely DACE, which can provide an unbiased covariance matrix estimation and attain more accurate estimation under the same compression ratio. Moreover, we extend our proposed DACE to tackle multiclass classification problems with theoretical justification and conduct extensive experiments on both synthetic and real-world data sets to demonstrate the superior performance of our DACE.
引用
收藏
页码:2441 / 2454
页数:14
相关论文
共 50 条
  • [1] Compressed data structures: Dictionaries and data-aware measures
    Gupta, Ankur
    Hon, Wing-Kai
    Shah, Rahul
    Vitter, Jeffrey Scott
    DCC 2006: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2006, : 213 - +
  • [2] Compressed data structures: Dictionaries and data-aware measures
    Gupta, Ankur
    Hon, Wing-Kai
    Shah, Rahul
    Vitter, Jeffrey Scott
    THEORETICAL COMPUTER SCIENCE, 2007, 387 (03) : 313 - 331
  • [3] Compressed String Dictionaries via Data-Aware Subtrie Compaction
    Boffa, Antonio
    Ferragina, Paolo
    Vinciguerra, Giorgio
    Tosoni, Francesco
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2022, 2022, 13617 : 233 - 249
  • [4] Data-aware multicast
    Baehni, S
    Eugster, PT
    Guerraoui, R
    2004 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2004, : 233 - 242
  • [5] POSH: A Data-Aware Shell
    Raghavan, Deepti
    Fouladi, Sadjad
    Levis, Philip
    Zaharia, Matei
    PROCEEDINGS OF THE 2020 USENIX ANNUAL TECHNICAL CONFERENCE, 2020, : 617 - 631
  • [6] A data-aware resource broker for data grids
    Le, H
    Coddington, P
    Wendelborn, AL
    NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2004, 3222 : 73 - 82
  • [7] A Subthreshold SRAM with Embedded Data-Aware Write-Assist and Adaptive Data-Aware Keeper
    Chiu, Yi-Wei
    Hu, Yu-Hao
    Zhao, Jun-Kai
    Jou, Shyh-Jye
    Chuang, Ching-Te
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 1014 - 1017
  • [8] Modeling Data Transformations in Data-Aware Service Choreographies
    Hahn, Michael
    Breitenbuecher, Uwe
    Leymann, Frank
    Wurster, Michael
    Yussupov, Vladimir
    2018 IEEE 22ND INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE (EDOC 2018), 2018, : 28 - 34
  • [9] Supporting data-aware processes with MERODE
    Snoeck, Monique
    Verbruggen, Charlotte
    De Smedt, Johannes
    De Weerdt, Jochen
    SOFTWARE AND SYSTEMS MODELING, 2023, 22 (06): : 1779 - 1802
  • [10] Data-Aware Compression of Neural Networks
    Falahati, Hajar
    Peyro, Masoud
    Amini, Hossein
    Taghian, Mehran
    Sadrosadati, Mohammad
    Lotfi-Kamran, Pejman
    Sarbazi-Azad, Hamid
    IEEE COMPUTER ARCHITECTURE LETTERS, 2021, 20 (02) : 94 - 97