Testing Identity of Multidimensional Histograms

被引:0
|
作者
Diakonikolas, Ilias [1 ]
Kane, Daniel M. [2 ]
Peebles, John [3 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Univ Calif San Diego, La Jolla, CA USA
[3] MIT, Cambridge, MA USA
来源
关键词
distribution testing; hypothesis testing; goodness of fit; multivariate histograms; MULTIVARIATE HISTOGRAMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the problem of identity testing for multidimensional histogram distributions. A distribution p : D -> R+, where D subset of R-d, is called a k -histogram if there exists a partition of the domain into k axis-aligned rectangles such that p is constant within each such rectangle. Histograms are one of the most fundamental nonparametric families of distributions and have been extensively studied in computer science and statistics. We give the first identity tester for this problem with sub-learning sample complexity in any fixed dimension and a nearly-matching sample complexity lower bound. In more detail, let q be an unknown d-dimensional k -histogram distribution in fixed dimension d, and p be an explicitly given d-dimensional k -histogram. We want to correctly distinguish, with probability at least 2/3, between the case that p = q versus ||p - q||(1) >= epsilon. We design an algorithm for this hypothesis testing problem with sample complexity O ((root k/epsilon(2))2(d/2) log(2:5d) (k/epsilon)) that runs in sample-polynomial time. Our algorithm is robust to model misspecification, i.e., succeeds even if q is only promised to be close to a k-histogram. Moreover, for k = 2(Omega(d),) we show a sample complexity lower bound of (root k/epsilon(2))center dot Omega(log(k)/d)(d-1) when d >= 2. That is, for any fixed dimension d, our upper and lower bounds are nearly matching. Prior to our work, the sample complexity of the d = 1 case was well-understood, but no algorithm with sub-learning sample complexity was known, even for d = 2. Our new upper and lower bounds have interesting conceptual implications regarding the relation between learning and testing in this setting.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] Continued Refinements of Identity Salience: A Multidimensional Specification
    Markowski, Kelly L.
    Serpe, Richard T.
    SOCIAL PSYCHOLOGY QUARTERLY, 2025,
  • [42] Kalman identity in the synthesis of multidimensional linear systems
    Dubovik S.A.
    Journal of Mathematical Sciences, 2001, 107 (6) : 4352 - 4354
  • [43] ACKNOWLEDGING THE OTHER: A MULTIDIMENSIONAL ANALYSIS OF RACE AND IDENTITY
    Ally, Mashuq
    PHILOSOPHIA-INTERNATIONAL JOURNAL OF PHILOSOPHY, 2011, 40 (02): : 170 - 191
  • [44] THE SEXUAL IDENTITY PROFILE - A MULTIDIMENSIONAL BIPOLAR MODEL
    HOLDEN, JM
    HOLDEN, GS
    INDIVIDUAL PSYCHOLOGY-THE JOURNAL OF ADLERIAN THEORY RESEARCH & PRACTICE, 1995, 51 (02): : 102 - 113
  • [45] Development of the multidimensional inventory of physical activity identity
    Brown, Denver
    Meca, Alan
    Osman, Augustine
    JOURNAL OF SPORT & EXERCISE PSYCHOLOGY, 2023, 45 : S64 - S65
  • [46] The construction of a multidimensional spiritual identity via ICT
    Gross, Zehavit
    EDUCATIONAL MEDIA INTERNATIONAL, 2006, 43 (01) : 51 - 63
  • [47] Structure of the sense of identity in adolescents: Development of the Multidimensional Ego Identity Scale (MEIS)
    Tani, F
    JAPANESE JOURNAL OF EDUCATIONAL PSYCHOLOGY, 2001, 49 (03): : 265 - 273
  • [48] On hypothesis testing in multidimensional outlier detection
    Pokrajac, D
    Milutinovich, J
    Jankovic, D
    TELSIKS 2003: 6TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS IN MODERN SATELLITE, CABLE AND BROADCASTING SERVICE, VOLS 1 AND 2, PROCEEDINGS OF PAPERS, 2003, : 530 - 533
  • [49] META: Multidimensional Evaluation of Testing Ability
    Zhou, Tianqi
    Liu, Jiawei
    Wang, Yifan
    Chen, Zhenyu
    2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2022), 2022, : 139 - 143
  • [50] Racial Identity and Changes in Psychological Distress Using the Multidimensional Model of Racial Identity
    Willis, Henry A.
    Neblett, Enrique W.
    CULTURAL DIVERSITY & ETHNIC MINORITY PSYCHOLOGY, 2020, 26 (04): : 509 - 519