MIXTURE CONDITIONAL REGRESSION WITH ULTRAHIGH

被引:0
|
作者
Shi, Jiaxin [1 ]
Wang, Fang [2 ]
Gao, Yuan [1 ]
Song, Xiaojun [3 ,4 ]
Wang, Hansheng [1 ]
机构
[1] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
[2] Shandong Univ, Data Sci Inst, Jinan, Peoples R China
[3] Peking Univ, Guanghua Sch Management, Beijing, Peoples R China
[4] Peking Univ, Ctr Stat Sci, Beijing, Peoples R China
来源
ANNALS OF APPLIED STATISTICS | 2024年 / 18卷 / 03期
基金
中国国家自然科学基金;
关键词
Key words and phrases. Expectation-maximization algorithm; judicial impartiality; mixture conditional re-; gression; na & iuml; ve Bayes model; ultrahigh dimensional data; MAXIMUM-LIKELIHOOD; EM ALGORITHM; MODEL; SELECTION;
D O I
10.1214/24-AOAS1893
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Testing judicial impartiality is a problem of fundamental importance in empirical legal studies for which standard regression methods have been popularly used to estimate the extralegal factor effects. However, those methods cannot handle control variables with ultrahigh dimensionality, such as those found in judgment documents recorded in text format. To solve this problem, we develop a novel mixture conditional regression (MCR) approach, assuming that the whole sample can be classified into a number of latent classes. Within each latent class, a standard linear regression model can be used to model the relationship between the response and a key feature vector, which is assumed to be of a fixed dimension. Meanwhile, ultrahigh dimensional control variables are then used to determine the latent class membership, where a na & iuml;ve Bayes type model is used to describe the relationship. Hence, the dimension of control variables is allowed to be arbitrarily high. A novel expectation-maximization algorithm is developed for model estimation. Therefore, we are able to estimate the key parameters of interest as efficiently as if the true class membership were known in advance. Simulation studies are presented to demonstrate the proposed MCR method. A real dataset of Chinese burglary offenses is analyzed for illustration purposes.
引用
收藏
页码:2532 / 2550
页数:19
相关论文
共 50 条
  • [1] CONDITIONAL TEST FOR ULTRAHIGH DIMENSIONAL LINEAR REGRESSION COEFFICIENTS
    Guo, Wenwen
    Zhong, Wei
    Duan, Sunpeng
    Cui, Hengjian
    STATISTICA SINICA, 2022, 32 (03) : 1381 - 1409
  • [2] Conditional Linear Regression
    Calderon, Diego
    Juba, Brendan
    Li, Sirui
    Li, Zongyi
    Ruan, Lisa
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 2164 - 2172
  • [3] Conditional Linear Regression
    Calderon, Diego
    Juba, Brendan
    Li, Zongyi
    Ruan, Lisa
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8055 - 8056
  • [4] Conditional logistic regression
    Koletsi, Despina
    Pandis, Nikolaos
    AMERICAN JOURNAL OF ORTHODONTICS AND DENTOFACIAL ORTHOPEDICS, 2017, 151 (06) : 1191 - 1192
  • [5] Conditional Regression Rules
    Kang, Rui
    Song, Shaoxu
    Wang, Chaokun
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2481 - 2493
  • [6] On mixture autoregressive conditional heteroskedasticity
    Cavicchioli, Maddalena
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2018, 197 : 35 - 50
  • [7] ν-flows: Conditional neutrino regression
    Leigh, Matthew
    Raine, John Andrew
    Zoch, Knut
    Golling, Tobias
    SCIPOST PHYSICS, 2023, 14 (06):
  • [8] ESTIMATION OF CONDITIONAL DENSITY AND REGRESSION
    BOSQ, D
    COMPTES RENDUS HEBDOMADAIRES DES SEANCES DE L ACADEMIE DES SCIENCES SERIE A, 1969, 269 (15): : 661 - &
  • [9] Functional mixture regression
    Yao, Fang
    Fu, Yuejiao
    Lee, Thomas C. M.
    BIOSTATISTICS, 2011, 12 (02) : 341 - 353
  • [10] EFFICIENCY OF THE CONDITIONAL SCORE IN A MIXTURE SETTING
    LINDSAY, BG
    ANNALS OF STATISTICS, 1983, 11 (02): : 486 - 497