Scaling Data from Multiple Sources

被引:0
|
作者
Enamorado, Ted [1 ]
Lopez-Moctezuma, Gabriel [2 ]
Ratkovic, Marc [3 ]
机构
[1] Washington Univ, Dept Polit Sci, St Louis, MO 63130 USA
[2] CALTECH, Div Humanities & Social Sci, Pasadena, CA 91125 USA
[3] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
关键词
multidimensional scaling; principal component analysis; U; S; Senate; BAYESIAN FACTOR-ANALYSIS; MODELS; PREFERENCES; LIKELIHOOD; FRAMEWORK;
D O I
10.1017/pan.2020.24
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
We introduce a method for scaling two datasets from different sources. The proposed method estimates a latent factor common to both datasets as well as an idiosyncratic factor unique to each. In addition, it offers a flexible modeling strategy that permits the scaled locations to be a function of covariates, and efficient implementation allows for inference through resampling. A simulation study shows that our proposed method improves over existing alternatives in capturing the variation common to both datasets, as well as the latent factors specific to each. We apply our proposed method to vote and speech data from the 112th U.S. Senate. We recover a shared subspace that aligns with a standard ideological dimension running from liberals to conservatives, while recovering the words most associated with each senator's location. In addition, we estimate a word-specific subspace that ranges from national security to budget concerns, and a vote-specific subspace with Tea Party senators on one extreme and senior committee leaders on the other.
引用
收藏
页码:212 / 235
页数:24
相关论文
共 50 条
  • [31] Reliability Growth Projections Based on Data from Multiple Data Sources and Environments
    Crow, Larry H.
    2019 ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS 2019) - R & M IN THE SECOND MACHINE AGE - THE CHALLENGE OF CYBER PHYSICAL SYSTEMS, 2019,
  • [32] Increasing Users' Confidence in Uncertain Data by Aggregating Data from Multiple Sources
    Greis, Miriam
    Avci, Emre
    Schmidt, Albrecht
    Machulla, Tonja
    PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17), 2017, : 828 - 840
  • [33] Error in geometric morphometric data collection: Combining data from multiple sources
    Robinson, Chris
    Terhune, Claire E.
    AMERICAN JOURNAL OF PHYSICAL ANTHROPOLOGY, 2017, 164 (01) : 62 - 75
  • [34] Inferring Feature Relations of Biometric Data From Multiple Sources
    Anil, K. R.
    Raj, Gladston S.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 867 - 873
  • [35] Fusion and inference from multiple data sources in a commensurate space
    Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD, United States
    不详
    Stat. Anal. Data Min., 2012, 3 (187-193):
  • [36] Conceptual Framework for entity integration from multiple data sources
    Orescanin, Drazen
    Tan, Ran
    Ao, Jing
    2019 42ND INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2019, : 1232 - 1237
  • [37] RECONCILING CONTINUOUS ATTRIBUTE VALUES FROM MULTIPLE DATA SOURCES
    Jiang Zhengrui
    12TH PACIFIC ASIA CONFERENCE ON INFORMATION SYSTEMS (PACIS 2008), 2008, : 1548 - 1555
  • [38] Integrated analysis of spatial data from multiple sources: an overview
    Gong, P.
    Canadian Journal of Remote Sensing, 1994, 20 (04) : 349 - 359
  • [39] On Visualizing Heterogeneous Semantic Networks from Multiple Data Sources
    Maureen
    Sun, Aixin
    Lim, Ee-Peng
    Datta, Anwitaman
    Chang, Kuiyu
    DIGITAL LIBRARIES: UNIVERSAL AND UBIQUITOUS ACCESS TO INFORMATION, PROCEEDINGS, 2008, 5362 : 266 - +
  • [40] CHLAMYDIA TRENDS IN THE USA: RESULTS FROM MULTIPLE DATA SOURCES
    Satterwhite, C.
    Weinstock, H.
    Datta, D.
    SEXUALLY TRANSMITTED INFECTIONS, 2011, 87 : A21 - A21