Scaling Data from Multiple Sources

被引:0
|
作者
Enamorado, Ted [1 ]
Lopez-Moctezuma, Gabriel [2 ]
Ratkovic, Marc [3 ]
机构
[1] Washington Univ, Dept Polit Sci, St Louis, MO 63130 USA
[2] CALTECH, Div Humanities & Social Sci, Pasadena, CA 91125 USA
[3] Princeton Univ, Dept Polit, Princeton, NJ 08544 USA
关键词
multidimensional scaling; principal component analysis; U; S; Senate; BAYESIAN FACTOR-ANALYSIS; MODELS; PREFERENCES; LIKELIHOOD; FRAMEWORK;
D O I
10.1017/pan.2020.24
中图分类号
D0 [政治学、政治理论];
学科分类号
0302 ; 030201 ;
摘要
We introduce a method for scaling two datasets from different sources. The proposed method estimates a latent factor common to both datasets as well as an idiosyncratic factor unique to each. In addition, it offers a flexible modeling strategy that permits the scaled locations to be a function of covariates, and efficient implementation allows for inference through resampling. A simulation study shows that our proposed method improves over existing alternatives in capturing the variation common to both datasets, as well as the latent factors specific to each. We apply our proposed method to vote and speech data from the 112th U.S. Senate. We recover a shared subspace that aligns with a standard ideological dimension running from liberals to conservatives, while recovering the words most associated with each senator's location. In addition, we estimate a word-specific subspace that ranges from national security to budget concerns, and a vote-specific subspace with Tea Party senators on one extreme and senior committee leaders on the other.
引用
收藏
页码:212 / 235
页数:24
相关论文
共 50 条
  • [21] Accessible Routes Integrating Data from Multiple Sources
    Luaces, Miguel R.
    Fisteus, Jesus A.
    Sanchez-Fernandez, Luis
    Munoz-Organero, Mario
    Balado, Jesus
    Diaz-Vilarino, Lucia
    Lorenzo, Henrique
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (01)
  • [22] On the Design of Autonomous Agents From Multiple Data Sources
    Garrabe, Emiland
    Russo, Giovanni
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 698 - 703
  • [23] IoT streaming data integration from multiple sources
    Tu, Doan Quang
    Kayes, A. S. M.
    Rahayu, Wenny
    Nguyen, Kinh
    COMPUTING, 2020, 102 (10) : 2299 - 2329
  • [24] Predicting Student Performance from Multiple Data Sources
    Koprinska, Irena
    Stretton, Joshua
    Yacef, Kalina
    ARTIFICIAL INTELLIGENCE IN EDUCATION, AIED 2015, 2015, 9112 : 678 - 681
  • [25] Mining Credit Interest Rate Data from Multiple Data Sources
    Hryhorkiv, Vasyl
    Buiak, Lesia
    Verstiak, Andrii
    Hryhorkiv, Mariia
    Verstiak, Oksana
    Berdnuk, Andrii
    2019 9TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER INFORMATION TECHNOLOGIES (ACIT'2019), 2019, : 265 - 268
  • [26] Scaling access to heterogeneous data sources with disco
    Tomasic, A
    Raschid, L
    Valduriez, P
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (05) : 808 - 823
  • [27] SOURCES OF UNCERTAINTY IN THE RELATIVE SCALING OF SPECTROSCOPIC DATA
    BARIBAUD, T
    SALAMANCA, I
    ALLOIN, D
    WAGNER, S
    ASTRONOMY & ASTROPHYSICS SUPPLEMENT SERIES, 1994, 103 (01): : 121 - 128
  • [28] Scaling limits for internal aggregation models with multiple sources
    Lionel Levine
    Yuval Peres
    Journal d'Analyse Mathématique, 2010, 111 : 151 - 219
  • [29] SCALING LIMITS FOR INTERNAL AGGREGATION MODELS WITH MULTIPLE SOURCES
    Levine, Lionel
    Peres, Yuval
    JOURNAL D ANALYSE MATHEMATIQUE, 2010, 111 : 151 - 219
  • [30] Learning from Multiple Sources for Data-to-Text and Text-to-Data
    Duong, Song
    Lumbreras, Alberto
    Gartrell, Mike
    Gallinari, Patrick
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206