Words or numbers? Macroeconomic nowcasting with textual and macroeconomic data

被引:4
|
作者
Zheng, Tingguo [1 ,2 ,3 ]
Fan, Xinyue [2 ]
Jin, Wei [4 ]
Fang, Kuangnan [3 ]
机构
[1] Xiamen Univ, Ctr Macroecon Res, Xiamen, Peoples R China
[2] Xiamen Univ, Wang Yanan Inst Studies Econ, Xiamen, Peoples R China
[3] Xiamen Univ, Sch Econ, Dept Stat & Data Sci, Xiamen, Peoples R China
[4] Penghua Fund Management Co Ltd, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Nowcasting; Textual data; Macroeconomic data; Machine learning; Latent Dirichlet allocation; MIXED-FREQUENCY DATA; MIDAS REGRESSIONS; COINCIDENT INDEX; BIG DATA; SELECTION; SHRINKAGE;
D O I
10.1016/j.ijforecast.2023.05.006
中图分类号
F [经济];
学科分类号
02 ;
摘要
This paper performs the nowcasting of GDP growth rate and inflation expectation in China with traditional macroeconomic and novel textual data estimated by the latent Dirichlet allocation (LDA) model. We combine the MIDAS model with various machine learning techniques to handle the mixed-frequency and high-dimensional problems. Our empirical findings are threefold. First, we collected 866234 articles published over 20 years of Chinese economic newspapers. We systemically decomposed the textual data into news attention time series, which provide narrative descriptions of the economic and social conditions. Second, news attention data can provide similar or even better precision for nowcast, especially for inflation expectation compared with traditional macroeconomic data. Random forest delivers the most accurate forecast among the three machine learning methods, even for longer horizons. Thirdly, the most informative predictors for the nowcast align with existing literature, and news attention variables provide narrative realism for the forecast targets. (c) 2023 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:746 / 761
页数:16
相关论文
共 50 条