Predicting Housing Market Trends Using Twitter Data

被引:0
|
作者
Velthorst, Marlon [1 ]
Guven, Cicek [1 ]
机构
[1] Tilburg Univ, Dept Cognit Sci & Artificial Intelligence, Tilburg, Netherlands
关键词
text mining; term frequency; inverse document frequency; machine learning; housing prices; classification; DETERMINANTS; PRICES; SENTIMENT;
D O I
10.1109/SDS.2019.00010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we try to predict the Dutch housing market trends using text mining and machine learning as an application of data science methods in finance. Our main goal is to predict the short term upward or downward trend of the average house price in the Dutch market by using text data collected from Twitter. Twitter is widely used as well and has been proven to be a helpful source of data. However, Twitter, text mining (tokenization, bag-of-words, n-grams, weighted term frequencies) and machine learning (classification algorithms) have not been combined yet in order to predict the housing market trends in short term. In this study, tweets including predefined search words are collected relying on domain knowledge, and the corresponding text is grouped by month as documents. Then words and word sequences are transformed into numerical values. These values served as attributes to predict whether the housing market moves up or down, i.e. we approached this as a binomial classification problem relating text data of a month with (up or down) trends for the following month. Our main results reveal there is a correlation between the (weighted) frequency of words and short term housing trends, in other words, we were able to make accurate predictions of trends in short term using multiple machine learning and text mining techniques combined.
引用
收藏
页码:113 / 118
页数:6
相关论文
共 50 条
  • [41] Using Demographics in Predicting Election Results with Twitter
    Sanders, Eric
    de Gier, Michelle
    van den Bosch, Antal
    SOCIAL INFORMATICS, PT II, 2016, 10047 : 259 - 268
  • [42] Predicting Stock Market Trends Using Machine Learning and Deep Learning Algorithms Via Continuous and Binary Data; a Comparative Analysis
    Nabipour, Mojtaba
    Nayyeri, Pooyan
    Jabani, Hamed
    Shahab, S.
    Mosavi, Amir
    IEEE ACCESS, 2020, 8 : 150199 - 150212
  • [43] Predicting Named Entity Location using Twitter
    Shen, Wei
    Liu, Yinan
    Wang, Jianyong
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 161 - 172
  • [44] USING MOVERSHIP DATA TO IMPROVE INTERCENSAL ESTIMATION OF POPULATION AND HOUSING-MARKET
    GRIER, G
    GRIER, ES
    REVIEW OF PUBLIC DATA USE, 1977, 5 (02): : 11 - 19
  • [45] AN ALTERNATIVE APPROACH TO HOUSING-MARKET SEGMENTATION USING HEDONIC PRICE DATA
    DALEJOHNSON, D
    JOURNAL OF URBAN ECONOMICS, 1982, 11 (03) : 311 - 332
  • [46] Neural Network Based Model for Predicting Housing Market Performance
    Department of Civil, Environmental, and Construction Engineering, University of Central Florida, Orlando, FL 32816-2450, United States
    Tsinghua Sci. Tech., 2008, SUPPL. 1 (325-328):
  • [47] Predicting Postdisaster Residential Housing Reconstruction Based on Market Resources
    Arneson, Erin
    Javernick-Will, Amy
    Hallowell, Matthew
    Corotis, Ross
    NATURAL HAZARDS REVIEW, 2020, 21 (01)
  • [48] Neural Network Based Model for Predicting Housing Market Performance
    Ahmed Khalafallah
    Tsinghua Science and Technology, 2008, (S1) : 325 - 328
  • [49] Predicting With Twitter
    Prada, Jesus
    PROCEEDINGS OF THE 2ND EUROPEAN CONFERENCE ON SOCIAL MEDIA (ECSM 2015), 2015, : 534 - 543
  • [50] Predicting Stock Market Trends Using Random Forests: A Sample of the Zagreb Stock Exchange
    Manojlovic, T.
    Stajduhar, I.
    2015 8TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2015, : 1189 - 1193