Predicting Housing Market Trends Using Twitter Data

被引:0
|
作者
Velthorst, Marlon [1 ]
Guven, Cicek [1 ]
机构
[1] Tilburg Univ, Dept Cognit Sci & Artificial Intelligence, Tilburg, Netherlands
关键词
text mining; term frequency; inverse document frequency; machine learning; housing prices; classification; DETERMINANTS; PRICES; SENTIMENT;
D O I
10.1109/SDS.2019.00010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we try to predict the Dutch housing market trends using text mining and machine learning as an application of data science methods in finance. Our main goal is to predict the short term upward or downward trend of the average house price in the Dutch market by using text data collected from Twitter. Twitter is widely used as well and has been proven to be a helpful source of data. However, Twitter, text mining (tokenization, bag-of-words, n-grams, weighted term frequencies) and machine learning (classification algorithms) have not been combined yet in order to predict the housing market trends in short term. In this study, tweets including predefined search words are collected relying on domain knowledge, and the corresponding text is grouped by month as documents. Then words and word sequences are transformed into numerical values. These values served as attributes to predict whether the housing market moves up or down, i.e. we approached this as a binomial classification problem relating text data of a month with (up or down) trends for the following month. Our main results reveal there is a correlation between the (weighted) frequency of words and short term housing trends, in other words, we were able to make accurate predictions of trends in short term using multiple machine learning and text mining techniques combined.
引用
收藏
页码:113 / 118
页数:6
相关论文
共 50 条
  • [32] The turf is always greener: Predicting decommitments in college football recruiting using Twitter data
    Bigsby, Kristina Gavin
    Ohlmann, Jeffrey W.
    Zhao, Kang
    DECISION SUPPORT SYSTEMS, 2019, 116 : 1 - 12
  • [33] Predicting Twitter User Demographics using Distant Supervision from Website Traffic Data
    Culotta, Aron
    Ravi, Nirmal Kumar
    Cutler, Jennifer
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2016, 55 : 389 - 408
  • [34] Extracting Collective Trends from Twitter Using Social-Based Data Mining
    Bello, Gema
    Menendez, Hector
    Okazaki, Shintaro
    Camacho, David
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 622 - 630
  • [35] Predicting Housing Price Trends in Poland: Online Social Engagement - Google Trends
    Belej, Miroslaw
    REAL ESTATE MANAGEMENT AND VALUATION, 2023, 31 (04) : 73 - 87
  • [36] Predicting Reputation in the Sharing Economy with Twitter Social Data
    Prada, Antonio
    Iglesias, Carlos A.
    APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [37] Predicting Political Mood Tendencies based on Twitter Data
    Hernandez-Suarez, A.
    Sanchez-Perez, G.
    Martinez-Hernandez, V.
    Perez-Meana, H.
    Toscano-Medina, K.
    Nakano, M.
    Sanchez, V.
    2017 5TH INTERNATIONAL WORKSHOP ON BIOMETRICS AND FORENSICS (IWBF 2017), 2017,
  • [38] Predicting Personality with Twitter Data and Machine Learning Models
    Ergu, Izel
    Isik, Zerrin
    Yankayis, Ismail
    2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 386 - 390
  • [39] Predicting Social Trends from Non-photographic Images on Twitter
    Yazdani, Mehrdad
    Manovich, Lev
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1653 - 1660
  • [40] Predicting STC Customers' Satisfaction Using Twitter
    Almuqren, Latifah
    Cristea, Alexandra, I
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (01): : 204 - 210