Predicting innovative firms using web mining and deep learning

被引:28
|
作者
Kinne, Jan [1 ,2 ,3 ]
Lenz, David [3 ,4 ]
机构
[1] ZEW Ctr European Econ Res, Dept Econ Innovat & Ind Dynam, Mannheim, Germany
[2] Univ Salzburg, Dept Geoinformat Z GIS, Salzburg, Austria
[3] Istari Ai, Mannheim, Germany
[4] Justus Liebig Univ, Dept Econometr & Stat, Giessen, Germany
来源
PLOS ONE | 2021年 / 16卷 / 04期
关键词
PATENT STATISTICS; NEURAL-NETWORKS;
D O I
10.1371/journal.pone.0249071
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional indicators from patents and questionnaire-based surveys often lack coverage, granularity as well as timeliness and may involve high data collection costs, especially when conducted at a large scale. Consequently, they struggle to provide policy makers and scientists with the full picture of the current state of the innovation system. In this paper, we propose a first approach on generating web-based innovation indicators which may have the potential to overcome some of the shortcomings of traditional indicators. Specifically, we develop a method to identify product innovator firms at a large scale and very low costs. We use traditional firm-level indicators from a questionnaire-based innovation survey (German Community Innovation Survey) to train an artificial neural network classification model on labelled (product innovator/no product innovator) web texts of surveyed firms. Subsequently, we apply this classification model to the web texts of hundreds of thousands of firms in Germany to predict whether they are product innovators or not. We then compare these predictions to firm-level patent statistics, survey extrapolation benchmark data, and regional innovation indicators. The results show that our approach produces reliable predictions and has the potential to be a valuable and highly cost-efficient addition to the existing set of innovation indicators, especially due to its coverage and regional granularity.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Predicting Next Whereabouts Using Deep Learning
    Galarreta, Ana-Paula
    Alatrista-Salas, Hugo
    Nunez-del-Prado, Miguel
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, MDAI 2023, 2023, 13890 : 214 - 225
  • [22] Predicting Prices of Case Furniture Products Using Web Mining Techniques
    Bardak, Timucin
    BIORESOURCES, 2023, 18 (04) : 7412 - 7427
  • [23] Predicting user behavior through Sessions using the Web log mining
    Neelima, G.
    Rodda, Sireesha
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN HUMAN MACHINE INTERACTION (HMI), 2016, : 28 - 32
  • [24] A web-based SQL learning system using web mining techniques
    Tsai, CF
    PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 349 - 353
  • [25] Learning and predicting the unknown class using evidential deep learning
    Akihito Nagahama
    Scientific Reports, 13
  • [26] Learning and predicting the unknown class using evidential deep learning
    Nagahama, Akihito
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [27] PREDICTING NASH PATIENTS USING INNOVATIVE MACHINE LEARNING TECHNIQUES
    Docherty, M.
    Huang, J.
    Regnier, S. A.
    Capkun, G.
    Balp, M. M.
    Ye, Q.
    Janssens, N.
    Lopez, P.
    Pedrosa, M.
    Schattenberg, J. M.
    VALUE IN HEALTH, 2019, 22 : S595 - S595
  • [28] Opinion Mining on Emojis using Deep Learning Techniques
    Karthik, Valmeekam
    Nair, Dheeraj
    Anuradha, J.
    Procedia Computer Science, 2018, 132 : 167 - 173
  • [29] Textual Mining using Deep Learning in official journals
    Teixeira, Lucas Accioly
    Calado, Raquel Bezerra
    Maciel, Alexandre M.A.
    2023 IEEE Latin American Conference on Computational Intelligence, LA-CCI 2023, 2023,
  • [30] Methods of process mining and prediction using deep learning
    Cieplak, Tomasz
    Rymarczyk, Tomasz
    Klosowski, Grzegorz
    Maj, Michal
    Pliszczuk, Damian
    Rymarczyk, Pawel
    PRZEGLAD ELEKTROTECHNICZNY, 2021, 97 (03): : 146 - 149