Predicting innovative firms using web mining and deep learning

被引:28
|
作者
Kinne, Jan [1 ,2 ,3 ]
Lenz, David [3 ,4 ]
机构
[1] ZEW Ctr European Econ Res, Dept Econ Innovat & Ind Dynam, Mannheim, Germany
[2] Univ Salzburg, Dept Geoinformat Z GIS, Salzburg, Austria
[3] Istari Ai, Mannheim, Germany
[4] Justus Liebig Univ, Dept Econometr & Stat, Giessen, Germany
来源
PLOS ONE | 2021年 / 16卷 / 04期
关键词
PATENT STATISTICS; NEURAL-NETWORKS;
D O I
10.1371/journal.pone.0249071
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional indicators from patents and questionnaire-based surveys often lack coverage, granularity as well as timeliness and may involve high data collection costs, especially when conducted at a large scale. Consequently, they struggle to provide policy makers and scientists with the full picture of the current state of the innovation system. In this paper, we propose a first approach on generating web-based innovation indicators which may have the potential to overcome some of the shortcomings of traditional indicators. Specifically, we develop a method to identify product innovator firms at a large scale and very low costs. We use traditional firm-level indicators from a questionnaire-based innovation survey (German Community Innovation Survey) to train an artificial neural network classification model on labelled (product innovator/no product innovator) web texts of surveyed firms. Subsequently, we apply this classification model to the web texts of hundreds of thousands of firms in Germany to predict whether they are product innovators or not. We then compare these predictions to firm-level patent statistics, survey extrapolation benchmark data, and regional innovation indicators. The results show that our approach produces reliable predictions and has the potential to be a valuable and highly cost-efficient addition to the existing set of innovation indicators, especially due to its coverage and regional granularity.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Classification of handwritten digits on the web using deep learning
    Purve, Shrawan J.
    Runwal, Rutuj
    Chandak, Mohit
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 192 - 198
  • [42] Web Phishing Detection Using a Deep Learning Framework
    Yi, Ping
    Guan, Yuxiang
    Zou, Futai
    Yao, Yao
    Wang, Wei
    Zhu, Ting
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,
  • [43] Deep Learning for the Web
    Jung, Kyomin
    Zhang, Byoung-Tak
    Mitra, Prasenjit
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 1525 - 1526
  • [44] PREDICTING DIABETES FROM PHOTOPLETHYSMOGRAPHY USING DEEP LEARNING
    Avram, Robert
    Tison, Geoffrey
    Kuhar, Peter
    Marcus, Gregory
    Pletcher, Mark
    Olgin, Jeffrey E.
    Aschbacher, Kirstin
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (09) : 16 - 16
  • [45] Predicting the Number of Software Faults using Deep Learning
    Alkaberi, Wahaj
    Assiri, Fatmah
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2024, 14 (02) : 13222 - 13231
  • [46] Predicting Proteolysis in Complex Proteomes Using Deep Learning
    Ozols, Matiss
    Eckersley, Alexander
    Platt, Christopher I.
    Stewart-McGuinness, Callum
    Hibbert, Sarah A.
    Revote, Jerico
    Li, Fuyi
    Griffiths, Christopher E. M.
    Watson, Rachel E. B.
    Song, Jiangning
    Bell, Mike
    Sherratt, Michael J.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (06) : 1 - 20
  • [47] Predicting DNA structure using a deep learning method
    Jinsen Li
    Tsu-Pei Chiu
    Remo Rohs
    Nature Communications, 15
  • [48] Predicting Stealthy Watermarks in Files Using Deep Learning
    Sabir, Maha
    Jones, James H.
    Liu, Hang
    Mbaziira, Alex, V
    2019 7TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2019,
  • [49] Predicting demographics from meibography using deep learning
    Wang, Jiayun
    Graham, Andrew D.
    Yu, Stella X.
    Lin, Meng C.
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [50] Predicting the Stages of Diabetic Retinopathy using Deep Learning
    Harshitha, Chava
    Asha, Alla
    Pushkala, Jangala Lakshmi Sai
    Anogini, Rayapudi Naga Swetha
    Karthikeyan, C.
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 989 - 994