Using machine learning to select variables in data envelopment analysis: Simulations and application using electricity distribution data

Cited by: 12
Authors
Duras, Toni [1]
Javed, Farrukh [2]
Mansson, Kristofer [1]
Sjolander, Paer [1]
Soderberg, Magnus [3]
Affiliations
[1] Jonkoping Univ, Jonkoping Int Business Sch, POB 1026, SE-55111 Jonkoping, Sweden
[2] Lund Univ, Lund, Sweden
[3] Griffith Univ, Brisbane, Australia
Keywords
Data envelopment analysis; Curse of dimensionality; Machine learning; Variable selection; Regulation; EFFICIENCY; REGRESSION;
DOI
10.1016/j.eneco.2023.106621
Chinese Library Classification
F [Economics]
Discipline classification code
02
Abstract
Agencies that regulate electricity providers often apply nonparametric data envelopment analysis (DEA) to assess the relative efficiency of each firm. The reliability and validity of DEA are contingent upon selecting relevant input variables. In the era of big (wide) data, the assumptions of traditional variable selection techniques are often violated due to challenges related to high-dimensional data and their standard empirical properties. Currently, regulators have access to a large number of potential input variables. Therefore, our aim is to introduce new machine learning methods for regulators of the energy market. We also propose a new two-step analytical approach where, in the first step, the machine learning-based adaptive least absolute shrinkage and selection operator (ALASSO) is used to select variables and, in the second step, selected variables are used in a DEA model. In contrast to previous research, we find, by using a more realistic data-generating process common for production functions (i.e., Cobb-Douglas and Translog), that the performance of different machine learning techniques differs substantially in different empirically relevant situations. Simulations also reveal that the ALASSO is superior to other machine learning and regression-based methods when the collinearity is low or moderate. However, in situations of multicollinearity, the LASSO approach exhibits the best performance. We also use real data from the Swedish electricity distribution market to illustrate the empirical relevance of selecting the most appropriate variable selection method.
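The two-step approach described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the synthetic Cobb-Douglas data, the penalty `lam`, the exponent `gamma`, the OLS pilot estimate, the input-oriented constant-returns DEA formulation, and all function names (`weighted_lasso`, `alasso_select`, `dea_input_crs`) are choices made here for illustration only.

```python
import numpy as np
from scipy.optimize import linprog


def weighted_lasso(X, y, lam, weights, n_iter=200):
    """Coordinate descent for (1/2n)||y - Xb||^2 + lam * sum_j weights[j] * |b_j|."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(p):
            # Partial residual that leaves out the contribution of column j.
            r = y - X @ b + X[:, j] * b[j]
            rho = X[:, j] @ r / n
            # Soft-thresholding with a coefficient-specific penalty.
            b[j] = np.sign(rho) * max(abs(rho) - lam * weights[j], 0.0) / col_sq[j]
    return b


def alasso_select(X, y, lam=0.01, gamma=1.0):
    """Step 1: return indices of the columns kept by an adaptive LASSO."""
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    yc = y - y.mean()
    b_ols = np.linalg.lstsq(Xs, yc, rcond=None)[0]   # pilot estimate (assumed: OLS)
    weights = 1.0 / (np.abs(b_ols) ** gamma + 1e-8)  # small pilot coefficient -> heavy penalty
    b = weighted_lasso(Xs, yc, lam, weights)
    return np.flatnonzero(np.abs(b) > 1e-8)


def dea_input_crs(inputs, outputs):
    """Step 2: input-oriented, constant-returns-to-scale DEA score per unit."""
    n, m = inputs.shape
    s = outputs.shape[1]
    scores = np.empty(n)
    cost = np.r_[1.0, np.zeros(n)]  # decision vector is [theta, lambda_1, ..., lambda_n]
    for o in range(n):
        a_in = np.column_stack([-inputs[o], inputs.T])      # sum_j lambda_j x_ij <= theta * x_io
        a_out = np.column_stack([np.zeros(s), -outputs.T])  # sum_j lambda_j y_rj >= y_ro
        res = linprog(cost, A_ub=np.vstack([a_in, a_out]),
                      b_ub=np.r_[np.zeros(m), -outputs[o]], method="highs")
        scores[o] = res.fun
    return scores


# Hypothetical data: 6 candidate inputs, only the first two enter the
# Cobb-Douglas production function; one-sided inefficiency lowers output.
rng = np.random.default_rng(0)
n_firms, n_candidates = 200, 6
x = rng.uniform(1.0, 10.0, size=(n_firms, n_candidates))
inefficiency = np.abs(rng.normal(0.0, 0.2, n_firms))
noise = rng.normal(0.0, 0.05, n_firms)
log_y = 0.5 * np.log(x[:, 0]) + 0.4 * np.log(x[:, 1]) + noise - inefficiency

selected = alasso_select(np.log(x), log_y)                       # step 1 on the log-linear form
scores = dea_input_crs(x[:, selected], np.exp(log_y)[:, None])   # step 2 on levels
print("selected columns:", selected)
print("mean DEA score:", round(float(scores.mean()), 3))
```

The regression step is run on logs because a Cobb-Douglas function is linear in logs, while the DEA step uses the selected inputs in levels. In practice the penalty would be tuned, for example by cross-validation or an information criterion, rather than fixed as it is here.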
Pages: 10
Related papers
50 in total
  • [21] Group Performance Analysis of State Electricity Boards in India using Data Envelopment Analysis
    Jayamani, S.
    Jagadeeshwaran, D.
    Meenakumari, R.
    Kamaraj, N.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION, COMMUNICATION AND ENERGY CONSERVATION INCACEC 2009 VOLUME II, 2009, : 913 - +
  • [22] Data envelopment analysis using the binary-data
    Pourmahmoud, Jafar
    Azad, Maedeh Gholam
    JOURNAL OF MODELLING IN MANAGEMENT, 2022, 17 (01) : 49 - 65
  • [23] The efficiency distribution approach in data envelopment analysis: An application
    Sengupta, JK
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1996, 47 (11) : 1387 - 1397
  • [24] RELATIVE EFFICIENCY ASSESSMENTS USING DATA ENVELOPMENT ANALYSIS - AN APPLICATION TO DATA ON RATES DEPARTMENTS
    THANASSOULIS, E
    DYSON, RG
    FOSTER, MJ
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1987, 38 (05) : 397 - 411
  • [25] Asymptotic Distribution of the Sum of Skew-Normal Random Variables: Application in Data Envelopment Analysis
    Nazari, Ali
    Behzadi, Mohammad Hassan
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY TRANSACTION A-SCIENCE, 2017, 41 (A1): : 199 - 207
  • [26] Asymptotic Distribution of the Sum of Skew-Normal Random Variables: Application in Data Envelopment Analysis
    Ali Nazari
    Mohammad Hassan Behzadi
    Iranian Journal of Science and Technology, Transactions A: Science, 2017, 41 : 199 - 207
  • [27] Analysis of Banking Data Using Machine Learning
    Patil, Priyanka S.
    Dharwadkar, Nagaraj V.
    2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 876 - 881
  • [28] Data envelopment analysis with interactive variables
    Ji, Aibing
    Liu, Hui
    Qiu, Hong-jie
    Lin, Haobo
    MANAGEMENT DECISION, 2015, 53 (10) : 2390 - 2406
  • [29] Variables reduction in data envelopment analysis
    Amirteimoori, Alireza
    Despotis, Dimitris K.
    Kordrostami, Sohrab
    OPTIMIZATION, 2014, 63 (05) : 735 - 745
  • [30] Efficiency control of the electrical distribution utilities using data envelopment analysis
    Khodr, H. M.
    Feijoo, D.
    Perez, E.
    Zerpa, I. J.
    De Oliveira-De Jesus, P. M.
    Yusta, J. M.
    2006 IEEE/PES TRANSMISSION & DISTRIBUTION CONFERENCE & EXPOSITION: LATIN AMERICA, VOLS 1-3, 2006, : 1075 - 1080