Novel Fuzzy Correlation Coefficient and Variable Selection Method for Fuzzy Regression Analysis Based on Distance Approach

被引:3
|
作者
Yoon, Jin Hee [1 ]
Kim, Dae Jong [2 ]
Koo, Yoo Young [3 ]
机构
[1] Sejong Univ, Dept Math & Stat, Seoul 05006, South Korea
[2] Sejong Univ, Dept Business & Adm, Seoul 05006, South Korea
[3] Univ Coll, Yonsei Univ, Incheon 21983, South Korea
基金
新加坡国家研究基金会;
关键词
Fuzzy correlation coefficient; Fuzzy Regression; Fuzzy variable Selection Method; L2; Distance;
D O I
10.1007/s40815-023-01546-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
data analysis, analyzing the relationships between the variables such as correlation analysis and regression analysis are very important. Correlation analysis and regression analysis are not only very important in analyzing the influence relationship and causal relationship of variables but also serve as the basis for statistical analysis. Furthermore, they are essential and important as basic analysis for machine learning analysis such as deep learning. This is because in analyzing the input and output in deep learning, variables with high correlation are selected first, and in analyzing the causal relationship, it is basic to first conduct basic analysis such as regression analysis. Especially, when data are observed as fuzzy data with ambiguous information, it is difficult to propose unique methods for those analyses due to its complexity. However, the application of fuzzy theory to correlation analysis for data with such ambiguous information has not been an effective study, and several studies have been conducted in cases where the data is not general fuzzy data or interval estimation. As a result, the effectiveness of the fuzzy theory was not highlighted. In particular, the variable selection method for selecting important variables in multiple regression analysis is a very important and essential process in regression analysis. A variable that is significant in simple regression analysis may not be significant in multiple regression analysis due to its relationship with other variables. Therefore, not all variables that affect the dependent variable can be used as independent variables in multiple regression analysis. Therefore, multiple regression analysis goes through the process of excluding some variables. But until now, the process of fuzzy multiple regression analysis has not been applied without a variable selection method and the significance of important variables has not been emphasized that much. In this paper, a fuzzy correlation coefficient and multiple fuzzy regression analysis using variable section method are proposed. For this, first defuzzification and fuzzy ordering are defined. And then fuzzy correlation coefficient is proposed using L2 distance. Next, fuzzy sum of squares are defined for F-statistics to test the significance of the regression model. Using this F-statistics, fuzzy R2, and fuzzy RMSE, several variable selection methods are proposed based on distance approach. For the data analysis, foreign exchange reserve data and house price of South Korea have been applied which are important indicators for economic crisis. The financial data is mostly recorded as closing values, but the closing values cannot be the representative of the given period of time. Therefore, we can deal with the financial data as fuzzy data which have some fluctuation that can be considered as vagueness that the data originally include. We have used foreign exchange reserve data and house price data with several financial variables. And the proposed fuzzy correlation coefficient and variable selection for fuzzy regression analysis are applied to these financial data.
引用
收藏
页码:2969 / 2985
页数:17
相关论文
共 50 条
  • [1] Novel Fuzzy Correlation Coefficient and Variable Selection Method for Fuzzy Regression Analysis Based on Distance Approach
    Jin Hee Yoon
    Dae Jong Kim
    Yoo Young Koo
    International Journal of Fuzzy Systems, 2023, 25 : 2969 - 2985
  • [2] A METHOD OF VARIABLE SELECTION FOR FUZZY REGRESSION - THE POSSIBILITY APPROACH
    Gladysz, Barbara
    Kuchta, Dorota
    OPERATIONS RESEARCH AND DECISIONS, 2011, 21 (02) : 5 - 15
  • [3] A Forward Variable Selection Method for Fuzzy Logistic Regression
    Fatemeh Salmani
    Seyed Mahmoud Taheri
    Alireza Abadi
    International Journal of Fuzzy Systems, 2019, 21 : 1259 - 1269
  • [4] A Forward Variable Selection Method for Fuzzy Logistic Regression
    Salmani, Fatemeh
    Taheri, Seyed Mahmoud
    Abadi, Alireza
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (04) : 1259 - 1269
  • [5] A SIMULATION BASED APPROACH TO CALCULATE THE FUZZY CORRELATION COEFFICIENT OF FUZZY OBSERVATIONS
    Aladag, Cagdas Hakan
    Egrioglu, Erol
    Yolcu, Ufuk
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2012, 41 (03): : 361 - 364
  • [6] A novel Pythagorean fuzzy correlation coefficient based on Spearman's technique of correlation coefficient with applications in supplier selection process
    Ejegwa, Paul Augustine
    Kausar, Nasreen
    Aydin, Nezir
    Deveci, Muhammet
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2025, 44
  • [7] A novel failure mode and effect analysis method with spherical fuzzy entropy and spherical fuzzy weight correlation coefficient
    Ma, Qian-Xia
    Zhu, Xiao-Min
    Bai, Kai-Yuan
    Zhang, Run-Tong
    Liu, Dong-Wei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122
  • [8] A fuzzy penalized regression model with variable selection
    Kashani, M.
    Arashi, M.
    Rabiei, M. R.
    D'Urso, P.
    De Giovanni, L.
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 175
  • [9] Bicriteria variable selection in a fuzzy regression equation
    Wang, HF
    Tsaur, RC
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2000, 40 (6-7) : 877 - 883
  • [10] Fuzzy Distance-Based Approach for the Assessment and Selection of Programming Languages: Fuzzy-Based Hybrid Approach for Selection of PL
    Garg, Rakesh
    Raheja, Supriya
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2023, 15 (01)