Quality Assessment of Data Using Statistical and Machine Learning Methods

被引:8
|
作者
Singh, Prerna [1 ]
Suri, Bharti [2 ]
机构
[1] Jagan Inst Management Studies, New Delhi, India
[2] USICT, New Delhi, India
关键词
Conceptual model; Data warehouse quality; Multidimensional data model; Statistical; Understand ability; CONCEPTUAL MODELS; METRICS;
D O I
10.1007/978-81-322-2208-8_10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data warehouses are used in organization for efficiently managing the information. The data from various heterogeneous data sources are integrated in data warehouse in order to do analysis and make decision. Data warehouse quality is very important as it is the main tool for strategic decision. Data warehouse quality is influenced by Data model quality which is further influenced by conceptual data model. In this paper, we first summarize the set of metrics for measuring the understand ability of conceptual data model for data warehouses. The statistical and machine learning methods are used to predict effect of structural metrics, on understand ability, efficiency and effectiveness of Data warehouse Multidimensional (MD) conceptual model.
引用
收藏
页码:89 / 97
页数:9
相关论文
共 50 条
  • [1] Exploring the Quality of Dynamic Open Government Data Using Statistical and Machine Learning Methods
    Karamanou, Areti
    Brimos, Petros
    Kalampokis, Evangelos
    Tarabanis, Konstantinos
    SENSORS, 2022, 22 (24)
  • [2] Empirical Validation of Website Quality Using Statistical and Machine Learning Methods
    Dhiman, Poonam
    Anjali
    2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 286 - 291
  • [3] OpenStreetMap quality assessment using unsupervised machine learning methods
    Jacobs, Kent T.
    Mitchell, Scott W.
    TRANSACTIONS IN GIS, 2020, 24 (05) : 1280 - 1298
  • [4] Fault Prediction Using Statistical and Machine Learning Methods for Improving Software Quality
    Malhotra, Ruchika
    Jain, Ankita
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2012, 8 (02): : 241 - 262
  • [5] Prediction of Air Quality and Pollution using Statistical Methods and Machine Learning Techniques
    Devasekhar, V.
    Natarajan, P.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 927 - 937
  • [6] Assessment of the Quality and Mechanical Parameters of Castings Using Machine Learning Methods
    Jaskowiec, Krzysztof
    Wilk-Kolodziejczyk, Dorota
    Bartlomiej, Sniezynski
    Reczek, Witor
    Bitka, Adam
    Malysza, Marcin
    Doroszewski, Maciej
    Pirowski, Zenon
    Boron, Lukasz
    MATERIALS, 2022, 15 (08)
  • [7] Big Data Analysis Using Modern Statistical and Machine Learning Methods in Medicine
    Yoo, Changwon
    Ramirez, Luis
    Liuzzi, Juan
    INTERNATIONAL NEUROUROLOGY JOURNAL, 2014, 18 (02) : 50 - 57
  • [8] Data science and machine learning: Mathematical and statistical methods
    Lai, Yin-Ju
    Hsiao, Chuhsing Kate
    Botev, Zdravko
    BIOMETRICS, 2021, 77 (04) : 1503 - 1504
  • [9] Comparison of statistical and machine learning methods in modelling of data with multicollinearity
    Garg, Akhil
    Tai, Kang
    INTERNATIONAL JOURNAL OF MODELLING IDENTIFICATION AND CONTROL, 2013, 18 (04) : 295 - 312
  • [10] Groundwater Quality Assessment Using machine learning
    Mullasseri, Sileesh
    Mishra, Ravi
    Singh, Archana
    Chandra, G. Sharath
    Jhariya, D. C.
    Mishra, Shwetakshi
    Jadav, Ravindra
    Hans, Aradhana L.
    Buch, Khuban
    CURRENT SCIENCE, 2021, 121 (05): : 606 - 607