A study of real-world micrograph data quality and machine learning model robustness

被引:10
|
作者
Zhong, Xiaoting [1 ]
Gallagher, Brian [2 ]
Eves, Keenan [3 ]
Robertson, Emily [1 ]
Mundhenk, T. Nathan [4 ]
Han, T. Yong-Jin [1 ]
机构
[1] Lawrence Livermore Natl Lab, Mat Sci Div, Livermore, CA 94550 USA
[2] Lawrence Livermore Natl Lab, Ctr Appl Sci Comp, Livermore, CA 94550 USA
[3] Lawrence Livermore Natl Lab, Def Technol Engn Div, Livermore, CA 94550 USA
[4] Lawrence Livermore Natl Lab, Computat Engn Div, Livermore, CA 94550 USA
关键词
ADAPTIVE HISTOGRAM EQUALIZATION; COMPUTER VISION; IMAGE; PREDICTION; TIME;
D O I
10.1038/s41524-021-00616-3
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Machine-learning (ML) techniques hold the potential of enabling efficient quantitative micrograph analysis, but the robustness of ML models with respect to real-world micrograph quality variations has not been carefully evaluated. We collected thousands of scanning electron microscopy (SEM) micrographs for molecular solid materials, in which image pixel intensities vary due to both the microstructure content and microscope instrument conditions. We then built ML models to predict the ultimate compressive strength (UCS) of consolidated molecular solids, by encoding micrographs with different image feature descriptors and training a random forest regressor, and by training an end-to-end deep-learning (DL) model. Results show that instrument-induced pixel intensity signals can affect ML model predictions in a consistently negative way. As a remedy, we explored intensity normalization techniques. It is seen that intensity normalization helps to improve micrograph data quality and ML model robustness, but microscope-induced intensity variations can be difficult to eliminate.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] A study of real-world micrograph data quality and machine learning model robustness
    Xiaoting Zhong
    Brian Gallagher
    Keenan Eves
    Emily Robertson
    T. Nathan Mundhenk
    T. Yong-Jin Han
    npj Computational Materials, 7
  • [2] Predicting the Robustness of Large Real-World Social Networks Using a Machine Learning Model
    Nguyen, Ngoc-Kim-Khanh
    Nguyen, Quang
    Pham, Hai-Ha
    Le, Thi-Trang
    Nguyen, Tuan-Minh
    Cassi, Davide
    Scotognella, Francesco
    Alfieri, Roberto
    Bellingeri, Michele
    Complexity, 2022, 2022
  • [3] Predicting the Robustness of Large Real-World Social Networks Using a Machine Learning Model
    Nguyen, Ngoc-Kim-Khanh
    Nguyen, Quang
    Pham, Hai-Ha
    Le, Thi-Trang
    Nguyen, Tuan-Minh
    Cassi, Davide
    Scotognella, Francesco
    Alfieri, Roberto
    Bellingeri, Michele
    COMPLEXITY, 2022, 2022
  • [4] Predicting the Robustness of Large Real-World Social Networks Using a Machine Learning Model
    Nguyen, Ngoc-Kim-Khanh
    Nguyen, Quang
    Pham, Hai-Ha
    Le, Thi-Trang
    Nguyen, Tuan-Minh
    Cassi, Davide
    Scotognella, Francesco
    Alfieri, Roberto
    Bellingeri, Michele
    COMPLEXITY, 2022, 2022
  • [5] Meaningful Machine Learning Robustness Evaluation in Real-World Machine Learning Enabled System Contexts
    Hiett, Ben
    Boyd, Peter
    Fletcher, Charles
    Gowland, Sam
    Sharp, James H.
    Sloggett, David
    Banks, Alec
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS IV, 2022, 12276
  • [6] A Comprehensive Study of Real-World Bugs in Machine Learning Model Optimization
    Guan, Hao
    Xiao, Ying
    Li, Jiaying
    Liu, Yepang
    Bai, Guangdong
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 147 - 158
  • [7] Towards Machine Learning with Zero Real-World Data
    Kang, Cholmin
    Jung, Hyunwoo
    Lee, Youngki
    WEARSYS'19: PROCEEDINGS OF THE 5TH ACM WORKSHOP ON WEARABLE SYSTEMS AND APPLICATIONS, 2019, : 41 - 46
  • [8] Machine learning approaches for anomaly detection of water quality on a real-world data set*
    Muharemi, Fitore
    Logofatu, Doina
    Leon, Florin
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2019, 3 (03) : 294 - 307
  • [9] Real-World Data and Machine Learning to Predict Cardiac Amyloidosis
    Garcia-Garcia, Elena
    Maria Gonzalez-Romero, Gracia
    Martin-Perez, Encarna M.
    Zapata Cornejo, Enrique de Dios
    Escobar-Aguilar, Gema
    Cardenas Bonnet, Marlon Felix
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (03) : 1 - 15
  • [10] Real-World Evidence: Integrating Machine Learning with Real-World Big Data for Predictive Analytics in Healthcare
    Vecchio, Nicolas
    CARDIOLOGY, 2024,