Predicting the maximum absorption wavelength of azo dyes using an interpretable machine learning strategy

被引:25
|
作者
Mai, Jiaqi [1 ]
Lu, Tian [2 ]
Xu, Pengcheng [2 ]
Lian, Zhengheng [2 ]
Li, Minjie [1 ]
Lu, Wencong [1 ,2 ,3 ]
机构
[1] Shanghai Univ, Coll Sci, Dept Chem, Shanghai 200444, Peoples R China
[2] Shanghai Univ, Mat Genome Inst, Shanghai 200444, Peoples R China
[3] Zhejiang Lab, Hangzhou 311100, Peoples R China
关键词
Azo dyes; Machine learning; Maximum absorption wavelength; SHAP; HOLOGRAPHIC DISPLAY; ORGANIC-MOLECULES; TD-DFT; DEFINITION; SPECTRA;
D O I
10.1016/j.dyepig.2022.110647
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
The maximum absorption wavelength (lambda(max)) is one of the most important properties of azo dyes. It is essential to obtain lambda(max) of azo dyes for the development of new molecules in a short time. Herein, the machine learning algorithm "XGBoost " was used to establish a model for predicting lambda(max )of azo dyes. It was found that the coef-ficient of determinations (R-2) of leave-one-out cross-validation (LOOCV) and test set were 0.87, 0.73, respec-tively. According to SHapley Additive exPlanations (SHAP) analysis, the number of sulfur atoms of R-2 group has a strong positive correlation with lambda(max). The more C-N pairs of topological distance 4 appear in R1 group, the more likely the molecular lambda(max )is red-shifted. Further, the high-throughput screening strategy was adopted to screen out 26 azo molecules with larger lambda(max )from nearly 20,000 virtual samples. These molecular lambda(max )are expected to be red shifted from the 610 nm in the dataset. Our study provides a convenient way to search for azo dyes with larger lambda(max).
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Interpretable Machine Learning Approach to Predicting Electric Vehicle Buying Decisions
    Naseri, Hamed
    Waygood, E. O. D.
    Wang, Bobin
    Patterson, Zachary
    TRANSPORTATION RESEARCH RECORD, 2023, 2677 (12) : 704 - 717
  • [42] StratoMod: predicting sequencing and variant calling errors with interpretable machine learning
    Dwarshuis, Nathan
    Tonner, Peter
    Olson, Nathan D.
    Sedlazeck, Fritz J.
    Wagner, Justin
    Zook, Justin M.
    COMMUNICATIONS BIOLOGY, 2024, 7 (01)
  • [43] Interpretable machine learning for predicting sepsis risk in emergency triage patients
    Liu, Zheng
    Shu, Wenqi
    Li, Teng
    Zhang, Xuan
    Chong, Wei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [44] An Interpretable Machine Learning Approach for Predicting Hospital Length of Stay and Readmission
    Liu, Yuxi
    Qin, Shaowen
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT I, 2022, 13087 : 73 - 85
  • [45] Predicting Hospital No-Shows: Interpretable Machine Learning Models Approach
    Toffaha, Khaled M.
    Simsekler, Mecit Can Emre
    Alshehhi, Aamna
    Omar, Mohammed Atif
    IEEE ACCESS, 2024, 12 : 166058 - 166067
  • [46] Interpretable machine learning scheme for predicting bridge pier scour depth
    Kim, Taeyoon
    Shahriar, Azmayeen R.
    Lee, Woo-Dong
    Gabr, Mohammed A.
    COMPUTERS AND GEOTECHNICS, 2024, 170
  • [47] Interpretable machine learning for predicting evaporation from Awash reservoirs, Ethiopia
    Kidist Demessie Eshetu
    Tena Alamirew
    Tekalegn Ayele Woldesenbet
    Earth Science Informatics, 2023, 16 (4) : 3209 - 3226
  • [48] Interpretable machine-learning models for predicting creep recovery of concrete
    Mei, Shengqi
    Liu, Xiaodong
    Wang, Xingju
    Li, Xufeng
    STRUCTURAL CONCRETE, 2024,
  • [49] Predicting and interpreting digital platform survival: An interpretable machine learning approach
    Zhu, Xinyu
    Zhang, Qiang
    Ma, Baojun
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2024, 67
  • [50] Interpretable Machine Learning Model for Predicting Postpartum Depression: Retrospective Study
    Zhang, Ren
    Liu, Yi
    Zhang, Zhiwei
    Luo, Rui
    Lv, Bin
    JMIR MEDICAL INFORMATICS, 2025, 13