The prediction of crystal densities of a big data set using 1D and 2D structure features

被引:0
|
作者
Li, Xianlan [1 ]
Kong, Dingling [1 ]
Luan, Yue [1 ]
Guo, Lili [1 ]
Lu, Yanhua [2 ]
Li, Wei [2 ]
Tang, Meng [3 ]
Zhang, Qingyou [1 ]
Pang, Aimin [2 ]
机构
[1] Henan Univ, Henan Engn Res Ctr Ind Circulating Water Treatment, Henan Joint Int Res Lab Environm Pollut Control Ma, Kaifeng 475004, Peoples R China
[2] Hubei Inst Aerosp Chemotechnol, Sci & Technol Aerosp Chem Power Lab, Xiangyang 441003, Hubei, Peoples R China
[3] Harbin Inst Technol, Sch Phys, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Density; Quantitative structure-property relationships; Big data set; Partial least squares; Random forest; NITRATE ESTERS; IONIC LIQUIDS; QSPR; ENTHALPIES; VAPORIZATION; EXPLOSIVES; NITRAMINES; SURFACE; HEAT;
D O I
10.1007/s11224-024-02279-4
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A large data set of over 30 thousand organic compounds containing carbon, nitrogen, oxygen, fluorine, and hydrogen was collected, and the density of each compound was predicted by 1D descriptors derived from its molecular formula and 2D descriptors derived from its constitutional structural features. The 2D structural features are composed of Benson's groups, corrected groups, and 2D structural features of the whole molecular structures. All the descriptors were extracted by an in-house program in Java with a function to ensure that each atom (or bond) of molecules is represented by Benson's groups once for atom-based (or bond-based) descriptors. Partial least square (PLS) and random forest (RF) methods were used separately to build models to predict the density. Further, the variable selection of descriptors was performed by variable importance of RF. For partial least square, the combination of the models constructed by descriptors based on the atoms and the bonds achieved the best results in this paper: for the cross-validation of the training set, the Pearson correlation coefficient (R) = 0.9270, mean absolute error (MAE) = 0.0270 g center dot cm-3, and root mean squared error (RMSE) = 0.0426 g center dot cm-3; for the prediction of the test set, R = 0.9454, MAE = 0.0263 g center dot cm-3, and RMSE = 0.0375 g center dot cm-3.
引用
收藏
页码:1375 / 1385
页数:11
相关论文
共 50 条
  • [31] Pyridine Carboxylate Lanthanide Coordination Complexes with 1D and 2D Structure
    Zhang, Fang
    Huang, Fang
    Yao, Xu
    Jin, Ying
    Chen, Qifan
    Liu, Fei
    Li, Guangming
    JOURNAL OF INORGANIC AND ORGANOMETALLIC POLYMERS AND MATERIALS, 2015, 25 (05) : 1183 - 1188
  • [32] 1D to 2D transitional structure of plasmonic crystals: fabrication and characterization
    Kang, H. K.
    Lee, K. H.
    Wong, C. C.
    Romanato, F.
    APPLIED PHYSICS B-LASERS AND OPTICS, 2009, 97 (03): : 671 - 677
  • [33] Design of 2D filter banks based on 1D lattice structure
    Isogimi, K
    Horibe, T
    Ikehara, M
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2002, 85 (12): : 38 - 46
  • [34] Extended 1D defect induced magnetism in 2D MoS2 crystal
    Zhang, Kai
    Pan, Yu
    Wang, Lu
    Mei, Wai-Ning
    Wu, Xiaojun
    JOURNAL OF PHYSICS-CONDENSED MATTER, 2020, 32 (21)
  • [35] Single cycle THz pulses in 1D and 2D photonic crystal structures
    Peier, Peter
    Pilz, Soenke
    Merbold, Hannes
    Pashinin, Vladimir
    Kononenko, Taras
    Pimenov, Sergei
    Feurer, Thomas
    ULTRAFAST PHENOMENA XVI, 2009, 92 : 678 - +
  • [36] Synthesis and structure elucidation of five series of aminoflavones using 1D and 2D NMR spectroscopy
    Barros, Ana I. R. N. A.
    Silva, Artur M. S.
    MAGNETIC RESONANCE IN CHEMISTRY, 2006, 44 (12) : 1122 - 1127
  • [37] Structure elucidation of a novel group of dithiocarbamate derivatives using 1D and 2D NMR spectroscopy
    Li, Qin
    Yang, Chunhui
    Ge, Ze-Mei
    Li, Runtao
    Cui, Yuxin
    MAGNETIC RESONANCE IN CHEMISTRY, 2006, 44 (07) : 720 - 723
  • [38] Assembly and Crystal Structure of a 2D lead(II) Coordination Polymer with 1D Metal-Oxygen Chains
    Wen, Gui-Lin
    Ma, Lu-Fang
    Chen, Yong-Hong
    Liu, Dao-Fu
    SYNTHESIS AND REACTIVITY IN INORGANIC METAL-ORGANIC AND NANO-METAL CHEMISTRY, 2014, 44 (05) : 687 - 691
  • [39] Impact of substituent position on crystal structure and photoconductivity in 1D and 2D lead(ii) benzenethiolate coordination polymers
    Akiyoshi, Ryohei
    Saeki, Akinori
    Ogasawara, Kazuyoshi
    Tanaka, Daisuke
    JOURNAL OF MATERIALS CHEMISTRY C, 2024, 12 (06) : 1958 - 1964
  • [40] 1D/1D及1D/2D耦合水动力模型构建方法研究
    周倩倩
    苏炯恒
    梅胜
    许明华
    水资源与水工程学报, 2019, 30 (05) : 21 - 25