On application of constitutional descriptors for merging of quinoxaline data sets, using linear statistical methods

被引:4
|
作者
Ghosh, Payell [1 ]
Vracko, Marjan [2 ]
Chattopadhyay, Asis Kumar [3 ]
Bagchi, Manish C. [1 ]
机构
[1] Indian Inst Chem Biol, Struct Biol & Bioinformat Div, Kolkata 700032, India
[2] Natl Inst Chem, Lab Chemometr, Ljubljana 1000, Slovenia
[3] Univ Calcutta, Univ Coll Sci, Dept Stat, Kolkata 700019, India
关键词
principal component analysis; partial least squares; quantitative structure-activity relationship; quinoxaline compounds; theoretical molecular descriptors;
D O I
10.1111/j.1747-0285.2008.00686.x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The present paper is an attempt for unifying two different quinoxaline data sets with a wide range of substituents in 2, 3, 7, and 8 positions having excellent antitubercular activities with a view to developing robust and reliable structure-activity relationships. The merging has been performed for these two sets of quinoxaline 1,4-di-N-oxides derivatives comprising 29 and 18 compounds, respectively, on the basis of constitutional descriptors, which denotes the structural characterization of the molecules. Principal component analysis was performed to see the distribution of the compounds from two data sets for the constitutional descriptors. The distribution of compounds in score plot based on constitutional descriptors suggests unification of quinoxaline data sets which is useful for the model development. Outlier detection was performed from the standpoint of residual analysis of the partial least squares regression models. The superiority of the constitutional descriptors over other calculated molecular descriptors has been established from the standpoint of leave-one-out cross-validation technique associated with partial least squares regression analysis. Internal validation through the leave-many-out methodology was also performed with good results, assuring the stability of the models. The results obtained from linear partial least squares regression analysis lead to a statistically significant and robust quantitative structure-activity relationship modeling.
引用
收藏
页码:155 / 162
页数:8
相关论文
共 50 条
  • [1] Methods for merging data sets in electron cryo-microscopy
    Wilkinson, Max E.
    Kumar, Ananthanarayanan
    Casanal, Ana
    ACTA CRYSTALLOGRAPHICA SECTION D-STRUCTURAL BIOLOGY, 2019, 75 : 782 - 791
  • [2] Statistical Methods for Generating Synthetic Email Data Sets
    Babalola, Karolyn O.
    Jennings, Otis B.
    Urdiales, Esteban
    DeBardelaben, James A.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3986 - 3990
  • [3] A review of statistical methods for comparing two data sets
    Duffy, A.
    Orlandi, A.
    APPLIED COMPUTATIONAL ELECTROMAGNETICS SOCIETY JOURNAL, 2008, 23 (01): : 90 - 97
  • [4] Statistical Methods for Comparison of Data Sets of Construction Methods and Building Evaluation
    Niroumand, Hamed
    Zain, M. F. M.
    Jamil, Maslina
    2ND CYPRUS INTERNATIONAL CONFERENCE ON EDUCATIONAL RESEARCH (CY-ICER 2013), 2013, 89 : 218 - 221
  • [5] PRECIPITATION DATA MERGING USING GENERAL LINEAR REGRESSION
    Turlapaty, Anish C.
    Younan, Nicolas H.
    Anantharaj, Valentine
    2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 1561 - +
  • [6] Statistical properties of large data sets with linear latent features
    Fleig, Philipp
    Nemenman, Ilya
    PHYSICAL REVIEW E, 2022, 106 (01)
  • [7] APPLICATION OF PREDICTIVE METHODS TO FINANCIAL DATA SETS
    Habibi, Reza
    FINANCIAL INTERNET QUARTERLY, 2021, 17 (01) : 50 - 61
  • [8] Construction of Confidence Absorbing Sets Using Statistical Methods
    A.I. Kibzun
    S.V. Ivanov
    Automation and Remote Control, 2020, 81 : 2206 - 2219
  • [9] Construction of Confidence Absorbing Sets Using Statistical Methods
    Kibzun, A. I.
    Ivanov, S. V.
    AUTOMATION AND REMOTE CONTROL, 2020, 81 (12) : 2206 - 2219
  • [10] StatTeacherAssistant: An Application for Creating, Adjusting, and Checking the Suitability of Data Sets for Courses that Incorporate Introductory Statistical Methods
    Casement, Christopher J.
    McSweeney, Laura A.
    JOURNAL OF STATISTICS AND DATA SCIENCE EDUCATION, 2024, 32 (01): : 36 - 46