On application of constitutional descriptors for merging of quinoxaline data sets, using linear statistical methods

被引:4
|
作者
Ghosh, Payell [1 ]
Vracko, Marjan [2 ]
Chattopadhyay, Asis Kumar [3 ]
Bagchi, Manish C. [1 ]
机构
[1] Indian Inst Chem Biol, Struct Biol & Bioinformat Div, Kolkata 700032, India
[2] Natl Inst Chem, Lab Chemometr, Ljubljana 1000, Slovenia
[3] Univ Calcutta, Univ Coll Sci, Dept Stat, Kolkata 700019, India
关键词
principal component analysis; partial least squares; quantitative structure-activity relationship; quinoxaline compounds; theoretical molecular descriptors;
D O I
10.1111/j.1747-0285.2008.00686.x
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The present paper is an attempt for unifying two different quinoxaline data sets with a wide range of substituents in 2, 3, 7, and 8 positions having excellent antitubercular activities with a view to developing robust and reliable structure-activity relationships. The merging has been performed for these two sets of quinoxaline 1,4-di-N-oxides derivatives comprising 29 and 18 compounds, respectively, on the basis of constitutional descriptors, which denotes the structural characterization of the molecules. Principal component analysis was performed to see the distribution of the compounds from two data sets for the constitutional descriptors. The distribution of compounds in score plot based on constitutional descriptors suggests unification of quinoxaline data sets which is useful for the model development. Outlier detection was performed from the standpoint of residual analysis of the partial least squares regression models. The superiority of the constitutional descriptors over other calculated molecular descriptors has been established from the standpoint of leave-one-out cross-validation technique associated with partial least squares regression analysis. Internal validation through the leave-many-out methodology was also performed with good results, assuring the stability of the models. The results obtained from linear partial least squares regression analysis lead to a statistically significant and robust quantitative structure-activity relationship modeling.
引用
收藏
页码:155 / 162
页数:8
相关论文
共 50 条
  • [41] Consistent merging of satellite ocean color data sets using a bio-optical model
    Maritorena, S
    Siegel, DA
    REMOTE SENSING OF ENVIRONMENT, 2005, 94 (04) : 429 - 440
  • [42] Statistical and Non-linear Dynamics Methods of Earthquake Forecast: Application in the Caucasus
    Chelidze, Tamaz
    Melikadze, Giorgi
    Kiria, Tengiz
    Jimsheladze, Tamar
    Kobzev, Gennady
    FRONTIERS IN EARTH SCIENCE, 2020, 8
  • [43] Training support vector data descriptors using converging linear particle swarm optimization
    Hongbo Wang
    Guangzhou Zhao
    Nan Li
    Neural Computing and Applications, 2012, 21 : 1099 - 1105
  • [44] Methods of Estimation in Multiple Linear Regression: Application to Clinical Data
    Coelho-Barros, Emilio Augusto
    Simoes, Priscila Angelotti
    Achcar, Jorge Alberito
    Martinez, Edson Zangiacomi
    Shimano, Antonio Carlos
    REVISTA COLOMBIANA DE ESTADISTICA, 2008, 31 (01): : 111 - 129
  • [45] Application of linear and nonlinear equalization methods for holographic data storage
    Nabavi, S
    Kumar, BVKV
    JAPANESE JOURNAL OF APPLIED PHYSICS PART 1-REGULAR PAPERS BRIEF COMMUNICATIONS & REVIEW PAPERS, 2006, 45 (2B): : 1079 - 1083
  • [46] Application of linear and nonlinear equalization methods for holographic data storage
    Nabavi, Sheida
    Kumar, B. V. K. Vijaya
    Japanese Journal of Applied Physics, Part 1: Regular Papers and Short Notes and Review Papers, 1600, 45 (2 B): : 1079 - 1083
  • [47] Demonstration of a software design and statistical analysis methodology with application to patient outcomes data sets
    Mayo, Charles
    Conners, Steve
    Warren, Christopher
    Miller, Robert
    Court, Laurence
    Popple, Richard
    MEDICAL PHYSICS, 2013, 40 (11)
  • [48] Training support vector data descriptors using converging linear particle swarm optimization
    Wang, Hongbo
    Zhao, Guangzhou
    Li, Nan
    NEURAL COMPUTING & APPLICATIONS, 2012, 21 (06): : 1099 - 1105
  • [49] A nonparametric algorithm for automatic classification of large multivariate statistical data sets and its application
    Zenkov, I., V
    Lapko, A., V
    Lapko, V. A.
    Im, S. T.
    Tuboltsev, V. P.
    Avdeenok, V. L.
    COMPUTER OPTICS, 2021, 45 (02) : 253 - +
  • [50] Training Support Vector Data Descriptors Using Converging Linear Particle Swarm Optimization
    Wang, Hongbo
    Zhao, Guangzhou
    Li, Nan
    LIFE SYSTEM MODELING AND INTELLIGENT COMPUTING, PT I, 2010, 6328 : 196 - 204