Visualizing High Dimensional Datasets Using Parallel Coordinates: Application to Gene Prioritization

被引:0
|
作者
Boogaerts, Thomas [1 ]
Tranchevent, Leon-Charles [1 ]
Pavlopoulos, Georgios A. [1 ]
Aerts, Jan [1 ]
Vandewalle, Joos [1 ]
机构
[1] Katholieke Univ Leuven, ESAT SCD SISTA IBBT, KU Leuven Future Hlth Dept, B-3001 Louvain, Belgium
关键词
data visualization; parallel coordinates; genetic algorithm; gene prioritization;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In this paper, we introduce a visualization tool for interactive and efficient exploration of high dimensional data using parallel coordinates. An algorithm is developed to find an optimal permutation of dimensions, which allows the data miner to immediately see the most important features or irregularities in the dataset. This is implemented as a genetic algorithm based on the travelling salesman problem using maximal correlation as fitness. Other features of the tool include selection operators to group the data such as selection by intersection or by angle, orthogonal and density plots complementing the parallel coordinates plot, manual arrangement of permutation order of the dimensions, possibility to show all plots necessary to see all dimensional relations and displaying a certain number of standard deviations for each dimension separately. The tool is applied to multiple gene prioritization cases in search of genes that are relevant to certain genetic disorders. The used datasets are obtained with the MerKator and Endeavour tools and include a Breast cancer, Cataract, Charcoth-Marie-Tooth and Cardiomyopathy dataset, as well as a dataset relating 29 diseases with 22206 genes. Our tool, manual and data can be downloaded from http://www.toomas.be/parcoord/.
引用
收藏
页码:52 / 57
页数:6
相关论文
共 50 条
  • [41] Visualizing high-dimensional structures by dimension ordering and filtering using subspace analysis
    Ferdosi, Bilkis J.
    Roerdink, Jos B. T. M.
    Eurovis: Eurographics/IEEE Symposium on Visualization, 2011, 30 (03): : 1121 - 1130
  • [42] A Novel Approach of Ensemble Methods Using the Stacked Generalization for High-dimensional Datasets
    Sharma, Suvita Rani
    Singh, Birmohan
    Kaur, Manpreet
    IETE JOURNAL OF RESEARCH, 2023, 69 (10) : 6802 - 6817
  • [43] Using Shannon's entropy to sample heterogeneous and high-dimensional atmospheric datasets
    Paul, M.
    Aires, F.
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2015, 141 (687) : 469 - 476
  • [44] Map-in-Parallel-Coordinates Plot (MPCP): Field Trial Studies of High-Dimensional Geographical Data Analysis
    Liu, Jia
    Wan, Gang
    Jia, Yutong
    Liu, Wei
    Xie, Zhuli
    Su, Zhijuan
    Li, Chu
    Peng, Siqing
    ELECTRONICS, 2023, 12 (09)
  • [45] Visualizing dependence in high-dimensional data: An application to S&P 500 constituent data
    Hofert, Marius
    Oldford, Wayne
    ECONOMETRICS AND STATISTICS, 2018, 8 : 161 - 183
  • [46] Gene Selection Using High Dimensional Gene Expression Data: An Appraisal
    Bhola, Abhishek
    Singh, Shailendra
    CURRENT BIOINFORMATICS, 2018, 13 (03) : 225 - 233
  • [47] GPU cards as a low cost solution for efficient and fast classification of high dimensional gene expression datasets
    Benso, A.
    Di Carlo, S.
    Politano, G.
    Savino, A.
    Scionti, A.
    CONTROL ENGINEERING AND APPLIED INFORMATICS, 2010, 12 (03): : 34 - 40
  • [48] Filter vs. Wrapper approach for optimum gene selection of high dimensional gene expression dataset: An analysis with cancer datasets
    Srivastava, Bhavna
    Jangid, Mahesh
    Srivastava, Rajeev
    2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND APPLICATIONS (ICHPCA), 2014,
  • [49] Learning a single-hidden layer feedforward neural network using a rank correlation-based strategy with application to high dimensional gene expression and proteomic spectra datasets in cancer detection
    Belciug, Smaranda
    Gorunescu, Florin
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 83 : 159 - 166
  • [50] Visualizing Three-Dimensional Hybrid Atomic Orbitals Using Winplot: An Application for Student Self Instruction
    Saputra, Andrian
    Canaval, Lorentz R.
    Sunyono
    Fadiawati, Noor
    Diawati, Chansyanah
    Setyorini, M.
    Kadaritna, Nina
    Kadaryanto, Budi
    JOURNAL OF CHEMICAL EDUCATION, 2015, 92 (09) : 1557 - 1558