Effectively using unsupervised machine learning in next generation astronomical surveys

被引:10
|
作者
Reis, I. [1 ]
Rotman, M. [2 ]
Poznanski, D. [1 ]
Prochaska, J. X. [3 ]
Wolf, L. [2 ,4 ]
机构
[1] Tel Aviv Univ, Sch Phys & Astron, IL-69978 Tel Aviv, Israel
[2] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[3] Univ Calif Santa Cruz, CO Lick Observ, 156 High St, Santa Cruz, CA 95064 USA
[4] Facebook AI Res, Tel Aviv, Israel
关键词
D O I
10.1016/j.ascom.2020.100437
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
In recent years many works have shown that unsupervised Machine Learning (ML) can help detect unusual objects and uncover trends in large astronomical datasets, but a few challenges remain. We show here, for example, that different methods, or even small variations of the same method, can produce significantly different outcomes. While intuitively somewhat surprising, this can naturally occur when applying unsupervised ML to highly dimensional data, where there can be many reasonable yet different answers to the same question. In such a case the outcome of any single unsupervised ML method should be considered a sample from a conceivably wide range of possibilities. We therefore suggest an approach that eschews finding an optimal outcome, instead facilitating the production and examination of many valid ones. This can be achieved by incorporating unsupervised ML into data visualization portals. We present here such a portal that we are developing, applied to the sample of SDSS spectra of galaxies. The main feature of the portal is interactive 2D maps of the data. Different maps are constructed by applying dimensionality reduction to different subspaces of the data, so that each map contains different information that in turn gives a different perspective on the data. The interactive maps are intuitive to use, and we demonstrate how peculiar objects and trends can be detected by means of a few button clicks. We believe that including tools in this spirit in next generation astronomical surveys will be important for making unexpected discoveries, either by professional astronomers or by citizen scientists, and will generally enable the benefits of visual inspection even when dealing with very complex and extensive datasets. Our portal is available online at galaxyportal.space. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Bot detection using unsupervised machine learning
    Wu, Wei
    Alvarez, Jaime
    Liu, Chengcheng
    Sun, Hung-Min
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2018, 24 (01): : 209 - 217
  • [22] GUASOM: an adaptive visualization tool for unsupervised clustering in spectrophotometric astronomical surveys
    M. A. Álvarez
    C. Dafonte
    M. Manteiga
    D. Garabato
    R. Santoveña
    Neural Computing and Applications, 2022, 34 : 1993 - 2006
  • [23] The next generation of machine learning for tracking adaptation texts
    Sietsma, Anne J.
    Ford, James D.
    Minx, Jan C.
    NATURE CLIMATE CHANGE, 2023, 14 (1) : 31 - 39
  • [24] The next generation of machine learning for tracking adaptation texts
    Sietsma, Anne J.
    Ford, James D.
    Minx, Jan C.
    NATURE CLIMATE CHANGE, 2024, 14 (01) : 31 - 39
  • [25] The next generation of machine learning for tracking adaptation texts
    Anne J. Sietsma
    James D. Ford
    Jan C. Minx
    Nature Climate Change, 2024, 14 : 31 - 39
  • [26] Next-Generation Machine Learning for Biological Networks
    Camacho, Diogo M.
    Collins, Katherine M.
    Powers, Rani K.
    Costello, James C.
    Collins, James J.
    CELL, 2018, 173 (07) : 1581 - 1592
  • [27] Machine learning to boost the next generation of visualization technology
    Ma, Kwan-Liu
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2007, 27 (05) : 6 - 9
  • [28] GUASOM: an adaptive visualization tool for unsupervised clustering in spectrophotometric astronomical surveys
    Alvarez, M. A.
    Dafonte, C.
    Manteiga, M.
    Garabato, D.
    Santovena, R.
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (03): : 1993 - 2006
  • [29] Unsupervised Feature Learning Classification Using An Extreme Learning Machine
    Lam, Dao
    Wunsch, Donald
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [30] Breast cancer prediction using an optimal machine learning technique for next generation sequences
    Kurian, Babymol
    Jyothi, V. L.
    CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2021, 29 (01): : 49 - 57