Data augmentation in microscopic images for material data mining

被引:0
|
作者
Boyuan Ma
Xiaoyan Wei
Chuni Liu
Xiaojuan Ban
Haiyou Huang
Hao Wang
Weihua Xue
Stephen Wu
Mingfei Gao
Qing Shen
Michele Mukeshimana
Adnan Omer Abuassba
Haokai Shen
Yanjing Su
机构
[1] University of Science and Technology Beijing,Beijing Advanced Innovation Center for Materials Genome Engineering
[2] University of Science and Technology Beijing,School of Computer and Communication Engineering
[3] Beijing Key Laboratory of Knowledge Engineering for Materials Science,Institute for Advanced Materials and Technology
[4] University of Science and Technology Beijing,School of Materials Science and Engineering
[5] University of Science and Technology Beijing,School of Materials Science and Technology
[6] Liaoning Technical University,The Institute of Statistical Mathematics
[7] Research Organization of Information and Systems,Faculty of Engineering Sciences
[8] Tachikawa,College of Information Science and Engineering
[9] National Intellectual Property Administration,Key Lab of Petroleum Data Mining
[10] University of Burundi,undefined
[11] Faculty of Engineering and Technology,undefined
[12] Palestine Technical University – Kadoorie,undefined
[13] China University of Petroleum,undefined
[14] China University of Petroleum,undefined
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Recent progress in material data mining has been driven by high-capacity models trained on large datasets. However, collecting experimental data (real data) has been extremely costly owing to the amount of human effort and expertise required. Here, we develop a novel transfer learning strategy to address problems of small or insufficient data. This strategy realizes the fusion of real and simulated data and the augmentation of training data in a data mining procedure. For a specific task of grain instance image segmentation, this strategy aims to generate synthetic data by fusing the images obtained from simulating the physical mechanism of grain formation and the “image style” information in real images. The results show that the model trained with the acquired synthetic data and only 35% of the real data can already achieve competitive segmentation performance of a model trained on all of the real data. Because the time required to perform grain simulation and to generate synthetic data are almost negligible as compared to the effort for obtaining real data, our proposed strategy is able to exploit the strong prediction power of deep learning without significantly increasing the experimental burden of training data preparation.
引用
收藏
相关论文
共 50 条
  • [1] Data augmentation in microscopic images for material data mining (vol 6, 125, 2020)
    Ma, Boyuan
    Wei, Xiaoyan
    Liu, Chuni
    Ban, Xiaojuan
    Huang, Haiyou
    Wang, Hao
    Xue, Weihua
    Wu, Stephen
    Gao, Mingfei
    Shen, Qing
    Mukeshimana, Michele
    Abuassba, Adnan Omer
    Shen, Haokai
    Su, Yanjing
    NPJ COMPUTATIONAL MATERIALS, 2020, 6 (01)
  • [2] Mining images of material nanostructure data
    Varde, Aparna
    Liang, Jianyu
    Rundensteiner, Elke
    Sisson, Richard, Jr.
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, PROCEEDINGS, 2006, 4317 : 403 - +
  • [3] Data augmentation in material images using the improved HP-VAE-GAN
    Han, Yuexing
    Liu, Yuhong
    Chen, Qiaochuan
    COMPUTATIONAL MATERIALS SCIENCE, 2023, 226
  • [4] On data augmentation for segmenting hyperspectral images
    Nalepa, Jakub
    Myller, Michal
    Kawulok, Michal
    Smolka, Bogdan
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2019, 2019, 10996
  • [5] Data augmentation and data mining towards microstructure and property relationship for composites
    Guo, Ziyan
    Liu, Xuhao
    Pan, Zehua
    Zhou, Yexin
    Zhong, Zheng
    Yan, Zilin
    ENGINEERING COMPUTATIONS, 2023, 40 (7/8) : 1617 - 1632
  • [6] Data Augmentation for Images of Chronic Foot Wounds
    Gutbrod, Max
    Geisler, Benedikt
    Rauber, David
    Palm, Christoph
    BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 261 - 266
  • [7] Text and data mining for material synthesis
    Olivetti, Elsa
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [8] Imputing manufacturing material in data mining
    Ruey-Ling Yeh
    Ching Liu
    Ben-Chang Shia
    Yu-Ting Cheng
    Ya-Fang Huwang
    Journal of Intelligent Manufacturing, 2008, 19 : 109 - 118
  • [9] Imputing manufacturing material in data mining
    Yeh, Ruey-Ling
    Liu, Ching
    Shia, Ben-Chang
    Cheng, Yu-Ting
    Huwang, Ya-Fang
    JOURNAL OF INTELLIGENT MANUFACTURING, 2008, 19 (01) : 109 - 118
  • [10] Data augmentation on mice liver cirrhosis microscopic images employing convolutional neural networks and support vector machine
    Zheng, Longfei
    Wang, Yu
    Hemanth, D. Jude
    Sangiah, Arun Kumar
    Shi, Fuqian
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 10 (10) : 4023 - 4032