Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications

被引:0
|
作者
Abisheva, Gulsipat [1 ]
Goranin, Nikolaj [2 ]
Razakhova, Bibigul [1 ]
Aidynov, Tolegen [3 ]
Satybaldina, Dina [3 ]
机构
[1] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Artificial Intelligence Technol, KZ-010000 Astana, Kazakhstan
[2] Vilnius Gediminas Tech Univ, Fac Fundamental Sci, Dept Informat Syst, LT-08412 Vilnius, Lithuania
[3] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Informat Secur, KZ-010000 Astana, Kazakhstan
关键词
dataset; data collection; machine learning; railway; railway track defects; DEFECT DETECTION; RAILWAY;
D O I
10.3390/s24165239
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents the methodology and outcomes of creating the Rail Vista dataset, designed for detecting defects on railway tracks using machine and deep learning techniques. The dataset comprises 200,000 high-resolution images categorized into 19 distinct classes covering various railway infrastructure defects. The data collection involved a meticulous process including complex image capture methods, distortion techniques for data enrichment, and secure storage in a data warehouse using efficient binary file formats. This structured dataset facilitates effective training of machine/deep learning models, enhancing automated defect detection systems in railway safety and maintenance applications. The study underscores the critical role of high-quality datasets in advancing machine learning applications within the railway domain, highlighting future prospects for improving safety and reliability through automated recognition technologies.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] A Survey on Machine Learning-Based Mobile Big Data Analysis: Challenges and Applications
    Xie, Jiyang
    Song, Zeyu
    Li, Yupeng
    Zhang, Yanting
    Yu, Hong
    Zhan, Jinnan
    Ma, Zhanyu
    Qiao, Yuanyuan
    Zhang, Jianhua
    Guo, Jun
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,
  • [22] Learning nodes: machine learning-based energy and data management strategy
    Yunmin Kim
    Tae-Jin Lee
    EURASIP Journal on Wireless Communications and Networking, 2021
  • [23] Learning nodes: machine learning-based energy and data management strategy
    Kim, Yunmin
    Lee, Tae-Jin
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2021, 2021 (01)
  • [24] Deep learning-based gesture recognition for surgical applications: A data augmentation approach
    Santiago, Sofia Sorbet
    Cifuentes, Jenny Alexandra
    EXPERT SYSTEMS, 2024, 41 (12)
  • [25] Optimizing Data Collection for Machine Learning
    Mahmood, Rafid
    Lucas, James
    Alvarez, Jose M.
    Fidler, Sanja
    Law, Marc T.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [26] Machine Learning-Based Sentiment Analysis of Twitter Data
    Karthiga, M.
    Kumar, Sathish G.
    Aravindhraj, N.
    Priyanka, S.
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,
  • [27] Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
    Tsimenidis, Stefanos
    Vrochidou, Eleni
    Papakostas, George A.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (20)
  • [28] Deep Learning-Based DAS to Geophone Data Transformation
    Fu, Lei
    Li, Weichang
    Ma, Yong
    IEEE SENSORS JOURNAL, 2023, 23 (12) : 12853 - 12860
  • [29] A Comprehensive Review on Deep Learning-Based Data Fusion
    Hussain, Mazhar
    O'Nils, Mattias
    Lundgren, Jan
    Mousavirad, Seyed Jalaleddin
    IEEE ACCESS, 2024, 12 : 180093 - 180124
  • [30] Deep learning-based denoising for PennPET Explorer data
    Wu, Jing
    Daube-Witherspoon, Margaret
    Liu, Hui
    Lu, Wenzhuo
    Onofrey, John
    Karp, Joel
    Liu, Chi
    JOURNAL OF NUCLEAR MEDICINE, 2019, 60