Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications

被引:0
|
作者
Abisheva, Gulsipat [1 ]
Goranin, Nikolaj [2 ]
Razakhova, Bibigul [1 ]
Aidynov, Tolegen [3 ]
Satybaldina, Dina [3 ]
机构
[1] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Artificial Intelligence Technol, KZ-010000 Astana, Kazakhstan
[2] Vilnius Gediminas Tech Univ, Fac Fundamental Sci, Dept Informat Syst, LT-08412 Vilnius, Lithuania
[3] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Informat Secur, KZ-010000 Astana, Kazakhstan
关键词
dataset; data collection; machine learning; railway; railway track defects; DEFECT DETECTION; RAILWAY;
D O I
10.3390/s24165239
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper presents the methodology and outcomes of creating the Rail Vista dataset, designed for detecting defects on railway tracks using machine and deep learning techniques. The dataset comprises 200,000 high-resolution images categorized into 19 distinct classes covering various railway infrastructure defects. The data collection involved a meticulous process including complex image capture methods, distortion techniques for data enrichment, and secure storage in a data warehouse using efficient binary file formats. This structured dataset facilitates effective training of machine/deep learning models, enhancing automated defect detection systems in railway safety and maintenance applications. The study underscores the critical role of high-quality datasets in advancing machine learning applications within the railway domain, highlighting future prospects for improving safety and reliability through automated recognition technologies.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Deep learning-based data analytics for safety in construction
    Liu, Jiajing
    Luo, Hanbin
    Liu, Henry
    AUTOMATION IN CONSTRUCTION, 2022, 140
  • [32] Deep learning-based enhancement of epigenomics data with AtacWorks
    Lal, Avantika
    Chiang, Zachary D.
    Yakovenko, Nikolai
    Duarte, Fabiana M.
    Israeli, Johnny
    Buenrostro, Jason D.
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [33] Deep learning-based enhancement of epigenomics data with AtacWorks
    Avantika Lal
    Zachary D. Chiang
    Nikolai Yakovenko
    Fabiana M. Duarte
    Johnny Israeli
    Jason D. Buenrostro
    Nature Communications, 12
  • [34] Deep Learning-Based Classification of Massive Electrocardiography Data
    Zhou, Lin
    Yan, Yan
    Qin, Xingbin
    Yuan, Chan
    Que, Dashun
    Wang, Lei
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 780 - 785
  • [35] Deep Learning-based Localization in Limited Data Regimes
    Mitchell, Frost
    Baset, Aniqua
    Patwari, Neal
    Kasera, Sneha
    Bhaskara, Aditya
    PROCEEDINGS OF THE 2022 ACM WORKSHOP ON WIRELESS SECURITY AND MACHINE LEARNIG (WISEML '22), 2022, : 15 - 20
  • [36] Latest Trend and Challenges in Machine Learning- and Deep Learning-Based Computational Techniques in Poultry Health and Disease Management: A Review
    Shwetha, V.
    Maddodi, B. S.
    Laxmi, Vijaya
    Kumar, Abhinav
    Shrivastava, Sakshi
    JOURNAL OF COMPUTER NETWORKS AND COMMUNICATIONS, 2024, 2024
  • [37] Acquisition/Processing: Machine learning-based deblending: Dispersed source array data example
    Baardman R.H.
    Hegge R.F.
    Leading Edge, 2021, 40 (10): : 759 - 767
  • [38] Special issue on deep learning-based neural information processing for big data analytics
    Huang, Chuanchao
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (06): : 1513 - 1515
  • [39] An Efficient Training Data Collection Method for Machine Learning-Based Frequency Selective Surface Design
    Liu, Yan-Fang
    Xiao, Li-Ye
    Shao, Wei
    Peng, Lin
    Liu, Qing Huo
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2024, 23 (12): : 4568 - 4572
  • [40] Special issue on deep learning-based neural information processing for big data analytics
    Chuanchao Huang
    Neural Computing and Applications, 2020, 32 : 1513 - 1515