Specifics of Data Collection and Data Processing during Formation of RailVista Dataset for Machine Learning- and Deep Learning-Based Applications

Authors
Abisheva, Gulsipat [1]
Goranin, Nikolaj [2]
Razakhova, Bibigul [1]
Aidynov, Tolegen [3]
Satybaldina, Dina [3]
Affiliations
[1] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Artificial Intelligence Technol, KZ-010000 Astana, Kazakhstan
[2] Vilnius Gediminas Tech Univ, Fac Fundamental Sci, Dept Informat Syst, LT-08412 Vilnius, Lithuania
[3] LN Gumilyov Eurasian Natl Univ, Fac Informat Technol, Dept Informat Secur, KZ-010000 Astana, Kazakhstan
Keywords
dataset; data collection; machine learning; railway; railway track defects; defect detection
DOI
10.3390/s24165239
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
This paper presents the methodology and outcomes of creating the RailVista dataset, designed for detecting railway track defects using machine learning and deep learning techniques. The dataset comprises 200,000 high-resolution images categorized into 19 distinct classes covering various railway infrastructure defects. Data collection involved complex image capture methods, distortion techniques for data enrichment, and secure storage in a data warehouse using efficient binary file formats. This structured dataset facilitates effective training of machine learning and deep learning models, enhancing automated defect detection systems in railway safety and maintenance applications. The study underscores the critical role of high-quality datasets in advancing machine learning applications within the railway domain and highlights future prospects for improving safety and reliability through automated recognition technologies.
Pages: 18