Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines

被引:0
|
作者
Flynn, Patrick [1 ,2 ]
Vanderbruggen, Tristan [1 ]
Liao, Chunhua [1 ]
Lin, Pei-Hung [1 ]
Emani, Murali [3 ]
Shen, Xipeng [4 ]
机构
[1] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
[2] Univ North Carolina Charlotte, Charlotte, NC 28223 USA
[3] Argonne Natl Lab, Lemont, IL 60439 USA
[4] North Carolina State Univ, Raleigh, NC 27695 USA
关键词
reusable datasets; reusable machine learning; programming language processing; interoperable pipelines;
D O I
10.1007/978-3-031-36889-9_27
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Programming Language Processing (PLP) using machine learning has made vast improvements in the past few years. Increasingly more people are interested in exploring this promising field. However, it is challenging for new researchers and developers to find the right components to construct their own machine learning pipelines, given the diverse PLP tasks to be solved, the large number of datasets and models being released, and the set of complex compilers or tools involved. To improve the findability, accessibility, interoperability and reusability (FAIRness) of machine learning components, we collect and analyze a set of representative papers in the domain of machine learning-based PLP. We then identify and characterize key concepts including PLP tasks, model architectures and supportive tools. Finally, we show some example use cases of leveraging the reusable components to construct machine learning pipelines to solve a set of PLP tasks.
引用
收藏
页码:402 / 417
页数:16
相关论文
共 50 条
  • [21] Machine learning in medicine: a practical introduction to natural language processing
    Harrison, Conrad J.
    Sidey-Gibbons, Chris J.
    BMC MEDICAL RESEARCH METHODOLOGY, 2021, 21 (01)
  • [22] Machine learning in medicine: a practical introduction to natural language processing
    Conrad J. Harrison
    Chris J. Sidey-Gibbons
    BMC Medical Research Methodology, 21
  • [23] Railroad accident analysis by machine learning and natural language processing
    Bridgelall, Raj
    Tolliver, Denver D.
    JOURNAL OF RAIL TRANSPORT PLANNING & MANAGEMENT, 2024, 29
  • [24] Automotive fault nowcasting with machine learning and natural language processing
    John Pavlopoulos
    Alv Romell
    Jacob Curman
    Olof Steinert
    Tony Lindgren
    Markus Borg
    Korbinian Randl
    Machine Learning, 2024, 113 : 843 - 861
  • [25] Automotive fault nowcasting with machine learning and natural language processing
    Pavlopoulos, John
    Romell, Alv
    Curman, Jacob
    Steinert, Olof
    Lindgren, Tony
    Borg, Markus
    Randl, Korbinian
    MACHINE LEARNING, 2024, 113 (02) : 843 - 861
  • [26] Application of Natural Language Processing and Machine Learning to Radiology Reports
    Jeon, Seoungdeok
    Colburn, Zachary
    Sakai, Joshua
    Hung, Ling-Hong
    Yeung, Ka Yee
    12TH ACM CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS (ACM-BCB 2021), 2021,
  • [27] Natural language processing and machine learning to assist radiation oncology incident learning
    Mathew, Felix
    Wang, Hui
    Montgomery, Logan
    Kildea, John
    JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2021, 22 (11): : 172 - 184
  • [28] Natural language processing and machine learning to assist radiation oncology incident learning
    Mathew, Felix
    Wang, Hui
    Montgomery, Logan
    Kildea, John
    MEDICAL PHYSICS, 2021, 48 (08) : 4704 - 4705
  • [29] Herschel vision: A hyperspectral image processing software for data preparation in machine learning pipelines
    Ram, Billy G.
    Sunil, G. C.
    Sun, Xin
    SOFTWAREX, 2025, 30
  • [30] Arabic natural language processing and machine learning-based systems
    Larabi Marie-Sainte S.
    Alalyani N.
    Alotaibi S.
    Ghouzali S.
    Abunadi I.
    IEEE Access, 2019, 7 : 7011 - 7020