Finding Reusable Machine Learning Components to Build Programming Language Processing Pipelines

Authors
Flynn, Patrick [1, 2]
Vanderbruggen, Tristan [1 ]
Liao, Chunhua [1 ]
Lin, Pei-Hung [1 ]
Emani, Murali [3 ]
Shen, Xipeng [4 ]
Affiliations
[1] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
[2] Univ North Carolina Charlotte, Charlotte, NC 28223 USA
[3] Argonne Natl Lab, Lemont, IL 60439 USA
[4] North Carolina State Univ, Raleigh, NC 27695 USA
Keywords
reusable datasets; reusable machine learning; programming language processing; interoperable pipelines
DOI
10.1007/978-3-031-36889-9_27
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Programming Language Processing (PLP) using machine learning has made vast improvements in the past few years, and more and more people are interested in exploring this promising field. However, it is challenging for new researchers and developers to find the right components to construct their own machine learning pipelines, given the diverse PLP tasks to be solved, the large number of datasets and models being released, and the complex compilers and tools involved. To improve the findability, accessibility, interoperability, and reusability (FAIRness) of machine learning components, we collect and analyze a set of representative papers in the domain of machine learning-based PLP. We then identify and characterize key concepts, including PLP tasks, model architectures, and supportive tools. Finally, we show example use cases that leverage the reusable components to construct machine learning pipelines for a set of PLP tasks.
Pages: 402-417
Page count: 16