Applying genetic programming to PSB2: the next generation program synthesis benchmark suite

被引:0
|
作者
Thomas Helmuth
Peter Kelly
机构
[1] Hamilton College,Computer Science
关键词
Automatic program synthesis; Benchmarking; Genetic programming;
D O I
暂无
中图分类号
学科分类号
摘要
For the past seven years, researchers in genetic programming and other program synthesis disciplines have used the General Program Synthesis Benchmark Suite (PSB1) to benchmark many aspects of systems that conduct programming by example, where the specifications of the desired program are given as input/output pairs. PSB1 has been used to make notable progress toward the goal of general program synthesis: automatically creating the types of software that human programmers code. Many of the systems that have attempted the problems in PSB1 have used it to demonstrate performance improvements granted through new techniques. Over time, the suite has gradually become outdated, hindering the accurate measurement of further improvements. The field needs a new set of more difficult benchmark problems to move beyond what was previously possible and ensure that systems do not overfit to one benchmark suite. In this paper, we describe the 25 new general program synthesis benchmark problems that make up PSB2, a new benchmark suite. These problems are curated from a variety of sources, including programming katas and college courses. We selected these problems to be more difficult than those in the original suite, and give results using PushGP showing this increase in difficulty. We additionally give an example of benchmarking using a state-of-the-art parent selection method, showing improved performance on PSB2 while still leaving plenty of room for improvement. These new problems will help guide program synthesis research for years to come.
引用
收藏
页码:375 / 404
页数:29
相关论文
共 50 条
  • [1] Applying genetic programming to PSB2: the next generation program synthesis benchmark suite
    Helmuth, Thomas
    Kelly, Peter
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2022, 23 (03) : 375 - 404
  • [2] PSB2: The Second Program Synthesis Benchmark Suite
    Helmuth, Thomas
    Kelly, Peter
    PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 785 - 794
  • [3] Towards Understanding and Refining the General Program Synthesis Benchmark Suite with Genetic Programming
    Forstenlechner, Stefan
    Fagan, David
    Nicolau, Miguel
    O'Neill, Michael
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 2151 - 2158
  • [4] General Program Synthesis Benchmark Suite
    Helmuth, Thomas
    Spector, Lee
    GECCO'15: PROCEEDINGS OF THE 2015 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2015, : 1039 - 1046
  • [5] GARDENIA: A Graph Processing Benchmark Suite for Next-Generation Accelerators
    Xu, Zhen
    Chen, Xuhao
    Shen, Jie
    Zhang, Yang
    Chen, Cheng
    Yang, Canqun
    ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2019, 15 (01)
  • [6] Benchmark problem suite for reactor physics study of LWR next generation fuels
    Yamamoto, A
    Ikehara, T
    Ito, T
    Saji, E
    JOURNAL OF NUCLEAR SCIENCE AND TECHNOLOGY, 2002, 39 (08) : 900 - 912
  • [7] Genetic Network Programming for Automatic Program Generation
    Mabu, Shingo
    Hirasawa, Kotaro
    Matsuya, Yuko
    Hu, Jinglu
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2005, 9 (04) : 430 - 436
  • [8] Genetic network programming with subroutines for automatic program generation
    Li, Bing
    Mabu, Shingo
    Hirasawa, Kotaro
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2012, 7 (02) : 197 - 207
  • [9] A Generalizability Measure for Program Synthesis with Genetic Programming
    Sobania, Dominik
    Rothlauf, Franz
    PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 822 - 829
  • [10] Proposal of Multimodal Program Optimization Benchmark and Its Application to Multimodal Genetic Programming
    Harada, Tomohiro
    Murano, Kei
    Thawonmas, Ruck
    2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2020,