In this paper, we show how Behavior Trees that have performance guarantees, in terms of safety and goal convergence, can be extended with components that were designed using machine learning, without destroying those performance guarantees. Machine learning approaches such as reinforcement learning or learning from demonstration can be very appealing to AI designers that want efficient and realistic behaviors in their agents. However, those algorithms seldom provide guarantees for solving the given task in all different situations while keeping the agent safe. Instead, such guarantees are often easier to find for manually designed model-based approaches. In this paper we exploit the modularity of behavior trees to extend a given design with an efficient, but possibly unreliable, machine learning component in a way that preserves the guarantees. The approach is illustrated with an inverted pendulum example.
机构:
Xi’an University of Architecture and Technology,Civil Engineering SchoolXi’an University of Architecture and Technology,Civil Engineering School
Lijuan Han
Guangning Pu
论文数: 0引用数: 0
h-index: 0
机构:
Universities of Shaanxi Province,Engineering Research Center of Green Construction &Smart Maintenance of Urban InfrastructureXi’an University of Architecture and Technology,Civil Engineering School
Guangning Pu
Qi Guo
论文数: 0引用数: 0
h-index: 0
机构:
Xi’an University of Architecture and Technology,Civil Engineering SchoolXi’an University of Architecture and Technology,Civil Engineering School
Qi Guo
Donglei Shi
论文数: 0引用数: 0
h-index: 0
机构:
Universities of Shaanxi Province,Engineering Research Center of Green Construction &Smart Maintenance of Urban InfrastructureXi’an University of Architecture and Technology,Civil Engineering School
Donglei Shi
Bin Liu
论文数: 0引用数: 0
h-index: 0
机构:
Xi’an University of Architecture and Technology,Civil Engineering SchoolXi’an University of Architecture and Technology,Civil Engineering School