The theory of reinforcement learning (RL) was originally motivated by animal learning of sequential behavior, but has been developed and extended in the field of machine learning as an approach to Markov decision processes. Recently, a number of neuroscience studies have suggested a relationship between reward-related activities in the brain and functions necessary for RL. Regarding the history of RL, we introduce in this article the theory of RL and present two engineering applications. Then we discuss possible implementations in the brain.
机构:
Univ Ottawa, Sch Psychol, 136 Jean Jacques Lussier, Ottawa, ON K1N 6N5, Canada
Univ Ottawa, Brain & Mind Res Inst, Ottawa, ON, CanadaUniv Ottawa, Sch Psychol, 136 Jean Jacques Lussier, Ottawa, ON K1N 6N5, Canada
Thivierge, J. P.
Giraud, Eloise
论文数: 0引用数: 0
h-index: 0
机构:
Univ Ottawa, Sch Psychol, 136 Jean Jacques Lussier, Ottawa, ON K1N 6N5, Canada
Univ Ottawa, Brain & Mind Res Inst, Ottawa, ON, CanadaUniv Ottawa, Sch Psychol, 136 Jean Jacques Lussier, Ottawa, ON K1N 6N5, Canada
Giraud, Eloise
Lynn, Michael
论文数: 0引用数: 0
h-index: 0
机构:
Univ Ottawa, Brain & Mind Res Inst, Ottawa, ON, Canada
Univ Ottawa, Dept Cellular & Mol Med, Ottawa, ON, CanadaUniv Ottawa, Sch Psychol, 136 Jean Jacques Lussier, Ottawa, ON K1N 6N5, Canada
机构:
Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Yang Jing
Li Bin
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Li Bin
Li Shaobo
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Li Shaobo
Wang Qi
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R ChinaGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Wang Qi
Yu Liya
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Yu Liya
Hu Jianjun
论文数: 0引用数: 0
h-index: 0
机构:
Univ South Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USAGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
Hu Jianjun
Yuan Kun
论文数: 0引用数: 0
h-index: 0
机构:
Guizhou Univ, Sch Mech Engn, Guiyang 550025, Peoples R ChinaGuizhou Univ, State Key Lab Publ Big Data, Guiyang 550025, Peoples R China
机构:
School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan,430074, ChinaSchool of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan,430074, China
Zeng, Haochen
Hu, Bin
论文数: 0引用数: 0
h-index: 0
机构:
School of Future Technology, South China University of Technology, Guangzhou,510641, China
Guangdong Artificial Intelligence and Digital Economy Laboratory (Guangzhou), Guangzhou,510335, ChinaSchool of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan,430074, China
Hu, Bin
Guan, Zhihong
论文数: 0引用数: 0
h-index: 0
机构:
School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan,430074, ChinaSchool of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan,430074, China