This course will provide a comprehensive introduction to reinforcement learning as an approach to artificial intelligence, emphasizing the design of complete agents interacting with stochastic, incompletely known environments. Reinforcement learning has adapted key ideas from machine learning, operations research, psychology, and neuroscience to produce some strikingly successful engineering applications. The focus is on algorithms for learning what actions to take, and when to take them, so as to optimize long-term performance. This may involve sacrificing immediate reward to obtain greater reward in the long-term or just to obtain more information about the environment. The course will cover Markov decision processes, dynamic programming, temporal-difference learning, Monte Carlo reinforcement learning methods, eligibility traces, the role of function approximation, and the integration of learning and planning. The course will emphasize the development of intuition relating the mathematical theory of reinforcement learning to the design of human-level artificial intelligence.
通過條件
成 績 :60 分
-
Chapter1
-
Chapter2
-
Chapter3
-
Chapter4
-
Chapter5
-
Chapter6
-
Chapter7
-
Chapter8
-
Chapter9
-
Chapter10
-
Chapter11
- 課程介紹
- 課程安排
- 評論