Reinforcement Learning

This module explores reinforcement learning, a type of machine learning in which agents learn to make decisions by interacting with an environment so as to maximize cumulative reward. It covers key concepts such as the Markov decision process, policy optimization, and value-based methods, along with applications in areas like gaming, robotics, and autonomous systems.
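As a rough illustration of an agent learning from interaction, the sketch below runs tabular Q-learning (one value-based method) on a tiny chain-shaped Markov decision process. Everything in it is an assumption made for the example and does not come from the module materials: the four-state chain, the reward of 1 for reaching the last state, the hyperparameters `GAMMA`, `ALPHA`, and `EPSILON`, and the helper names `step`, `choose_action`, `train`, and `greedy_policy`.

```python
import random

# Hypothetical toy problem: states 0..3 in a row; state 3 is the terminal goal.
N_STATES = 4
ACTIONS = (0, 1)   # 0 = move left, 1 = move right
GAMMA = 0.9        # discount factor (assumed value)
ALPHA = 0.5        # learning rate (assumed value)
EPSILON = 0.2      # exploration probability (assumed value)


def step(state, action):
    """Deterministic chain environment: returns (next_state, reward, done)."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def choose_action(q, state, rng):
    """Epsilon-greedy selection; ties between equal Q-values are broken at random."""
    if rng.random() < EPSILON:
        return rng.choice(ACTIONS)
    best = max(q[state])
    return rng.choice([a for a in ACTIONS if q[state][a] == best])


def train(episodes=500, seed=0):
    """Run Q-learning from state 0 for a fixed number of episodes."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q, state, rng)
            next_state, reward, done = step(state, action)
            # Bootstrapped target: no future value is added past a terminal state.
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action for each non-terminal state."""
    return [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]


q_table = train()
policy = greedy_policy(q_table)
```

In this toy setting the learned `policy` should choose action 1 (move right) in every non-terminal state, since that is the only way to collect the reward. Real applications such as games or robotics typically need function approximation in place of a lookup table.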

Sutton, Richard S., and Andrew G. Barto. Reinforcement Learning: An Introduction. 2nd ed. Adaptive Computation and Machine Learning series. Cambridge, Massachusetts: The MIT Press, 2018.

Kochenderfer, Mykel J., Tim A. Wheeler, and Kyle H. Wray. Algorithms for Decision Making. Cambridge, Massachusetts: The MIT Press, 2022.

Agarwal, Alekh, Nan Jiang, and Sham M. Kakade. “Reinforcement Learning: Theory and Algorithms,” 2019. https://www.semanticscholar.org/paper/Reinforcement-Learning%3A-Theory-and-Algorithms-Agarwal-Jiang/8ef87e938b53c7f3ffdf47dfc317aa9b82848535

Bertsekas, Dimitri P. Reinforcement Learning and Optimal Control. 2nd printing (includes editorial revisions). Belmont, Massachusetts: Athena Scientific, 2019.