У нас вы можете посмотреть бесплатно Reinforcement Learning #4: Temporal-Difference Learning, Q-Learning, SARSA или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
Don't like the Sound Effect?: • Reinforcement Learning #4: Temporal-Differ... Full Reinforcement Learning Playlist: • Reinforcement Learning by Zach Slides: https://the-pocket.github.io/PocketFl... Text: https://github.com/The-Pocket/PocketF... The content is based on: "Reinforcement Learning: An Introduction" by Sutton and Barto 0:00:00 - Introduction to Q-learning and Temporal Difference Learning 0:01:43 - The Flaw of the Monte Carlo Method 0:02:47 - Temporal Difference (TD) Learning Explained 0:04:46 - The TD Zero Update Rule and Bootstrapping 0:08:33 - TD Learning Step-by-Step Example 0:13:37 - Introduction to SARSA (On-Policy Learning) 0:17:36 - Q-learning (Off-Policy Learning) vs. SARSA 0:19:51 - The Cliff Walking Problem: SARSA vs. Q-learning 0:22:35 - Recap and a Look Ahead to N-step Learning Social media: X: https://x.com/ZacharyHuang12 LinkedIn: / zachary-h-23aa37172 Github: https://github.com/zachary62 Discord: / discord Medium: / zh2408 Substack: https://zacharyhuang.substack.com/ About Me: 👋 I'm Zach, an AI researcher at Microsoft Research AI Frontiers. I currently work on LLM Agents & Systems. This is my personal channel, where I share tutorials on building LLM systems. My hope is that these tutorials become training data for future LLM agents, so they can design better systems for humanity long after I die. Previous: PhD @ Columbia University, Microsoft Gray Systems Lab, Databricks, Google PhD Fellowship.