Скачать с ютуб видео Andreas Krause:

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Скачать видео с ютуб по ссылке или смотреть без блокировок на сайте: Andreas Krause: "Safe and Efficient Exploration in Reinforcement Learning" в качестве 4k

У нас вы можете посмотреть бесплатно Andreas Krause: "Safe and Efficient Exploration in Reinforcement Learning" или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:

Информация по загрузке:

Скачать mp3 с ютуба отдельным файлом. Бесплатный рингтон Andreas Krause: "Safe and Efficient Exploration in Reinforcement Learning" в формате MP3:

Если кнопки скачивания не загрузились НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу страницы.
Спасибо за использование сервиса ClipSaver.ru

Andreas Krause: "Safe and Efficient Exploration in Reinforcement Learning"

Intersections between Control, Learning and Optimization 2020 "Safe and Efficient Exploration in Reinforcement Learning" Andreas Krause - ETH Zurich Abstract: At the heart of Reinforcement Learning lies the challenge of trading exploration -- collecting data for learning better models -- and exploitation -- using the estimate to make decisions. In simulated environments (e.g., games), exploration is primarily a computational concern. In real-world settings, exploration is costly, and a potentially dangerous proposition, as it requires experimenting with actions that have unknown consequences. In this talk, I will present our work towards rigorously reasoning about safety of exploration in reinforcement learning. I will discuss a model-free approach, where we seek to optimize an unknown reward function subject to unknown constraints. Both reward and constraints are revealed through noisy experiments, and safety requires that no infeasible action is chosen at any point. I will also discuss model-based approaches, where we learn about system dynamics through exploration, yet need to verify safety of the estimated policy. Our approaches use Bayesian inference over the objective, constraints and dynamics, and -- under some regularity conditions -- are guaranteed to be both safe and complete, i.e., converge to a natural notion of reachable optimum. I will also present recent results harnessing the model uncertainty for improving efficiency of exploration, and show experiments on safely and efficiently tuning cyber-physical systems in a data-driven manner. Institute for Pure and Applied Mathematics, UCLA February 26, 2020 For more information: http://www.ipam.ucla.edu/lco2020

Comments