TinyRL: Can AI Learn to Swing Up a Real Pendulum? | DigiKey

Published: 1 year ago

Tags: DigiKey, Shawn, ai, arduino, control theory, esp32, machine learning, microcontroller, ml, pendulum, reinforcement learning, rl, tinyml, tinyrl

Reinforcement learning (RL) is a form of machine learning that involves training agents to interact with an environment in order to maximize cumulative rewards. In this video, we teach an AI to swing up a pendulum using real hardware and RL. A write-up of the project can be found here: https://www.digikey.com/en/maker/proj...

An RL agent learns to interact with its environment using trial and error. Shawn creates an interface in Arduino that can control a stepper motor and read the position of an encoder attached to the pendulum. The goal is to train an agent to learn to swing up the pendulum on its own.

Intro to Reinforcement Learning video: Introduction to Reinforcement Learning | D...
Hyperparameter Optimization video: Hyperparameter Optimization for Reinforcem...

To accomplish this, the Arduino is connected to a computer running the Farama Gymnasium and Stable Baselines3 frameworks. These frameworks take in the observations, have the agent guess an action, and tell the Arduino what action to take. The agent is updated using the proximal policy optimization (PPO) algorithm found in Stable Baselines3.

Initially, Shawn tried to perform a full swing-up and balance with a continuous action set. However, this proved too difficult for the agent, as the round trip time to and from the Arduino along with model updates took too long to successfully balance the pole.

To reduce the scope, the action set was made into a discrete set (+10 deg, 0 deg, -10 deg), and the episode ended when the pendulum reached the top under a particular speed. If the pendulum moved too fast near the top, it was considered to have “crashed,” and a penalty was applied.

Once the agent successfully learned how to perform the swing-up, it was deployed to the Arduino. To perform the deployment, the critic portion of the actor-critic model in the PPO agent was stripped away, and the remaining actor model (3-layer dense neural network) was optimized using Edge Impulse.
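
As a rough illustration of what remains once the critic is stripped away, the sketch below runs a forward pass through a three-layer dense actor and picks the highest-scoring discrete action. It is a minimal sketch under assumptions, not the model from the video: the two-value observation, the layer sizes, the tanh activation, and the hand-picked weights in `DEMO_LAYERS` are all hypothetical, and `dense` and `actor_action` are illustrative names.

```python
import math

# Hypothetical weights for illustration only: one (weight matrix, bias vector)
# pair per dense layer. A trained actor would supply its own learned values.
DEMO_LAYERS = [
    ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]),              # hidden layer 1
    ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0]),              # hidden layer 2
    ([[1.0, 0.0], [0.0, 0.0], [-1.0, 0.0]], [0.0] * 3),  # output: 3 action scores
]


def dense(inputs, weights, biases):
    """Compute one fully connected layer: weights @ inputs + biases."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, biases)
    ]


def actor_action(observation, layers):
    """Return the index of the highest-scoring action (greedy policy)."""
    activations = list(observation)
    for index, (weights, biases) in enumerate(layers):
        activations = dense(activations, weights, biases)
        if index < len(layers) - 1:
            # Assumed tanh activation, applied to hidden layers only.
            activations = [math.tanh(a) for a in activations]
    return max(range(len(activations)), key=lambda i: activations[i])


if __name__ == "__main__":
    print(actor_action([0.5, 0.0], DEMO_LAYERS))
```

No critic is evaluated here: the value estimate is only needed while training with PPO, so inference on a microcontroller needs only the actor's layers.
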
The model was then deployed to an ESP32S3 to perform the swing-up without any input from the computer.

Product Links:
STEVAL-EDUKIT01 - https://www.digikey.com/en/products/d...
Seeed Studio XIAO ESP32S3 - https://www.digikey.com/en/products/d...

Related Videos:
Introduction to Reinforcement Learning | D...
Exploring Reinforcement Learning: Can AI L...

Related Project Links:
https://www.digikey.com/en/maker/proj...
https://www.digikey.com/en/maker/proj...

Learn more:
Maker.io - https://www.digikey.com/en/maker
DigiKey’s Blog (TheCircuit) - https://www.digikey.com/en/blog
Connect with DigiKey on Facebook / digikey.electronics
And follow us on X (formerly Twitter) / digikey

00:00 - Introduction
01:10 - Hardware overview
03:00 - Modifying the pendulum tower
04:20 - Arduino communication interface
04:49 - Overview of reinforcement learning
06:17 - Reward function
08:32 - Agent actor-critic deep neural network
09:33 - Hyperparameter optimization overview
09:51 - Agent training with Python
14:57 - Troubleshooting an agent that does not learn
16:46 - Reduce scope to just swing up and use discrete action space
18:03 - Train simpler agent
18:22 - Deploy agent to ESP32
19:56 - Test agent on the pendulum
20:46 - Conclusion and further areas of research
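
The reduced-scope task described above (three discrete actions, an episode that ends at the top below a speed limit, and a penalty for arriving too fast) could be sketched roughly as follows. This is a sketch under stated assumptions, not the reward function from the video: the angle convention (0 deg hanging down, 180 deg upright), every threshold and reward value, the shaping term, the choice to end the episode on a crash, and the names `angle_from_upright` and `step_outcome` are all hypothetical.

```python
# Discrete action set from the description: action index -> stepper move (deg).
ACTIONS_DEG = (10.0, 0.0, -10.0)

# Hypothetical values; the description does not give the real ones.
TOP_WINDOW_DEG = 10.0      # how close to upright counts as "at the top"
MAX_TOP_SPEED_DPS = 200.0  # faster than this near the top counts as a crash
SUCCESS_REWARD = 100.0
CRASH_PENALTY = -100.0


def angle_from_upright(angle_deg):
    """Distance from upright in degrees, in the range [0, 180].

    Assumes 0 deg is hanging straight down and 180 deg is upright.
    """
    return abs(angle_deg % 360.0 - 180.0)


def step_outcome(angle_deg, speed_dps):
    """Return (reward, terminated) for the pendulum state after one action."""
    distance = angle_from_upright(angle_deg)
    if distance <= TOP_WINDOW_DEG:
        if abs(speed_dps) > MAX_TOP_SPEED_DPS:
            # Too fast near the top: treated as a crash (assumed to end the episode).
            return CRASH_PENALTY, True
        # Reached the top slowly enough: the swing-up succeeded.
        return SUCCESS_REWARD, True
    # Assumed shaping term: a small negative reward that shrinks near upright.
    return -distance / 180.0, False


if __name__ == "__main__":
    print(step_outcome(175.0, 50.0))   # near the top, slow
    print(step_outcome(175.0, 500.0))  # near the top, too fast
    print(step_outcome(0.0, 0.0))      # hanging straight down
```

In a Gymnasium environment, logic like this would sit inside `step()`, after the chosen entry of `ACTIONS_DEG` has been sent to the Arduino and the new encoder reading has come back.
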
