Anthropic Head of Pretraining on Scaling Laws, Compute, and the Future of AI
Ever wonder what it actually takes to train a frontier AI model? Ankit Gupta, YC General Partner, sits down with Nick Joseph, Anthropic's Head of Pretraining, to explore the engineering challenges behind training Claude: from managing thousands of GPUs and debugging cursed bugs to balancing compute between pretraining and RL. We cover scaling laws, data strategies, team composition, and why the hardest problems in AI are often infrastructure problems, not ML problems.

Apply to Y Combinator: https://www.ycombinator.com/apply
Work at a startup: https://www.ycombinator.com/jobs

Chapters:
00:00 – Introduction
01:05 – From Vicarious to OpenAI to Anthropic
06:40 – What pretraining is
11:20 – Why next-word prediction won out
16:05 – Scaling laws and the feedback loop of compute → models → revenue
21:50 – Building Anthropic's early infrastructure
27:35 – Efficiency hacks and debugging at scale
33:10 – Generalists vs. specialists on the pretraining team
38:45 – Challenges of training across thousands of GPUs
44:15 – Working with new chips: GPUs vs. TPUs
49:00 – Pretraining vs. post-training (RLHF and reasoning models)
54:25 – The future of data quality and availability
59:10 – Where pretraining goes next
1:03:00 – Closing reflections