Скачать с ютуб видео HPC Café: Inference in the Age of Reasoning Models

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Скачать видео с ютуб по ссылке или смотреть без блокировок на сайте: HPC Café: Inference in the Age of Reasoning Models в качестве 4k

У нас вы можете посмотреть бесплатно HPC Café: Inference in the Age of Reasoning Models или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:

Информация по загрузке:

Скачать mp3 с ютуба отдельным файлом. Бесплатный рингтон HPC Café: Inference in the Age of Reasoning Models в формате MP3:

Если кнопки скачивания не загрузились НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу страницы.
Спасибо за использование сервиса ClipSaver.ru

HPC Café: Inference in the Age of Reasoning Models

Speaker: Dr. Séverine Habert, NVIDIA Date: November 11, 2025 Slides: https://hpc.fau.de/files/2025/11/infe... Abstract: This presentation explores how distributed and disaggregated inference techniques enable scalable execution of large language models (LLMs), particularly in the context of reasoning and agentic AI. It highlights architectural optimizations such as KV caching, prefix reuse, KV-cache aware routing and KV-cache offloading which improve performance, reduce latency, and support efficient deployment at the cluster level of inference workloads. Material from past events is available at: https://hpc.fau.de/teaching/hpc-cafe/

Comments