У нас вы можете посмотреть бесплатно Jon Leiñena Otamendi - CompactifAI: Quantum-Inspired AI Model Compression - PyData Eindhoven 2025 или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
Jon Leiñena Otamendi - CompactifAI: Quantum-Inspired AI Model Compression - PyData Eindhoven 2025 https://pydata.org/eindhoven2025 Large AI models have become powerful but increasingly impractical; with escalating training costs, bloated memory requirements, and latency bottlenecks that limit real-world deployments. This talk introduces CompactifAI: a quantum-inspired compression framework that uses tensor networks to surgically shrink large models while preserving their accuracy and capabilities. We will begin with the story of how Multiverse came to be in 2019 with the mission to solve today’s problems through quantum technologies. Along this path, we discovered during a project for Bosch that quantum-inspired algorithms running entirely on classical hardware could ultra-compress AI models. In 2024, we realized that these same techniques could be applied to Large Language Models. This insight gave birth to CompactifAI. From there, we’ll walk through CompactifAI and its compression pipeline, highlighting how it outperforms naive pruning or quantization approaches in both precision and control leveraging Tensor Networks. Attendees will see how this enables new deployment scenarios: running powerful LLMs on edge devices, routing queries between local and cloud models, and even removing or restoring specific behaviors (e.g. safety filters or domain knowledge). Additionally, we’ll show how to integrate our compactifAI compressed models via API with minimal code changes and provide relevant developer resources for those interested in benefitting from faster, cheaper, more efficient models. This talk is aimed at ML engineers, researchers, and technical leads working with LLMs, vision models, or constrained deployment targets who are ready to think beyond just “bigger is better.” www.pydata.org PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R. PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases. 00:00 Welcome! 00:10 Help us add time stamps or captions to this video! See the description for details. Want to help add timestamps to our YouTube videos to help with discoverability? Find out more here: https://github.com/numfocus/YouTubeVi...