У нас вы можете посмотреть бесплатно When Models Manipulate Manifolds: The Hidden Geometry of AI Counting или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
In this deep-dive audio overview, we explore the research paper When Models Manipulate Manifolds: The Geometry of a Counting Task, examining how Claude 3.5 Haiku performs structured spatial reasoning without any visual input. Specifically, we unpack the linebreaking task — how a language model determines when a line of text has reached its character limit and needs a newline. In this episode, we discuss: • The Discovery of Counting Manifolds – Instead of storing raw numbers, the model represents character counts on low-dimensional curved manifolds (a 1D curve embedded in a 6D subspace) resembling a rippled helix • The Twist Mechanism – How specialized boundary attention heads rotate counting manifolds to align internal representations with line-width constraints • Linear Decision Boundaries – Why arranging characters remaining and next word length into near-orthogonal subspaces makes the final linebreak decision geometrically simple • Visual Illusions for LLMs – How certain sequences like @@ in git diffs can hijack the counting circuit, producing errors analogous to perceptual illusions • The Complexity Tax – Why shifting from a discrete feature perspective to a geometric manifold view simplifies our understanding of large neural networks This episode is ideal for listeners interested in mechanistic interpretability, the internal “biology” of large language models, and the role of high-dimensional geometry in enabling AI systems to reason about structured tasks. Educational Note: This Podcast is a pedagogical summary of the research conducted by Gurnee et al. (2025) and is intended to complement — not replace — the original paper. Read the full paper here: https://arxiv.org/pdf/2601.04480 #SciPulse #AIResearch #ManifoldLearning #Interpretability #Claude35 #MachineLearning #GeometryOfAI #NeuralNetworks #Anthropic #DeepLearningPodcast #TechDeepDive