Attribution Graphs for Dummies - 2. Building and Testing a Circuit
Part 1: Attribution Graphs for Dummies - 1. What a...

Generating new attribution graphs and steering them, featuring Jack Lindsey (Anthropic), Emmanuel Ameisen (Anthropic), Tom McGrath (Goodfire AI), and Neel Nanda (Google DeepMind).

0:00 Introduction
3:33 Generating a New Graph
4:30 Tracing Back from the Outputs
12:05 Tracing Back Further
17:05 Making It Back to the Embeddings
30:35 Grouping Related Features
41:35 Rounding Up All the Important Features
1:01:56 Validating Circuits with Steering
1:19:55 Recap / Reflection

Explore Attribution Graphs: https://neuronpedia.org/graph
Blog Post: https://www.neuronpedia.org/graph/info
circuit-tracer GitHub: https://github.com/safety-research/ci...

Original Papers by Anthropic
Circuit Tracing: https://transformer-circuits.pub/2025...
Biology of an LLM: https://transformer-circuits.pub/2025...

Learn More and Get Involved:
Exploring Gemma Scope: A beginner-friendly guided demo for AI interpretability - https://neuronpedia.org/gemma-scope
MATS: A paid fellowship for doing real, supervised mechanistic interpretability research with no prior experience required - https://www.matsprogram.org
ARENA: A free, guided course on Alignment Research where you can work independently or in-person - https://www.arena.education
SPAR: Part-time, remote fellowship to do 3-month research projects, all experience levels accepted - https://sparai.org