У нас вы можете посмотреть бесплатно Automated Hypothesis Validation with Agentic Sequential Falsifications или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
Speaker: Kexin Huang, PhD Student, Stanford University Abstract: (provisional) Hypotheses are central to information acquisition, decision-making, and discovery. However,many real-world hypotheses are abstract, highlevel statements that are difficult to validate directly. This challenge is further intensified bythe rise of hypothesis generation from Large Language Models (LLMs), which are prone to hallucination and produce hypotheses in volumes thatmake manual validation impractical. Here wepropose POPPER, an agentic framework for rigorous automated validation of free-form hypotheses.Guided by Karl Popper’s principle of falsification, POPPER validates a hypothesis using LLMagents that design and execute falsification experiments targeting its measurable implications. Anovel sequential testing framework ensures strictType-I error control while actively gathering evidence from diverse observations, whether drawnfrom existing data or newly conducted procedures.We demonstrate POPPER on six domains including biology, economics, and sociology. POPPERdelivers robust error control, high power, and scalability. Furthermore, compared to human scientists, POPPER achieved comparable performancein validating complex biological hypotheses whilereducing time by 10 folds, providing a scalable,rigorous solution for hypothesis validation. Bio: Kexin Huang (https://www.kexinhuang.com/) is a fourth-year PhD student in Computer Science at Stanford University, advised by Prof. Jure Leskovec. His research focuses on leveraging AI to drive novel, deployable, and interpretable biomedical discoveries, while also tackling fundamental AI challenges such as multi-modal modeling, uncertainty quantification, and agentic reasoning. His work has been published in Nature Medicine, Nature Biotechnology, Nature Chemical Biology, Nature Biomedical Engineering, and machine learning conferences including NeurIPS, ICML, ICLR, and UAI. His research has been featured in major media outlets such as Forbes, WIRED, and MIT Technology Review. He has also contributed to machine learning research at leading companies and institutions, including Genentech, GSK, Pfizer, IQVIA, Flatiron Health, Dana-Farber Cancer Institute, and Rockefeller University.