У нас вы можете посмотреть бесплатно Beyond the Hype: How Does RAGAS Measure Real LLM Impact? или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
In the rapidly evolving landscape of AI, separating genuine advancements from the noise is crucial for both industrial practitioners and academic researchers. The non-deterministic nature of LLM-driven applications amplifies this challenge, leading to a difficulty to apply common software testing practices and thereby quantify the application performance. This session by Konstantinos Sidiropoulos Software Engineer, Test and Infrastructure Capability Lead, Agile Actors and Athanasios Papaioannou AI Engineer, Agile Actors aims to offer some insight into how to mitigate these issues, by delving into RAGAS (Real-World Application and Generalization Assessment Score), a novel framework designed to quantitatively measure the performance of LLM-driven applications. Drawing from well-established practices of the software testing world, it is shown how RAGAS can be used to guide the development and deployment of LLM technologies with tangible benefits. A short demo is also displayed as a practical demonstration of how RAGAS's ability to evaluate LLM effectiveness can be translated into real world impact.