У нас вы можете посмотреть бесплатно APEX–Agents или скачать в максимальном доступном качестве, видео которое было загружено на ютуб. Для загрузки выберите вариант из формы ниже:
Если кнопки скачивания не
загрузились
НАЖМИТЕ ЗДЕСЬ или обновите страницу
Если возникают проблемы со скачиванием видео, пожалуйста напишите в поддержку по адресу внизу
страницы.
Спасибо за использование сервиса ClipSaver.ru
Researchers introduced the APEX-Agents benchmark to evaluate whether AI agents are capable of performing complex professional tasks used in fields like investment banking, management consulting, and law. This test was built by industry experts who designed realistic scenarios where the AI must use various tools and files to complete work that would typically take a human one to two hours. The study tested eight different AI models, and the results showed that Gemini 3 Flash performed the best with a success rate of 24%, followed closely by GPT-5.2. Despite these achievements, the low success rates indicate that while AI agents are becoming more capable, they are still not consistent enough to reliably handle the difficult daily work of human professionals. https://arxiv.org/pdf/2601.14242 https://huggingface.co/datasets/merco... https://github.com/Mercor-Intelligenc...