MiniGPT-4 is a multimodal chatbot model that can caption images, explain memes, retrieve facts, create websites, write stories and poems, and suggest cooking recipes.

The recent GPT-4 has demonstrated extraordinary multi-modal abilities, such as directly generating websites from handwritten text and identifying humorous elements within images. These features are rarely observed in previous vision-language models. However, the technical details behind GPT-4 remain undisclosed.

MiniGPT-4 aligns a frozen visual encoder with a frozen advanced LLM, Vicuna, using a single projection layer. Properly aligning the visual features with an advanced large language model can yield many of the advanced multi-modal abilities demonstrated by GPT-4, such as detailed image description generation and website creation from hand-drawn drafts. MiniGPT-4 can also write stories and poems inspired by given images, teach users how to cook based on food photos, and so on.

In this video, I will talk about the following:
What can MiniGPT-4 do?
How is the dataset used to finetune MiniGPT-4 curated?
How is MiniGPT-4 trained?
How does MiniGPT-4 perform?

For more details, please see https://arxiv.org/pdf/2304.10592.pdf

Zhu, Deyao, Jun Chen, Xiaoqian Shen, Xiang Li, and Mohamed Elhoseiny. "MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models." arXiv preprint arXiv:2304.10592 (2023).
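
To make the alignment idea concrete, here is a minimal sketch of a single projection layer that maps visual encoder outputs into the LLM's embedding space. It is only an illustration in plain Python, not the authors' implementation: the dimensions, the helper names (`make_linear`, `project`, `build_llm_input`), and the choice to place the projected visual tokens before the text embeddings are all assumptions made for this example. The constant vectors stand in for the outputs of the frozen visual encoder and the frozen LLM's embedding table.

```python
import random

def make_linear(in_dim, out_dim, seed=0):
    """Create weights and bias for one linear projection layer (hypothetical helper)."""
    rng = random.Random(seed)
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weights, bias

def project(visual_tokens, weights, bias):
    """Map each visual token (length in_dim) to the LLM embedding size (length out_dim)."""
    return [
        [sum(w * x for w, x in zip(row, token)) + b for row, b in zip(weights, bias)]
        for token in visual_tokens
    ]

def build_llm_input(projected_tokens, text_embeddings):
    """Place the projected visual tokens before the text embeddings (an assumed layout)."""
    return projected_tokens + text_embeddings

# Toy sizes chosen only for illustration; they are not the real model dimensions.
VISUAL_DIM, LLM_DIM = 4, 6
NUM_VISUAL_TOKENS, NUM_TEXT_TOKENS = 3, 2

# Stand-ins for the frozen visual encoder output and the frozen LLM's text embeddings.
visual_tokens = [[0.5] * VISUAL_DIM for _ in range(NUM_VISUAL_TOKENS)]
text_embeddings = [[0.1] * LLM_DIM for _ in range(NUM_TEXT_TOKENS)]

# In this sketch, the projection weights and bias are the only parameters that would be trained.
weights, bias = make_linear(VISUAL_DIM, LLM_DIM)
llm_input = build_llm_input(project(visual_tokens, weights, bias), text_embeddings)

print(len(llm_input), len(llm_input[0]))  # prints "5 6"
```

The point of the sketch is that everything on either side of `project` is fixed, so training only has to learn the small mapping between the two embedding spaces.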