How LLaVA works 🌋 A Multimodal Open Source LLM for image recognition and chat.

  • Published: 5 Jan 2024
  • Arxiv Dives is a group from Oxen.ai of engineers, researchers, and practitioners that gets together every Friday to dig into state-of-the-art research related to Machine Learning and Artificial Intelligence. If you would like to join the live discussion, we would love to have you!
    Join here:
    lu.ma/oxenbookclub
    Each week we dive deep into a topic in ML/AI. Whether it is a research paper, a blog post, a book, or a YouTube video, we break down the content into a digestible format and have an open discussion with the Oxen.ai team and anyone else who wants to join. We try to cover the content at a high enough level that anyone can understand it, and then dive into deeper technical details to get a clearer understanding.
    This week we cover the LLaVA paper, a multimodal model that combines image recognition with an LLM through a chat-like interface, lowering the barrier to entry for many computer vision tasks.
    All the notes and previous dives can be found on the Oxen.ai blog:
    blog.oxen.ai/tag/arxiv-dives/
  • Science
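The mechanism the paper describes can be sketched in a few lines: LLaVA keeps a frozen CLIP vision encoder and trains a projection (a single linear layer in the original paper) that maps image patch features into the language model's token-embedding space, so the LLM consumes them like ordinary tokens. Below is a minimal sketch; the dimensions are illustrative and `LlavaProjectorSketch` is a hypothetical name, not a class from the repo.

```python
import torch
import torch.nn as nn

class LlavaProjectorSketch(nn.Module):
    """Sketch of LLaVA's vision-to-language bridge.

    Illustrative dimensions: CLIP ViT-L/14 patch features are 1024-d;
    Vicuna-13B token embeddings are 5120-d.
    """
    def __init__(self, vision_dim=1024, llm_dim=5120):
        super().__init__()
        # A single trainable linear layer maps image patch features
        # into the LLM's token-embedding space.
        self.proj = nn.Linear(vision_dim, llm_dim)

    def forward(self, patch_features):
        # patch_features: (batch, num_patches, vision_dim), taken from
        # the frozen CLIP vision encoder's grid features.
        return self.proj(patch_features)

# The projected "visual tokens" are concatenated with the text token
# embeddings and fed to the LLM as one sequence.
bridge = LlavaProjectorSketch()
fake_patches = torch.randn(1, 256, 1024)   # e.g. a 16x16 patch grid
visual_tokens = bridge(fake_patches)
print(visual_tokens.shape)                 # torch.Size([1, 256, 5120])
```

Training then happens in two stages: first only this projection is tuned to align the two modalities, then the projection and the LLM are fine-tuned together on instruction-following data.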

Comments • 6

  • @albertmashy8590 7 months ago +1

    Good video

  • @Pingu_astrocat21 5 months ago +2

    How can we fine-tune LLaVA on a custom image-caption dataset?
    Thank you for uploading this video :)

    • @oxen-ai 5 months ago

      It looks like they have some instructions in their GitHub repo! github.com/haotian-liu/LLaVA/blob/main/docs/Finetune_Custom_Data.md
      Also, if you end up trying to fine-tune, or need some people to collaborate with, let us know in our Discord: discord.com/invite/s3tBEn7Ptg
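For reference, the custom-data doc linked above describes a conversation-style JSON format. A minimal sketch of building one training record in that format (the id, filename, and caption here are made up for illustration):

```python
import json

# One record in the format Finetune_Custom_Data.md describes: an id,
# an image filename, and a human/gpt conversation. The "<image>" token
# marks where the image features are spliced into the prompt.
record = {
    "id": "0001",
    "image": "0001.jpg",
    "conversations": [
        {"from": "human", "value": "<image>\nDescribe this image."},
        {"from": "gpt", "value": "A volcano erupting at dusk."},
    ],
}

# The training script expects a JSON list of such records.
with open("custom_caption_data.json", "w") as f:
    json.dump([record], f, indent=2)
```

An image-caption dataset maps onto this by treating each caption as the "gpt" turn of a one-round conversation.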

  • @Akshatgiri 5 months ago +1

    That man is stressed. Give him a vacation

  • @bennguyen1313 5 months ago

    I have PDF files of handwritten data that I'd like to OCR, perform calculations on, and finally edit or append the PDF with the results.
    I like the idea of using a Custom GPT, but only GPT-4 Plus subscribers can use those. So I'd prefer a standalone browser or desktop solution that anyone can drag and drop a file into. However, I'm not sure if the ChatGPT-4 Assistants API has all the Vision / AI PDF plugin support.
    If using LLaVA + Ollama, would anyone who wants to use my application also need to install the 20 GB Ollama?

    • @oxen-ai 5 months ago

      This is a good question - I haven't tried Ollama yet, but it would be a cool integration to try. If you end up getting it working, let us know in our Discord! I'm sure people there would be interested.
      discord.com/invite/s3tBEn7Ptg
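On the deployment question: Ollama runs as a local server, so in a LLaVA + Ollama setup each user's machine would need Ollama and the pulled model weights, with the application talking to that local server. A minimal sketch of building a request for Ollama's documented /api/generate endpoint (the prompt and image bytes here are placeholders):

```python
import base64

# Sketch of a client-side request body for a locally running Ollama
# server that has pulled the "llava" model. Images are sent as base64
# strings in the "images" field.
def build_llava_request(prompt, image_bytes):
    return {
        "model": "llava",
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }

payload = build_llava_request("What is in this picture?", b"\x89PNG...")
# A real client would POST this as JSON to
# http://localhost:11434/api/generate and read the "response" field.
```

An alternative that avoids the per-user install is hosting Ollama (or another LLaVA server) centrally and having the app call it over the network.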