Steal my secrets for making LLMs more reliable

Steve (Builder.io)

8K views

Published: Oct 27, 2024

Comments • 18

@Steve8708 2 months ago ⁺⁴
Steal more of my secrets: www.builder.io/blog/make-ai-suck-less
@MatBat__ 1 month ago ⁺²
Great video, thanks a lot for sharing your experiences.
I've been working on increasing the accuracy of a company's RAG systems for about a year now and your insights are spot on.
It still amuses me that we can kind of 'program' these LLMs using our own language; it's more semantic than logical, in a way.
I have also been using automated testing to grade responses and point out errors. Funny how these things can give a wrong answer to a question and then, when given the same question plus the answer, point out precisely what was wrong with it.
I'd say that around 80% of my accuracy-chasing endeavours were based on tweaking the system prompt to isolate possible contexts. The other 20% were document- or business-related.
Cheers
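
Editor's note: the grading approach described in the comment above can be sketched roughly as follows. This is only an illustration, not the commenter's actual setup. The names `Verdict`, `answer_with_review`, `generate`, and `grade` are hypothetical, and both callables stand in for whatever LLM calls you already have; no specific provider API is assumed.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Verdict:
    """Result of grading one answer (hypothetical structure)."""
    ok: bool
    critique: str = ""


# Hypothetical signatures: generate(question, critique) -> answer text,
# grade(question, answer) -> Verdict. Both would wrap LLM calls in practice.
Generate = Callable[[str, Optional[str]], str]
Grade = Callable[[str, str], Verdict]


def answer_with_review(
    question: str,
    generate: Generate,
    grade: Grade,
    max_rounds: int = 3,
) -> str:
    """Generate an answer, have a grader review it, and retry with the critique."""
    critique: Optional[str] = None
    answer = ""
    for _ in range(max_rounds):
        # The previous critique (if any) is passed back to the generator.
        answer = generate(question, critique)
        verdict = grade(question, answer)
        if verdict.ok:
            return answer
        critique = verdict.critique
    # No answer passed review; return the last attempt so the caller can decide.
    return answer
```

The grader sees the question together with the answer, which is the situation the comment describes: a model that answered wrongly can often still say what is wrong once the answer is in front of it. Capping the loop with `max_rounds` keeps cost bounded when the grader never approves.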
@rdf274 1 month ago ⁺¹
Hey man. I'm in a very similar boat to you.
I've been leading the development of a RAG system, and of our newest product, an AI product sales agent.
The problem with our RAG is that it's not specific to any niche or industry; we make it available to anyone wanting to upload their docs. Our problem was 50% docs, 20% prompting, and 30% the embedding/vector search techniques.
We ended up using 2 different embedding LLMs and taking the top 5 (k=5) results from both of them, which improved accuracy greatly.
Some of our clients have lists of 100+ items (like a long manual) and they expect the AI to give out all 100 items in one answer, so we hack a few of them to include the entire doc instead of just the chunks.
Since clients have different use cases, the quality of the response is often not what the client expects.
The product sales agent though, probably because it has very specific goals and instructions, performs immensely better in all sorts of ways. It collects information, profiles the buyer, and makes sensible suggestions according to pretty much anything the buyer inputs.
I am about to start experimenting with loopbacks to grade responses and point out errors. I assume these can really help with the RAG.
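
Editor's note: one way to combine the top 5 results from two embedding models, as mentioned in the comment above, is sketched below. The comment does not say how the two result lists are merged, so the rank-by-rank interleaving with de-duplication here is an assumption. `merged_top_k` and the `Retriever` type are hypothetical names; each retriever stands in for a vector search over one embedding model.

```python
from typing import Callable, List, Sequence

# Hypothetical signature: retriever(query, k) -> chunk ids, best match first.
Retriever = Callable[[str, int], Sequence[str]]


def merged_top_k(query: str, retrievers: Sequence[Retriever], k: int = 5) -> List[str]:
    """Take the top-k chunks from each retriever and merge them without duplicates."""
    ranked_lists = [list(retrieve(query, k))[:k] for retrieve in retrievers]
    merged: List[str] = []
    seen = set()
    # Interleave by rank so each model's best hits come first in the context.
    for rank in range(k):
        for ranked in ranked_lists:
            if rank < len(ranked) and ranked[rank] not in seen:
                seen.add(ranked[rank])
                merged.append(ranked[rank])
    return merged
```

With two retrievers and `k=5` this yields between 5 and 10 chunks, depending on how much the two models agree. Chunks found by both appear once, at the position of their best rank.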
@williamseipp9691 2 months ago ⁺¹
Yeah, as bad as hallucination is, I often put in stories just short of 1000 words and am surprised by how well it understands the story. Like "wow, you really can extrapolate / infer a lot of accurate details based on what I tell you".
I'm also learning software development at the moment and I'm tested on my ability to accurately explain software concepts whether I'm talking about database constraints or Ruby features. The better I've gotten with my explanations, the easier it is to "boss" a model around to get exactly what I want.
Thanks for the videos. They're always clear and of top-notch quality.
@ruuman4 2 months ago ⁺¹
Wonderful video. I would love to see more videos on constraining LLMs to get better outputs
@keteremillpario 1 month ago ⁺¹
First of all, thank you for sharing these experiences. Now, I still have one main concern about these LLM techniques: isn't this a lot of time and effort just to achieve some confidence (but not 100% confidence) that the LLM is working? I still think that by the time I finish this tune-up process I could have researched and solved the original problem on my own and completely bypassed the LLM.
@andrezimpel_unknown 2 months ago
Thank you for this video bro! Would love to see a more in depth video from you on how to train my own model.
@Steve8708 2 months ago
got u fam ruclips.net/video/fCUkvL0mbxI/видео.html
@christopherscheidel5431 2 months ago
Great points. Thanks for sharing.
@oszi7058 2 months ago
as always high quality content
@faizanahmed9304 2 months ago ⁺⁶
If possible, can you please make a beginner video for learning concepts like LLMs, transformers, fine-tuning, etc.? That would be really helpful. Thanks!
@thefatcat-hd6ze 1 month ago
This, need this badly
@RyanSmith-rb1ch 2 months ago
Great video
@Metruzanca 2 months ago ⁺²
Bluetooth and wireless printers lmfao.
@404statuscode 2 months ago
Is it a reupload or am I having déjà vu?
@GifCoDigital 1 month ago ⁺¹
These should have stayed "secrets". lol