The Ultimate Writing Challenge: Longwriter Tackles 10,000 Words In One Sitting

AgentWrite with LangGraph

How might LLMs store facts | Chapter 7, Deep Learning

iPhone 16 Pro and Pro Max hands-on

LL COOL J - Murdergram Deux ft. Eminem

My Daughter Survives WORLD'S TINIEST HOUSE

Microsoft's Phi 3.5 - The latest SLMs

Sam Witteveen

Просмотров 14 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 11 сен 2024

Комментарии • 31

@thenoblerot 22 дня назад ⁺¹⁰
Thanks Sam! You always have good content in a sea of clickbait nonsense :)
@samwitteveenai 21 день назад ⁺²
Thanks this is what I am trying to go for. This who space has gotten sop hype focused over the past couple of years.
@supercurioTube 22 дня назад ⁺¹
Thanks for the coverage, I'd be interested in a tool use / RAG and other utilities comparison with Llama 3.1 8B quantized aggressively to bridge the gap in RAM and performance!
@thmo_ 20 дней назад ⁺¹
the MoE wasn't wrong, the correct answer for that calculation was exactly 9.9996, rounding _is_ the next step. So I'd say it did better at that specific question..
@blossom_rx 20 дней назад ⁺³
Unfortunately every Phi model I tested so far had a model collapse after 3 to 5 queries. I have this only with Microsoft models OR models I truncated on my own. I do not understand the hype and do not trust the benchmarks. Just to make clear: I have about 15 different official models running locally that were not tampered with and NONE except the Microsoft models have this issue.
@Alex29196 22 дня назад ⁺²
Phi 3.5 is mindblowing. Works crazy fast and accurate for function calling, and json answers also.
@NoidoDev 19 дней назад
Which version, what functions?
@mukilanru 15 дней назад ⁺¹
Is it faster than Llama-3.1-8b-Instruct float16 for json response? Also which model, mini, right?
@user-th7cu9ll4j 14 дней назад
What are some different use cases for Mini and MoE? For example if you want to do a RAG application, which would be more suitable?
@0cano 22 дня назад
Always top notch content Sam!
@erniea5843 22 дня назад
Nice overview!
@jeremybristol4374 21 день назад
Surprisingly good. Better than v3. But still get's stuck in loops as the response context length grows. Experimenting with prompts to avoid this.
@NetZeroEarth 22 дня назад
🔥 🔥 🔥
@Diego_UG 22 дня назад
Is there any cheap way to finetune these small models with proprietary data?
@samwitteveenai 21 день назад ⁺¹
yeah you can do FTs with Unsloth etc quite easily for these.
@xthesayuri5756 22 дня назад ⁺⁸
It's funny. Every time a new Phi model comes out I get so insanely bearish for LLMs because they always suck. Just gaming the benchmark but are horrendous to use.
@hidroman1993 22 дня назад ⁺²
100% agreed, just ask a slightly different question and Phil goes NUTS
@Spathever 21 день назад
This is what I noticed too. Went crazy on the 2nd time. There was no 3rd. Maybe newer bigger ones would work. Probably will need to fine-tune.
@Alex29196 21 день назад ⁺¹
This kind of models are like gold for people working with NLP.
@user-fy8gm1nc9g 21 день назад
😂
@samwitteveenai 21 день назад
Can I ask what you are using it for that you are finding it sux. Curious is it a chat kind of app etc?
@WillJohnston-wg9ew 22 дня назад ⁺¹
Does anyone know of a source for community/conversation on LLMs and business? I'm a technologist developing an app and would really like to find a good source for discussing ideas and what's working/not working.
@hidroman1993 22 дня назад ⁺¹
Definitely first
@ArianeQube 22 дня назад ⁺¹
o fucks given.
@IdPreferNot1 22 дня назад ⁺¹
How much longer are we going to pretend that these are in any way practical? No on prem running for anyone except large corp and many of the privacy issues open source was supposed to address arise come back once you start using someone else's hardware. Guess Its great to see smaller models improve and push foundation models, but if you want to do stuff with any off these, especially with agentic processes gobbling thousands of tokens, latency and performance demand hosted service.... might as well go free flash, mini with no setup or hosting issues.
@pwinowski 21 день назад ⁺¹
Well, you actually can run a crew of Phi models on a MacBook Pro. The M3 Pro with 36 GB of system memory, can allocate around 27 GB of that pool solely to GPUs for inference.
@IdPreferNot1 21 день назад
@@pwinowski Its not about can/cant. What is the tokens/sec doing that locally? Now consider hitting the gemini-flash API with 128k tokens 15 times a minute for free.

Следующие

Автовоспроизведение

The Ultimate Writing Challenge: Longwriter Tackles 10,000 Words In One Sitting

The Ultimate Writing Challenge: Longwriter Tackles 10,000 Words In One Sitting

AgentWrite with LangGraph

AgentWrite with LangGraph

How might LLMs store facts | Chapter 7, Deep Learning

How might LLMs store facts | Chapter 7, Deep Learning

iPhone 16 Pro and Pro Max hands-on

iPhone 16 Pro and Pro Max hands-on

LL COOL J - Murdergram Deux ft. Eminem

LL COOL J - Murdergram Deux ft. Eminem

My Daughter Survives WORLD'S TINIEST HOUSE

My Daughter Survives WORLD'S TINIEST HOUSE

Vettaiyan - Manasilaayo Lyric | Rajinikanth | T.J. Gnanavel | Anirudh | Manju Warrier | Subaskaran

Vettaiyan - Manasilaayo Lyric | Rajinikanth | T.J. Gnanavel | Anirudh | Manju Warrier | Subaskaran

Cursor Is Beating VS Code (...by forking it)

Cursor Is Beating VS Code (...by forking it)

This AI tool Feels Like Magic - My review of Napkin AI.

This AI tool Feels Like Magic - My review of Napkin AI.

AWS CEO - The End Of Programmers Is Near

AWS CEO - The End Of Programmers Is Near

Paying for software is stupid… 10 free and open-source SaaS replacements

Paying for software is stupid… 10 free and open-source SaaS replacements

I Built an AI That Does My Work For Me

I Built an AI That Does My Work For Me

Try this Before RAG. This New Approach Could Save You Thousands!

Try this Before RAG. This New Approach Could Save You Thousands!

Agentic Info Extraction with Structured Outputs

Agentic Info Extraction with Structured Outputs

AI isn't gonna keep improving

AI isn't gonna keep improving

The Big Fat Llama has arrived - Llama-3.1-405B

The Big Fat Llama has arrived - Llama-3.1-405B

Вопрос Ребром - Булкин

Вопрос Ребром - Булкин

Нельзя смеяться | Смех с водой | 78 #shorts

Нельзя смеяться | Смех с водой | 78 #shorts

My daughter is creative when it comes to eating food #funny #comedy #cute #baby#smart girl

My daughter is creative when it comes to eating food #funny #comedy #cute #baby#smart girl

Разбираем АНАЛИТИКУ БУДУЩЕГО автомобилей

Разбираем АНАЛИТИКУ БУДУЩЕГО автомобилей

Каха домашние продукты #непосредственнокаха

Каха домашние продукты #непосредственнокаха

КРАСНОГЛАЗОМУ ЗОМБИ ПОФИГ НА МОЮ ОБОРОНУ! / PVZ ODD MOD

КРАСНОГЛАЗОМУ ЗОМБИ ПОФИГ НА МОЮ ОБОРОНУ! / PVZ ODD MOD

PVZ, НО ЭТОТ ЗОМБИ ХИЛИТ СЕБЕ ПОДОБНЫХ! / ODD MOD

PVZ, НО ЭТОТ ЗОМБИ ХИЛИТ СЕБЕ ПОДОБНЫХ! / ODD MOD

ФОКУС -СВЕТОФОР

ФОКУС -СВЕТОФОР