Microsoft Phi-4 (14B) : This Opensource LLM is a MINI BEAST! The Best 14B Model YET! (Beats Qwen!)

AICodeKing

16K views

Published: Dec 19, 2024

Comments

@wmzayed 2 days ago ⁺⁵⁰
I believe it is time to change your 13 test questions. I feel the Microsoft Phi team is following you and training the model around your questions. :) You could create a different set of questions that are similar in concept.
@kafkaesqued 2 days ago
😂😂
@luismoriguerra669 2 days ago
haha, classic benchmark issue
@HemangJoshi 2 days ago ⁺¹
Actually, the best benchmark is the aider leaderboard. Whichever LLM is on top is the best, period.
@You12783 2 days ago ⁺³⁷
You should make a longer video by creating a full-stack application using the models that scored really well on your benchmarking questions. That way we'll know which one's the best.
@aculz 2 days ago ⁺²
then join the membership, simple
@midzuushi 2 days ago
@aculz thx for the info 🎉
@EditUMedia 2 days ago ⁺⁷
My only concern is that the model may only be good on benchmark questions, given the history of Phi models being trained specifically to score high on benchmarks rather than for real-world performance.
But this model seems promising, I'm excited to try it out.
@trokk24 2 days ago ⁺¹
Next level, and local. I expect tool use to become a feature; it would greatly enhance the potential. I've run it at q8, q6, and q4 and got basically the same performance. Trying it now with the settings you recommended. Thanks for sharing, CodeKing.
@paulyflynn 2 days ago ⁺⁷
Can you make simple, non-impactful changes to the questions? For example, "2 plums" instead of "2 apples".
@AICodeKing 2 days ago
Okay, got it!
@themarksmith 2 days ago
excellent video dude!!!!
@rrioclkls7721 2 days ago
Does Open WebUI normally display generated pages (like with the confetti button at ~9:11)?
@AICodeKing 2 days ago
Yes
@AB-cd5gd 2 days ago
The best test is asking for a modern, sleek landing page; you quickly see how good or bad the model is.
@_lun4r_ 2 days ago ⁺²
Can't wait for Phi-4 small (~7B) and Phi-4 mini (~3B) to crush all benchmarks in those ranges.
The Phi-4 you're showcasing here is a Phi-4 medium.
@VietVuHunzter 2 days ago ⁺¹⁰
Lmao, I won't trust Phi models until they're on real-world benchmarks like Arena/LiveBench.
@tukanhamen 2 days ago
Yep, been disappointed too many times
@PseudoProphet 2 days ago ⁺¹
You can even run it on an M4 Mac mini.
@jimlynch9390 2 days ago
That's quite a good model. Thanks.
@Tyrexxllc 2 days ago
I think it's time to update your test questions!!
@kydjester 2 days ago
Next time, paste all the questions at once and let's see the fun.
@SipChai 2 days ago
Can you compare other small models?
@midzuushi 2 days ago
Open Ai ... it's gonna be Open
@Adam-fl9uc 2 days ago
Woooooooow! It is incredible
@njt4u 2 days ago
This is quite insane 😮
@1-chaz-1 2 days ago
Wow
@다루루 2 days ago
🐿️🐿️🐿️🐿️🐿️🐿️
