I use LLMs daily for coding assistance. I tried the experimental Gemini yesterday as a substitute and it was a mixed bag: for simple tasks it produced cleaner code than other LLMs, but for complex tasks it would greatly overcomplicate the code and get stuck in suggestion loops when its code didn't compile.
If it ain't coding then it's useless.
well i dont care about coding so... its perfect
Solipsism is a helluva drug
It's already better than most coders. That's why everyone uses AI for coding.
But only Claude is good at coding… GPT-4o is pretty crap; it makes up libraries like every 2 seconds. I'd actually rather use Qwen 2.5 Coder 32B.
Qwen better than o1 and 3.5 Sonnet in coding?
Thank you for making us all aware of this
I also found something strange on AI Studio this morning: Gemini 1.5 Pro latest was giving amazing answers. Probably not related, but this is awesome.
for OpenAI's o1-preview I think the model temperature is fixed to 1, does google allow changing the temperature for this new model?
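For anyone curious what temperature actually does: it rescales the logits before the softmax, so low values concentrate probability on the top token and high values flatten the distribution toward uniform. A minimal, self-contained sketch with toy logits (not any real model's API):

```python
import math
import random

def sample_with_temperature(logits, temperature, rng=random.Random(0)):
    """Temperature-scale logits, softmax them, then sample a token index."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)                                # subtract max for numeric stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    r = rng.random()                               # inverse-CDF sampling
    cum = 0.0
    for i, p in enumerate(probs):
        cum += p
        if r <= cum:
            return i
    return len(probs) - 1

# Near-zero temperature is effectively greedy decoding (always the argmax);
# temperature 1.0 samples from the raw softmax distribution.
```

With temperature fixed to 1, as the comment says o1-preview does, you simply always sample from the unscaled distribution.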
Just tested it on programming and it's much weaker. I think it's not using Monte Carlo tree search.
@@elawchess If it's not using search like o1-preview, that's a bit worrying, because any new breakthrough after the transformer is unlikely to be made public, and the 32k context length suggests it's a model fresh out of the oven. I hope it's not a breakthrough on the scale of the transformer; any other improvement is welcome.
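On the "search at inference time" point: one simple form of it is best-of-N sampling, where you draw several candidate answers and keep the one a scorer likes best. What o1-style systems actually do is more elaborate and not public; the generator and verifier below are purely hypothetical stand-ins.

```python
import random

def best_of_n(generate, score, n=8, rng=None):
    """Inference-time search sketch: draw n candidates, keep the best-scoring one."""
    rng = rng or random.Random(0)
    candidates = [generate(rng) for _ in range(n)]
    return max(candidates, key=score)

# Toy stand-ins: a 'model' that guesses numbers and a 'verifier'
# that rewards closeness to a target answer of 42.
guess = lambda rng: rng.randint(0, 100)
closeness_to_42 = lambda x: -abs(x - 42)
best = best_of_n(guess, closeness_to_42, n=16)
```

The appeal is that a weak generator plus a decent verifier can beat a single greedy sample, which is one reason people read so much into whether a model searches or not.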
The reason it mentions 1 "B" and 0 "b" in "bananas" is the context. You started the conversation asking for jokes, so it's probably trying to be funny, hence this response and the wink.
Exactly, that's why I think the model gave a decent answer
testing the logic of a model by asking it to write a joke about a famous person is pretty useless.
Benchmarks don't mean anything. I don't trust benchmarks anymore; I have to use a model myself to believe it, especially with Google. Tried it just now to generate code, but it didn't finish the generation.
bs means bull-sh*t
Another great video, thanks for letting us know about Gemini Exp 1114. I've done a little testing myself; it seems very smart, on a similar level to o1-preview but more concise. Interestingly, I had it generate multiple paragraphs on various topics, and in every test it passed as 75%-100% human on 8 different AI checkers. As for your banana question, it may have been reading that as "bs" (b******t) and not B's, which could be why it was giving an odd answer.
Thanks for the content. Comparing two models from the same company is challenging, especially when one builds on top of the other, and random questions may not reveal the new model's strengths. I don't think Google is promoting this as a replacement for Gemini 1.5; the shorter token window suggests it's an intelligent model meant to complement Gemini 1.5, handling cases it cannot. Instead, testing it on hard problems like math and logic where Gemini 1.5 struggles would better show the improvements. In production, we could use the new model for complex tasks, switching back to 1.5 as needed. Please try to collect some of these problems and test it. Appreciated.
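The "use the new model for complex tasks, fall back to 1.5" idea can be sketched in a few lines. Everything here is hypothetical: the model names are just the two under discussion, and the keyword/length heuristic is a placeholder for whatever hardness classifier you would actually use in production.

```python
def route(prompt: str) -> str:
    """Toy router: send hard-looking prompts to the experimental model,
    everything else to the long-context workhorse."""
    hard_keywords = ("prove", "theorem", "optimize", "derive")
    looks_hard = len(prompt) > 500 or any(k in prompt.lower() for k in hard_keywords)
    return "gemini-exp-1114" if looks_hard else "gemini-1.5-pro"
```

A real router would also need a fallback path for when the experimental model's smaller context window can't fit the prompt at all.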
Didn’t have enough tokens to use it lol 😂 It literally couldn’t understand the question, and by the time it understood, I ran out of tokens 😂😂
Those jokes 😂. Do you have a video describing how these LMSYS ratings are calculated? You know what they say about measures and targets.
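For reference, Chatbot Arena ratings come from pairwise "battles" between anonymized models; a simplified Elo-style update looks like the sketch below (the real leaderboard fits something closer to a Bradley-Terry model over all battles at once, so treat this only as intuition for why upsets move ratings more than expected wins).

```python
def elo_update(r_winner, r_loser, k=32):
    """One Elo update after a pairwise battle: the winner gains what the
    loser sheds, scaled by how surprising the result was."""
    expected_win = 1.0 / (1.0 + 10 ** ((r_loser - r_winner) / 400))
    delta = k * (1.0 - expected_win)
    return r_winner + delta, r_loser - delta

# An upset (lower-rated model wins) swings more points than a favourite winning.
a, b = elo_update(1000, 1200)   # underdog wins: big swing
c, d = elo_update(1200, 1000)   # favourite wins: small swing
```

This is also where the "measures and targets" worry bites: once a rating like this becomes the target, vendors can optimize for winning battles rather than being useful.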
With Google’s track record, they probably just told them it’s best
I think the bananas answer was because the model thinks you're expecting it to be funny, perhaps?
Oh the wink
Good point. It's best to clear the context before asking several separate questions, since otherwise the model will assume it's an ongoing discussion.
it seems this is google's answer to o1-preview
It's not. On coding its score is much lower.
LLMs won't be able to generate new logic in code, because their code comes from the human mind. Generating something genuinely new will only be possible when an LLM can think exactly like a human.
Just think for a sec: all the code LLMs are generating right now is already available on the internet. To generate code that isn't on the internet, an LLM would have to think like a human being, like a fully developed programmer's mind; otherwise it's not possible.
I really like Gemini 1.5 Pro with its 1/2 million context window. Also, it's free, which is really cool. Can't wait for long output to drop.
Gemini 2 or upcoming models would push OpenAI to drop their stuff. Can't wait to see what's next.
great content lately
Google FTW? Wait... What?
this should have been the title of this video
@@1littlecoder Ping me for more quirky and free ideas.
BTW.. huge fan!
When is the 1400 Elo model out?
Let me know when it is #1 on coding and not on BS metrics. Still a good video, but I don't trust Google for shit.
You don't trust the company that made this all possible, and gives you the best free API access to its models!
Man, I didn't like that banana response. Like, it's jokey, but there shouldn't be a personality built into any model unless the developers want to give it one.
It isn't that good