[summary from o1, excerpted:] Overall Conclusion: Claude 3.5 Sonnet outperforms Gemini 2 (Exp) across all tasks, demonstrating faster response, better handling of instructions, and producing more functional, reliable code. Gemini 2 (Exp) is slower, struggles with the diff-edit format, frequently fails to follow instructions properly, and delivers less polished results. Final Verdict: Claude 3.5 Sonnet is the clear winner, consistently producing more usable, stable, and feature-complete solutions than Gemini 2 (Exp).
Thanks. o1 Pro or just o1? Should I add these summaries somewhere like the description or pinned comment for future videos? Along with the chapters available? Won't be a problem for non-spoiler viewers?
It's my first time coming across a south African LLM content creator..I've been coding with openAi and claude for 2 years..😂😂I started with chatgpt 3.5..before people where even asking it to do snake games😂😂..bro we started before the LLMs had a attach file option or copy code option..we had to code all these features on our own...back when jail breaking would work easily..😂😂😂
@@sizwemsomi239 ola bafo! You also come a long way with LLMs I see. Let's push. If you haven't already, a sub would go a long way. My Reddit if you wanna keep in touch: u/marvijo-software
Weird Gemini is so slow for you. In my experience it’s insanely fast, like look away for a split second and it’s completed hundreds of lines of code fast. Very VERY much faster for me than Claude. I love Claude and pay for it but Gemini is so fast and free that I find myself using it for lots of stuff now
You're probably using Gemini 2 Flash, which is very fast. I cover it in one of my other videos versus Claude 3.5 Haiku: ruclips.net/video/op3iaPRBNZg/видео.htmlsi=PgfH1EztFt_7Ofzy
I hear and appreciate your comment. I have longer videos with an existing codebase on the channel. But I find that if LLMs can't complete the elementary code editing tasks like these (I gave them a code base with SQLite + Express + React (Vite) + Node with ShadCN and authentication already baked in), there's no need for us to test them in bigger code bases. I used that repo in multiple of these tests. Repo: github.com/marvijo-code/sqlite-express-react-nodejs-template In the Windsurf vs Cursor video I used another medium sized repo as a starting point because Claude 3.5 Sonnet already proved itself through these elementary tests: ruclips.net/video/duLRNDa-CR0/видео.html
Interesting setup, thanks for sharing!
Thanks! Glad you liked it. Please sub if you haven't already
[summary from o1, excerpted:]
Overall Conclusion:
Claude 3.5 Sonnet outperforms Gemini 2 (Exp) across all tasks, demonstrating faster response, better handling of instructions, and producing more functional, reliable code.
Gemini 2 (Exp) is slower, struggles with the diff-edit format, frequently fails to follow instructions properly, and delivers less polished results.
Final Verdict:
Claude 3.5 Sonnet is the clear winner, consistently producing more usable, stable, and feature-complete solutions than Gemini 2 (Exp).
Thanks. o1 Pro or just o1? Should I add these summaries somewhere like the description or pinned comment for future videos? Along with the chapters available? Won't be a problem for non-spoiler viewers?
@@MarvijoSoftware just o1; don't pin, people who want spoilers can scrub to the end
It's my first time coming across a south African LLM content creator..I've been coding with openAi and claude for 2 years..😂😂I started with chatgpt 3.5..before people where even asking it to do snake games😂😂..bro we started before the LLMs had a attach file option or copy code option..we had to code all these features on our own...back when jail breaking would work easily..😂😂😂
@@sizwemsomi239 ola bafo! You also come a long way with LLMs I see. Let's push. If you haven't already, a sub would go a long way. My Reddit if you wanna keep in touch: u/marvijo-software
Weird Gemini is so slow for you. In my experience it’s insanely fast, like look away for a split second and it’s completed hundreds of lines of code fast. Very VERY much faster for me than Claude. I love Claude and pay for it but Gemini is so fast and free that I find myself using it for lots of stuff now
You're probably using Gemini 2 Flash, which is very fast. I cover it in one of my other videos versus Claude 3.5 Haiku: ruclips.net/video/op3iaPRBNZg/видео.htmlsi=PgfH1EztFt_7Ofzy
good vid
@@augmentos Thank you, I truly appreciate it. Please sub, it goes a long way
Inclua áudio dublado.
Farei em breve. Por favor, inscreva-se enquanto isso.
Gemini not so fast but more qualified I think
zero shot minigames and "uses database for prototype on its own" aren't real world use cases - give both 100kb spaghetti code and ask for a fix
I hear and appreciate your comment. I have longer videos with an existing codebase on the channel. But I find that if LLMs can't complete the elementary code editing tasks like these (I gave them a code base with SQLite + Express + React (Vite) + Node with ShadCN and authentication already baked in), there's no need for us to test them in bigger code bases. I used that repo in multiple of these tests. Repo: github.com/marvijo-code/sqlite-express-react-nodejs-template
In the Windsurf vs Cursor video I used another medium sized repo as a starting point because Claude 3.5 Sonnet already proved itself through these elementary tests: ruclips.net/video/duLRNDa-CR0/видео.html
@@MarvijoSoftwaregood points. I love these showdowns :)