DeepSeek-V3 (Fully Tested) : RIP 3.5 Sonnet & O1! This Opensource Model Beats Claude 3.5 Sonnet!

Поделиться
HTML-код
  • Опубликовано: 25 дек 2024

Комментарии • 108

  • @wedding_photography
    @wedding_photography 14 часов назад +49

    We know that when question 4 is answered correctly, the AGI has been achieved.

    • @fabiankliebhan
      @fabiankliebhan 12 часов назад +1

      o1 pro gets it correct

    • @markantscott
      @markantscott 11 часов назад +2

      Q4 is bigger than mere AGI. It is the ability to answer obscure English Pub Quiz Night questions.

    • @tescOne
      @tescOne 10 часов назад

      @@fabiankliebhan sonnet too

    • @kafkaesqued
      @kafkaesqued 9 часов назад +2

      What is so special about question 4?

    • @seanlbrennan
      @seanlbrennan 9 часов назад +1

      Deepthink option gets Q4 right but took two turns. First turn it ran out of tokens testing words and used a 10 letter word. asked it to keep going and it gives Sententious. The next turn it came up with Transparent right away.

  • @HedleyPugh
    @HedleyPugh 11 часов назад +10

    The "preview" of DeepSeek's new V3 model takes 2nd place on the aider polyglot leaderboard.
    1: 62% o1
    2: 48% DeepSeek V3 Preview
    3: 45% Sonnet
    4: 38% Gemini-exp-1206
    5: 33% o1-mini

  • @theorderofz
    @theorderofz 14 часов назад +7

    Thanks for always putting us on, mate

  • @EditUMedia
    @EditUMedia 14 часов назад +6

    Thank you so much for these videos covering new models. Merry Christmas

  • @samuelsilveira9709
    @samuelsilveira9709 14 часов назад +5

    Merry Christmas, codeking

  • @sinapxiagency
    @sinapxiagency 14 часов назад +2

    King, i dont know how you get this reviews so fast even in holidays, thank you so much

  • @ElvinHoney707
    @ElvinHoney707 12 часов назад +5

    o1 passes question 4: "A suitable answer is "SENTENTIOUS." It is an English adjective (from Latin "sententiosus"), it has 11 letters, begins and ends with S, and its vowels (e, e, i, o, u) appear in strictly alphabetical (non‐decreasing) order."

  • @voltax4435
    @voltax4435 11 часов назад +3

    Finally a real sonnet alternative, and way cheaper!

  • @SipChai
    @SipChai 14 часов назад +4

    Panda has replaced Santa. Poor old man.

  • @notshekhar4738
    @notshekhar4738 14 часов назад +12

    Even with a slightly altered prompt like ' 'what mode are you using?' (with a single quote at the beginning), the model still responds with 'GPT-4'. This raises questions about its underlying architecture.

    • @gui1236100
      @gui1236100 14 часов назад +3

      Maybe they trained on data generated by gpt-4

    • @BACA01
      @BACA01 13 часов назад +2

      @@gui1236100 They stole it as always 😁

    • @wwkk4964
      @wwkk4964 13 часов назад

      They tend to tend to say that, even Gemini would say it last year. everyone trained on Chatgpt.

    • @GRVTY3
      @GRVTY3 12 часов назад

      i'm using it in cline with openrouter deepseek chat api, and it keeps saying it's claude and acting like claude. something really sus going on here

    • @boynet2
      @boynet2 12 часов назад +3

      @@GRVTY3 maybe cline prompt has something like "you are Claude..." ?

  • @Kevencebazile
    @Kevencebazile 14 часов назад +2

    Merry Christmas Brother Love your content

  • @jacobfloyd6929
    @jacobfloyd6929 14 часов назад +6

    Brother you really have extremely valuable content. Have you ever thought about running a community/course? I’m sure there’s a lot of people looking to collaborate with like minded people, especially since AI is so tough to stay on top of.

    • @AICodeKing
      @AICodeKing  14 часов назад +7

      I already have a membership on my channel where I post in-depth tutorials for niche topics.

    • @jacobfloyd6929
      @jacobfloyd6929 14 часов назад +2

      @ thank you I’m gonna look into that. Are you opposed to creating a discord or Skool community? That way everyone can collaborate on new stuff they’re finding, we all know networking is everything but it’s tough to find a valuable community.

    • @theorderofz
      @theorderofz 14 часов назад

      @@jacobfloyd6929true. That would work pretty well.

    • @jmg9509
      @jmg9509 12 часов назад

      This guy in the vid sounds like ai lol

  • @maddoxthorne2297
    @maddoxthorne2297 9 часов назад +1

    Christmas gift galore.🎁❤️

  • @d.d.z.
    @d.d.z. 13 часов назад +3

    With Qwen and Deepseek China strikes back. So amazing to live in 2025.

  • @ram49967
    @ram49967 14 часов назад +1

    Super questions for the LLM! It's ok with me to give it a pass on Question 3, even though it used the first letter and not the second letter to make the Haiku.

  • @Teetanthegamer
    @Teetanthegamer 13 часов назад +1

    Can you please make a tutorial on how to use it with cline locally through ollama or through paid api ?

  • @uniq6318
    @uniq6318 10 часов назад +1

    Without using deep thinking
    That's amazing

  • @chyldstudios
    @chyldstudios 14 часов назад +1

    Love to see this

  • @notshekhar4738
    @notshekhar4738 14 часов назад +2

    I tried prompting the model with 'what model are you using to response to this chat?' and it said 'GPT-4'. When I followed up with 'who developed you?', it answered 'OpenAI'. This makes me wonder if the system is actually utilizing OpenAI APIs.

    • @gui1236100
      @gui1236100 14 часов назад +1

      Maybe just training data generated by gpt-4

    • @notshekhar4738
      @notshekhar4738 13 часов назад

      @@gui1236100 maybe yess

    • @BACA01
      @BACA01 13 часов назад

      When it was deepseek v2 it was saying that it's a gpt3.5 and now it says it's gpt4

    • @Nomadnotepad
      @Nomadnotepad 7 часов назад

      Tell me you don’t understand how training data works without telling me you don’t know how training data works.

    • @TheFinanciallyWiseKidsTV222
      @TheFinanciallyWiseKidsTV222 33 минуты назад

      I’m DeepSeek-V3, an intelligent assistant developed by the Chinese company DeepSeek. I’m built on advanced natural language processing and machine learning technologies, designed to assist with answering questions, providing information, and engaging in conversations. If you have any questions or need help, feel free to ask! 😊

  • @displayname7t4
    @displayname7t4 13 часов назад

    Most useful channel in youtube right now

  • @TitoSadek
    @TitoSadek 13 часов назад

    Merry Christmas , I love your content , thanks

  • @flutterflowexpert
    @flutterflowexpert 12 часов назад

    New questions! Finally! 🎉❤

  • @Reverse-sg5rn
    @Reverse-sg5rn 14 часов назад +1

    Merry Christmas. Can you do code testing with cline and aider on it?

  • @salimalsenani2614
    @salimalsenani2614 13 часов назад +2

    Everyone wants to take sonnet down.. but no one could!
    It remains the king of coding.

    • @thanartchamnanyantarakij9950
      @thanartchamnanyantarakij9950 11 часов назад

      Not at this time. You can check by yourself

    • @salimalsenani2614
      @salimalsenani2614 11 часов назад

      @thanartchamnanyantarakij9950 I just tested Deepseek V3, Gemini 2.0 Flash, and Sonnet, asked them to create amazing landing page for coffee brand.
      Sonnet won by far in terms of design and following correct prompts.
      Second is so close Deepseek V3 and Gemini 2.0 Flash, but I preferred the Deepseek it's really amazing 🤩.

    • @Osys91
      @Osys91 8 часов назад

      ​@@thanartchamnanyantarakij9950 did it outperformed sonnet? I quickly tested some code and sonnet was still performing better

  • @alexjensen990
    @alexjensen990 8 часов назад

    Well, color me surprised. I look forward to using it. Especially the lite model.

  • @Bangs_Theory
    @Bangs_Theory 13 часов назад

    Merry X-mas King!

  • @mrinalraj4801
    @mrinalraj4801 12 часов назад

    Thanks for the video 😊

  • @andrinSky
    @andrinSky 14 часов назад +2

    Hello Is it possible to work with Deepseek perhaps with Cline or RooCline. If yes how can i do this. Because this would be very Great! I could be very good for Coding.

    • @sinapxiagency
      @sinapxiagency 14 часов назад

      Use in cline the Open ai compatible api

    • @andrinSky
      @andrinSky 13 часов назад

      @@sinapxiagency And how are the Settings under "Open AI compatible AI"?
      I Mean the "Base URL"?
      and The "Model ID"?

    • @finnpoitier
      @finnpoitier 13 часов назад

      @@sinapxiagency Do you know, which provider? Openrouter?

  • @Wesley58481
    @Wesley58481 10 часов назад

    Tks for sharing!! u so incredible!

  • @SudeeptoDutta
    @SudeeptoDutta 12 часов назад

    So, If I'm already paying for the 2.5 API using Continue extension, it should automatically start using v3 right? No need to configure any new API key right?

    • @AICodeKing
      @AICodeKing  3 часа назад

      Yes, it should automatically switch

  • @JohnLewis-old
    @JohnLewis-old 14 часов назад

    How fast are inference speeds? Did it do well in fine?

  • @sharvin9283
    @sharvin9283 Час назад

    Is there anyone can help to answer this ....why previously deepseek say its based on gpt4 model when we asked what ai model are u using but yeah i know now they already fix it but why its say gpt 4 at the first place?
    Is that mean they training use older model?

  • @santypk5
    @santypk5 3 часа назад +1

    Why Australia and not Mongolia ?

  • @fabiankliebhan
    @fabiankliebhan 12 часов назад

    Will deepseek v3 be available for cursor?

  • @jeffwads
    @jeffwads 8 часов назад

    I asked QwQ 32b the 4th question and it refused on the grounds that it may be part of a competition test and it wouldn't be fair, etc. It can be stubborn at times but I hope this isn't a sign of things to come.

  • @DouhaveaBugatti
    @DouhaveaBugatti 4 часа назад

    Um can you also add questions for coding in other frameworks like svelte etc.
    This will tell how much useful this model can be for building real applications

  • @greenpulp.
    @greenpulp. 14 часов назад

    Nice! How do we use it with Cline in VS Code?

    • @karamjittech
      @karamjittech 3 часа назад

      Use openai compatible from Cline settings.

  • @pranjalsuthar9476
    @pranjalsuthar9476 10 часов назад +1

    hey...You are making amazing videos. Please make video on organised files by AI

  • @UsmanAli-ve6tq
    @UsmanAli-ve6tq 14 часов назад

    Is there any model which was able to answer question 4 and achieved 100% score.

  • @TawnyE
    @TawnyE 14 часов назад +1

    E
    Merry Christmas 🎄🎅

  • @collinsk8754
    @collinsk8754 13 часов назад

    Great work. And it's finally up to date with NextJS 14! 🙌🙌

  • @nolannosike
    @nolannosike 9 часов назад

    is question 5 correct? you should get a decimal no? 20% of 48 is 9.6 so shouldnt the answer be 38, 38.4 to be exact? the way it did it also seems correct but we're getting two diff answers.

    • @AquaAstronaut23
      @AquaAstronaut23 8 часов назад

      That’s 20% of the inflated number 48 not the original number 40. You need to divide by the percentage as a decimal (1.2) to work it back.

  • @dfasfa6657
    @dfasfa6657 13 часов назад +1

    DeepSeek told me it was GPT 4 after asking "what AI model are you"

    • @skpassegna
      @skpassegna 12 часов назад

      Maybe it is already fixed, since it is replying correctly.

    • @とふこ
      @とふこ 40 минут назад

      ​@@skpassegna in English not in other languages

  • @DemocracyDecoded
    @DemocracyDecoded 13 часов назад

    I need to see a direct comparison to sonnet 3.5 new in all your videos buddy.

  • @Opeyemi.sanusi
    @Opeyemi.sanusi 12 часов назад

    I am amazed

  • @aislanarislou
    @aislanarislou 8 часов назад

    What about programming skills ?

  • @ProkopHapala
    @ProkopHapala 9 часов назад

    The biggest problem I always have with using DeepSeek for programming is the speed (14 token/s, its like 5times slower than Sonet and 10 slower than GPT), I hope they adress also that.

  • @formixcode
    @formixcode 11 часов назад +1

    yes I can sense their suddenly better in coding solving here is why

  • @VincentLannoo-Bourton
    @VincentLannoo-Bourton 8 часов назад

    I know, there is twoo Rounds...
    But...
    first:
    After careful research, there is no known English adjective of Latin origin of 11 letters that has the first and last letter identical and whose vowels appear in alphabetical order.
    second was jusr "are you sure?"
    and: The adjective that fits all the specified criteria is "sententious."
    11 letters: S-E-N-T-E-N-T-I-O-U-S.
    Adjective: It describes someone who is given to moralizing or using maxims.
    Latin origin: The word derives from the Latin "sententia," meaning opinion or meaning.
    First and last letters are the same: Both are 'S.'
    Vowels in alphabetical order: The vowels are E, E, I, O, U, which follow the sequence A, E, I, O, U (with A not present).
    Thus, "sententious" satisfies all the conditions.

  • @misterleo885
    @misterleo885 13 часов назад

    QvQ 72B Test please

  • @paulyflynn
    @paulyflynn 5 часов назад

    amazing

  • @varunaeeriyaulla
    @varunaeeriyaulla 14 часов назад

    Bro, I just asked, "What is the AI model I’m chatting with?" (using the Deepseek API via OpenWebUI). The answer is "You're currently chatting with OpenAI's GPT-4".
    I asked the same question from the chat and the code model. Are they reselling the OpenAPI GPT4???? Crazy. Please run a test.

    • @UsmanAli-ve6tq
      @UsmanAli-ve6tq 14 часов назад

      I got the same answer :)

    • @gui1236100
      @gui1236100 14 часов назад

      Maybe they used training data generated by gpt-4

    • @varunaeeriyaulla
      @varunaeeriyaulla 14 часов назад

      @@UsmanAli-ve6tq Yes, I asked the same question from the Deepseek chat interface, and it says "You're currently chatting with DeepSeek-V3". Very strange.

    • @varunaeeriyaulla
      @varunaeeriyaulla 13 часов назад

      @@gui1236100 then why it's only one API but not on deepseek chat interface?

    • @dfasfa6657
      @dfasfa6657 12 часов назад

      @@gui1236100 where are u from?

  • @rashad6459
    @rashad6459 10 часов назад +1

    I cant keep up😂😂😂

  • @fun8711
    @fun8711 9 часов назад

    Question number 4 stand on ten toes

  • @perfectartiste6332
    @perfectartiste6332 14 часов назад +1

    merry Christmas, first here

  • @limjuroy7078
    @limjuroy7078 12 часов назад

    Interesting!!!

  • @JoraMacKornev
    @JoraMacKornev 5 часов назад

    Rip 3.5 Sonnet and O3 😅

  • @miselgpt
    @miselgpt 14 часов назад +1

    Why not Mongolia? 😉

  • @chouawarasteven
    @chouawarasteven 5 минут назад

    Please code king, before comparing anything to claude 3.5 sonnet, make a list if tests that are purely based on different coding tasks.
    Many LLMs have claimed to surpass sonnet only to become garbage when it comes to coding.

  • @aleksanderspiridonov7251
    @aleksanderspiridonov7251 10 часов назад

    Finally🎉🎉🎉🎉🎉🎉🎉❤

  • @sontieudev
    @sontieudev 14 часов назад

    In v2.5 its slow, and impossible to use in my usecases.

  • @Luca-xr7bs
    @Luca-xr7bs 9 часов назад

    Uhmm I dunno

  • @aculz
    @aculz 14 часов назад +1

    wow, i have been waiting for this. i use deepseek as my main LLM since its the cheapest. great job to cover this model
    it seems we get our open-source model king this end of the year.
    Marry Christmas and Happy new year everyone 🎄🎄