Coding With OpenAI-o1 in Cursor - Can We Replace Claude 3.5 Now?

  • Published: Dec 23, 2024

Comments • 65

  • @eposnix5223
    @eposnix5223 4 days ago +27

    Using o1 for setting up a project and Claude for doing the small edits is the best of both worlds. I love how cleanly formatted o1's code is!

    • @19LloydG
      @19LloydG 4 days ago +3

      Indeed! The o1 model is definitely superior for coding; so far it has been flawless and has surpassed the preview, as expected. The latency is a small price to pay. However, I am keen to see how Anthropic respond 🙂

    • @Kiki-qh7xk
      @Kiki-qh7xk 4 days ago +1

      @@19LloydG in my experience, claude is still better.

    • @suicidequad
      @suicidequad 4 days ago

      i still get better results with claude. however i agree on the clean formatting.
      a really underrated practice is to generate examples with whatever model is best at that and feed it as context to your workhorse model.
      having a dataset of good examples works wonders for any llm application.
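
      The practice described above (collect good examples from one model, then feed them as context to the workhorse model) can be sketched roughly as follows. This is a minimal illustration, not anyone's actual setup: the function name `build_few_shot_prompt`, the prompt wording, and the sample examples are all hypothetical, and no model API is called.

```python
def build_few_shot_prompt(examples, task):
    """Assemble a prompt that shows curated examples before the new task.

    `examples` is a list of (request, answer) pairs, e.g. answers produced
    by whichever model is best at this kind of output (hypothetical data).
    """
    parts = []
    for i, (request, answer) in enumerate(examples, start=1):
        parts.append(f"Example {i}\nRequest: {request}\nAnswer:\n{answer}")
    # The new task goes last so the workhorse model continues in the same style.
    parts.append(f"Now follow the style of the examples.\nRequest: {task}\nAnswer:")
    return "\n\n".join(parts)


# Hypothetical curated examples; in practice these come from your own dataset.
good_examples = [
    ("Write a function that adds two numbers.", "def add(a, b):\n    return a + b"),
    ("Write a function that doubles a number.", "def double(x):\n    return 2 * x"),
]

prompt = build_few_shot_prompt(good_examples, "Write a function that squares a number.")
print(prompt)
```

      The resulting string would then be sent to the workhorse model as context; how that is done depends on the tool or API in use.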

    • @__J____ff
      @__J____ff 4 days ago +2

      @@Kiki-qh7xk good for you, if you like an AI that can't even reason. Some people don't need reasoning depth, because their work and life lack the depth that they would need from an AI anyway.

    • @19LloydG
      @19LloydG 4 days ago

      There are many variables (use cases, prompting style and so on). I’m primarily interested in Swift coding. Recently an app enhancement that I wanted to implement was apparently difficult for Claude to implement, but when I gave the task to the o1 model it produced an elegant and robust solution immediately. However, I have had very good results when using Claude Sonnet 3.5 in other projects, so I will continue to use it. Thanks for your YouTube videos, which are always informative, helpful and interesting 👍

  • @fabiankliebhan
    @fabiankliebhan 4 days ago +4

    BTW you can also use gemini-2.0-flash-exp and gemini-2.0-flash-thinking-exp-1219 in Cursor (you need to add a Google API key and add them manually to models).
    In my first tests these models perform very, very well :)

  • @fabobg
    @fabobg 4 days ago +30

    So, how much did that o1 usage cost you?

    • @multi_variate
      @multi_variate 4 days ago +11

      Cursor charges 40 cents per o1 request.

    • @0xunknown336
      @0xunknown336 4 days ago +2

      that's the right question.

    • @ArcanoIncantatore
      @ArcanoIncantatore 4 days ago

      You have 10 o1-mini requests/day with Pro subscription, dunno about o1

    • @onlybryanliu
      @onlybryanliu 4 days ago

      @@ArcanoIncantatore isn’t it unlimited?

    • @samuellembke4565
      @samuellembke4565 4 days ago +1

      @@multi_variate if true that is actually insanely expensive hahahaha

  • @earthAP
    @earthAP 4 days ago +11

    What was the API cost for these chats, or is it included in the Cursor subscription?

    • @RoryDavidWatts
      @RoryDavidWatts 4 days ago

      I think, at the moment, the cost per call to o1 (and o1 preview) is 40 cents USD.

  • @StefanoKocka
    @StefanoKocka 4 days ago +6

    10:45 You opened the HTML file directly from the drive instead of accessing the node server. IMHO the one-shot worked.

  • @mostafaelbarrak6468
    @mostafaelbarrak6468 4 days ago +2

    I think you should compare it with Claude for the same tasks. Giving Claude the same tasks would allow you to compare both models in terms of response quality, speed, cost, and overall performance.
    However, I don't think the two tasks you provided are very representative, as the documentation appears to be well-structured and comprehensive. These tasks seem to mainly involve reading the documentation and combining the provided examples.
    It might also be valuable to test the same tasks on GPT-4 and even less advanced models, to better evaluate the actual level of difficulty.
    And what would be even more interesting is to assign O1 and Claude more complex real-world tasks, such as working within an existing codebase to add a new feature or solve a specific problem. Most models are quite good at generating standalone files like HTML, JS, or CSS, but they tend to struggle when dealing with an existing codebase.

  • @davefellows
    @davefellows 4 days ago +2

    You should use Composer rather than Chat. There's also an agent option there which may've worked better for one-shot.
    Also, you can add docs in Cursor itself under Features in the settings.

    • @witchcraft8118
      @witchcraft8118 4 days ago +1

      can only use gpt4 or sonnet 3.5 in Composer

    • @davefellows
      @davefellows 4 days ago

      @ ah! Good to know, thanks. Wonder if that will change in the future.

    • @ZlagCraft
      @ZlagCraft 4 days ago +2

      @@witchcraft8118 Not true, you can use any model in composer "normal" mode, if you want composer "agent" mode then you can only use 4o or sonnet 3.5

  • @UselessDuckCompany
    @UselessDuckCompany 4 days ago

    Is Cursor much better than just VS Code with roo-cline?

    • @UselessDuckCompany
      @UselessDuckCompany 4 days ago

      For example with roo-cline you can connect an Obsidian MCP so it can search and take in knowledge during its work

  • @satrianaliqa
    @satrianaliqa 4 days ago +3

    OHH WOW THE AUDIO TRACK TOO??? 🎉🎉😮

  • @FrozzenFreak
    @FrozzenFreak 4 days ago +2

    Do you know that you can just paste the URL of the website in the cursor chat? No need to copy the entire content over.

  • @yurijmikhassiak7342
    @yurijmikhassiak7342 4 days ago +3

    It's following Cursor rules much better than any other model. But the cost is high. It's easily $10/day.

  • @lilan1
    @lilan1 4 days ago

    thanks. was this costly?

  • @ridho6908
    @ridho6908 16 hours ago

    wow it's amazing that you added Indonesian audio.

  • @Leto2ndAtreides
    @Leto2ndAtreides 4 days ago

    How do local docs files work?

  • @davefellows
    @davefellows 4 days ago

    Looks very impressive. Would be interesting to see a direct comparison with claude with the same prompts

  • @alx8439
    @alx8439 4 days ago

    Do you always have to scroll through the chat to find different code snippets and apply them individually, one by one? Doesn't Cursor provide any better experience for dealing with this?

    • @ontheruntonowhere
      @ontheruntonowhere 4 days ago +1

      Instead of Chat, he could have used Composer, which will edit multiple files per prompt. All you have to do is approve or reject the changes. Sometimes Composer updates files when you're not expecting it, and I often find myself telling it 'don't change anything' when iterating a strategy for building a feature. I probably could get the same result by switching to Chat, but Composer feels smarter, dunno if that's true. Anyone have thoughts on that?

  • @_MrGameplay_
    @_MrGameplay_ 4 days ago

    Does anyone know whether o1 pro mode also will be available through the API and if so how much Cursor might charge for it?

  • @kritikusi-666
    @kritikusi-666 4 days ago +4

    isn't o1 like super expensive?

    • @AdamTwardoch
      @AdamTwardoch 3 days ago +1

      o1 is cheaper than GPT-4. o1 is $15+60/M while GPT-4 is $30+60/M. o1 produces more tokens so in practice its price is closer to GPT-4-32k.
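
      To make the arithmetic in the comment above concrete, here is a small sketch using the per-million-token prices it quotes ($15 input / $60 output for o1, $30 / $60 for GPT-4). The token counts are made-up assumptions for illustration, including the number of extra reasoning tokens o1 is assumed to bill as output; real requests will differ.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of one request given per-million-token prices."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Prices quoted in the comment above (USD per million input / output tokens).
O1_PRICES = (15.0, 60.0)
GPT4_PRICES = (30.0, 60.0)

# Hypothetical request; all three token counts are illustrative assumptions.
input_tokens = 2_000
visible_output_tokens = 1_000
o1_reasoning_tokens = 3_000  # assumed extra output tokens o1 spends on reasoning

gpt4_cost = request_cost(input_tokens, visible_output_tokens, *GPT4_PRICES)
o1_cost = request_cost(
    input_tokens, visible_output_tokens + o1_reasoning_tokens, *O1_PRICES
)

print(f"GPT-4: ${gpt4_cost:.2f}")  # GPT-4: $0.12
print(f"o1:    ${o1_cost:.2f}")    # o1:    $0.27
```

      Under these assumed token counts the cheaper input price is outweighed by the extra output tokens, which is the effect the comment describes.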

  • @CLSGL
    @CLSGL 4 days ago +3

    You know I really liked the idea of Claude, and I even coded with it for a time. But holy shit, their business model is annoying as hell within the actual Claude app. Even as a paying customer I had a limit per chat. Meaning I’d get really far into a project, get a cutoff mid-project, and have to start a new conversation with zero context.
    I use chats as projects. Funnel my entire limit into one chat for Pete’s sake.

    • @jaxxedbytes
      @jaxxedbytes 3 days ago

      yeah why shouldn't they give everyone the ability to code endlessly for $20 a month :) same with gas in cars - if you paid for a few liters and started your around-the-world trip, why do you have to refill so many times, it should just work till the end of time :)
      damn, people get used to incredible things so fast and feel so entitled.

    • @CLSGL
      @CLSGL 3 days ago

      @@jaxxedbytes that’s weird, I could’ve sworn I said “funnel my limit into one chat”, and not “I should get unlimited access”. ChatGPT 4o does this. It isn’t an unreasonable ask.

  • @fabiankliebhan
    @fabiankliebhan 4 days ago

    You really should use the composer with the agent option enabled in Cursor.
    Much better for tasks like the one you did in the video.

    • @darknessguy4221
      @darknessguy4221 4 days ago

      If he uses Cursor in Composer mode, the bill for using o1 is higher

    • @fabiankliebhan
      @fabiankliebhan 4 days ago

      @darknessguy4221 that may be true. There is a good alternative btw. The google flash-2.0 models are free at the moment and can be used in cursor (google api key is needed). They seem very very powerful and ultra fast.

  • @micbab-vg2mu
    @micbab-vg2mu 4 days ago +3

    Kris I do not have access as well - Maybe because we are from EU.

    • @superlama6452
      @superlama6452 4 days ago

      i have the o1 model available in the models options. i am from germany

    • @multi_variate
      @multi_variate 4 days ago +2

      @@superlama6452 Woke regulators are going to keep you away from innovation. Bring the change.

    • @ontheruntonowhere
      @ontheruntonowhere 4 days ago

      @@multi_variate Define woke?

    • @multi_variate
      @multi_variate 4 days ago +1

      @ontheruntonowhere regulators who don't understand the tech but pretend to.

    • @ontheruntonowhere
      @ontheruntonowhere 3 days ago

      @@multi_variate Woke is a term used to describe heightened awareness and activism regarding social and racial justice issues, often emphasizing the importance of equity and inclusion. Nothing to do with tech. You're using it as a shorthand to diminish responsible governance, which is wrong because our 'woke' government has created the most powerful country the world has ever known, precisely because the wealthy and corporations must abide by regulations meant to protect the health and well-being of all of us.

  • @marcusfdm6111
    @marcusfdm6111 4 days ago

    Amazing

  • @m4RIK
    @m4RIK 4 days ago

    that's impressive

  • @sethhamilton5824
    @sethhamilton5824 4 days ago

    The composer agent would have been interesting

  • @SimonNgai-d3u
    @SimonNgai-d3u 4 days ago +1

    I think it would be better to use Sonnet on a regular basis and solve the harder problems with o1, otherwise your wallet is gonna be broken 😂😂

  • @__J____ff
    @__J____ff 4 days ago

    how much do I have to pay you guys to advertise about my AI framework ? Mine is the sh1t

  • @Sofiane.ti2.0
    @Sofiane.ti2.0 4 days ago

  • @ThomasConover
    @ThomasConover 4 days ago

    You’re all doing AI coding completely wrong! 😂😂😂😂

  • @hammeedabdo.82
    @hammeedabdo.82 4 days ago

    Please add Arabic to the audio track.

  • @applego6290
    @applego6290 4 days ago

    All ai are useless just for boilerplate
    If one that train with an open source repo then u will get these type of ai
    dev u will get what I'm saying if ur not copy-paste

    • @okaydetar821
      @okaydetar821 4 days ago

      "Just for boilerplate" doesn't sound useless; boilerplate is very useful.

    • @anubisai
      @anubisai 4 days ago

      😂😂😂😂 wrong