Aider, Cursor, Windsurf: Build Apps with a SINGLE prompt, NO coding!

Поделиться
HTML-код
  • Опубликовано: 10 фев 2025

Комментарии • 79

  • @investfoxy
    @investfoxy 2 месяца назад +6

    Windsurf is byfar the best Ai IDE.. The only missing feature is the image upload. Other than that it is great

    • @vxdro
      @vxdro 2 месяца назад

      I feel like Cursor's "Tab" autocomplete is much better and faster than Windsurf's. I triedWindsurf for a day and switched back to Cursor right after because the Tab autocomplete felt too slow and less context-aware than that of Cursor's.

    • @razorbackroar
      @razorbackroar 2 месяца назад

      It’s there now lol

    • @xanthosoma1989
      @xanthosoma1989 2 месяца назад +1

      New update fixed that.

  • @chadjones4255
    @chadjones4255 2 месяца назад +9

    The strength of these applications (and bolt.new) is not building something with one command, it is interacting with a codebase.

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад

      I really appreciate your feedback @chadjones4255! I agree. On a real project, how often do we spin up a brand new code repo? Not all that often? So, even though sure, these assistants can help us quickly create these simple apps or get complex ones scaffolded really fast, the REAL value lies in what they can help us do AFTER a real production-bound app is scaffolded.
      But these exercises do provide a critical data point in determining just where we are with what these coding assistants are capable of. And spinning up a new, functional app during a short video is WAY easier to explain and understand - especially for folks who are new to software dev.
      As like when we're architecting/designing new software systems, it's important to have multiple "views" of the proposed system from various perspectives (e.g. component relationships, deployment, etc.), it's helpful to conduct many different types of experiments.

  • @MinaEllis-XXAI
    @MinaEllis-XXAI 2 месяца назад

    OMG this is awesome! Great demo, thank you for saving me the time of doing this myself... and money.

  • @EliSpizzichino
    @EliSpizzichino 2 месяца назад

    This video clearly show that given the same base model they all performs similarly.
    I've replicated and expanded this experiment to parse also pdf and markdown and improve the UI. I have to say I'm impressed! (and aider it's opensorce)

  • @ComplexACC
    @ComplexACC 2 месяца назад +2

    Aider - inconvenient
    Cursor - waste of money, over hyped
    Windsurf - high potential and well coordinated

  • @imcool3357
    @imcool3357 2 месяца назад +7

    You might want to retry by giving docs for chroma db (or atleast the langhchain subset) and bs4. The fact they all had similar errors indicates it was more an error of outdated source knowledge than the ide's themselves (Although I didn't read the docs he provided, so maybe it does mention how it should work)

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад

      I really appreciate your feedback @imcool3357! You are 100% correct. Had I gone to the trouble of first identifying the up-to-date API docs that the current version of LangChain community library is dependent on and loaded those docs into the context as well, it's likely none of the assistants would have had any errors - or maybe just one error vs 3-5.

  • @MaxZapara
    @MaxZapara 2 месяца назад +3

    Great content and very easy to follow then try all the tools shown

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад

      Thank you so much for your kind words @MaxZapara! I hope you find at least one of them helpful to you.

  • @boardsontt1756
    @boardsontt1756 2 месяца назад +18

    Everybody over uses Claude 3.5 sonnet but I have great experience using o1mini. Have o1 mini do heavy lifting and then create a summary + instructions for Claude or other model and the results yield are way better.

    • @sivaramsathiamoorthi87
      @sivaramsathiamoorthi87 2 месяца назад +1

      Fantastic use case, thank you let me try to implement this strategy in my app

    • @ontheruntonowhere
      @ontheruntonowhere 2 месяца назад +3

      Could you define what you mean by heavy lifting?

    • @soulacrity7498
      @soulacrity7498 2 месяца назад +2

      I also would like to know what heavy lifting is. Could you go into detail what you mean for your whole process?

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +8

      I really appreciate your feedback @boardsontt1756! I won't deny that folks are having success using o1 mini. It's best at "planning" (I mean, reasoning is kinda its "reason for existing"), but according to the vast majority of folks I network with at least, it's not as good at generating/editing code. I could be wrong, because I'm basing this on hearsay, as I haven't really put o1 mini to the true test myself.
      Not that leaderboards are the "gold standard" by any means, but I've found that the aider team really does pretty much get it right with theirs, which is based on just on the LLM, but the LLM when used with aider: aider.chat/docs/leaderboards/. This point is critical. I never trust general LLM benchmarks.
      Also, in case you haven't already used aider's "architect" mode or seen this, here's aider's comparison of various LLMs combined with its architect and editor modes: aider.chat/2024/09/26/architect.html.
      If you watch tutorials on my channel, you'll know I kinda harp on this notion of "better" and "best" 😉 I put those notions in the same category as Santa Clause 🎅
      Could you provide an example of a code gen/editing task you tried using Claude that didn't work and then, with the exact same prompt and process, you switched to o1 and it just worked? I really do want to understand.

    • @boardsontt1756
      @boardsontt1756 2 месяца назад

      @@ontheruntonowhere Critical thinking/ planning out your file structure etc. Think of it as laying the foundation. I found that I can get amazing results when using o1 when compared to Sonnet, usually in the first shot. I usually have to go back and fourth or upload extra context to Sonnet. Many people shy away because of the price but if you’re building something you care about that shouldn’t be an issue. And if you’re only using it to get a foundation it’s pretty inexpensive compared to using Sonnet.
      Extra tip: Brainstorm in Perplexity if you have a pro membership. Create a space for what ever project it is your trying to build (spaces in perplexity are like custom gpts) upload as much context as you want and talk to Sonnet via perplexity. To increase your context window edit responses that don’t really to stay. Regenerate answers if you feel you’re not getting the results you want or approach your prompt differently (make sure to edit bad/ poor original prompt).

  • @Andrielson_FS
    @Andrielson_FS Месяц назад

    Nice video! Thanks for teaching us so much! Do you think of doing a video exploring the aider architect feature?

  • @marleymomo9582
    @marleymomo9582 2 месяца назад

    Most youtubers give overhype to AI tools, you tell the actual comparison between them with any hype. Like this type of content.

  • @investfoxy
    @investfoxy 2 месяца назад +1

    Sir, which Theme are you using for your IDEs?

  • @augmentos
    @augmentos 6 дней назад

    Would love an update with Replit and Trae added to this, maybe even project idx

  • @favourdev
    @favourdev 2 месяца назад

    Just so you know this video potentially solved an issue I had with my code base
    Where I didn't know I had to use embeddings but rather I was passing a whole long text to the bot which compromised a whole lot of things including time

  • @lakergreat1
    @lakergreat1 2 месяца назад +2

    I would love for you to add a small follow up video with Cline as well (formerly claude dev). It's pretty cool how it can use computer use to debug the code itself and think it would be a great addition to this trio.

    • @TreeLuvBurdpu
      @TreeLuvBurdpu 2 месяца назад

      I've been using Cline for a month or two and it's great, but I've been ringing up quite a tab in it with Claude Sonnet

  • @hannespi2886
    @hannespi2886 2 месяца назад

    Yes Yes Yes, great video!

  • @gani2an1
    @gani2an1 2 месяца назад +1

    windsurf for the win

  • @BleedingDev
    @BleedingDev 2 месяца назад +7

    One thing that really bothers me is pasting screenshots instead of direct text errors. It just doesn't make sense. It's way slower, way more expensive in API and way less precise.
    On the hand I'd love to see comparison with Wingman-AI which is VS Code extension. :)

    • @untoldecay
      @untoldecay 2 месяца назад +1

      Agreed, for non ui related errors, it’s useless.
      Also use a screenshot to ocr app to extract text form instant screenshot and paste in ai chat.

    • @OzCoruhlu
      @OzCoruhlu 2 месяца назад

      100%. and you need to host it send to windsurf as url

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +2

      I really appreciate your feedback @BleedingDev! I don't disagree with your point about using images when all the same error info is available as text right in the dev env.
      I truly only intended to demonstrate an important feature of aider and most AI coding assistants.
      Breakages aside, there absolutely will be situations in which taking a screenshot is by far the best way to provide context to the assistant. For instance, I have a screenshot of a mockup or wireframe I created that I want to use to advise the assistant on what I'm planning to build. That will GREATLY reduce my prompting load. Or, maybe I notice something isn't being rendered correctly? Can't get that from any error logs.
      I know that's not your point. Just wanted to call out that the "screenshot, copy/paste" is a really critical coding assistant capability.
      Also, I measured with aider and the most recent version of Claude 3.5 Sonnet. I pasted a very detailed and "busy" screenshot and asked aider to explain it. It nailed it and my Anthropic cost was one penny.

    • @BleedingDev
      @BleedingDev 2 месяца назад

      @@CodingtheFuture-jg1he thanks for the reply! :) I would go like "it's possible, but this time it's better to just go text way".
      I think it is important to ask correctly. It was truth in the past and it's truth now. If you know how to ask people, Google or AI, you are way more powerful not just as developer. :)
      Also for the fair comparison it would be better to compare same input. I know it doesn't make sense here, but in some edge-cases it would be critical!

  • @FrankDenbowDotCom
    @FrankDenbowDotCom 2 месяца назад

    Building the detailed prompt could be an app by itself tbh

  • @artistaartificial5635
    @artistaartificial5635 2 месяца назад +1

    aider runs locally? if i don't have a powerful pc i can't use it?

    • @TreeLuvBurdpu
      @TreeLuvBurdpu 2 месяца назад

      You can use it with server models like Anthropic or Open AI and others. Aider has a leader board for AI models on their site.

  • @augmentos
    @augmentos 2 месяца назад +10

    Would be good with Bolt or OttO and wish you expanded the exercise such that you could draw conclusions at the end

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +7

      I really appreciate your feedback @augmentos! I hear you. But, as I stated, I intentionally do not want to draw the conclusion for anyone as to "which assistant is best". All 3 are quite capable of this kind of task. The choice of which to use is likely based on your personal preferences and style of working.
      My advice: the moment anyone tells you "this tools is the best", without saying clearly "best at what" and "why is that particular strength most important", respectfully nod and then immediately purge your memory of every bit of "advice" that person gave you. I've been in software engineering a long time and I can tell you that people abuse the terms "best", "best practice" and "antipattern" to add the perception of authority to what is nothing more than their personal opinion. It's a human thing, but we need to be aware. Gotta take it all in and decide for ourselves 😀

    • @bambanx
      @bambanx 2 месяца назад

      Otto?

  • @puremajik
    @puremajik 2 месяца назад +8

    Very interesting. All three AI coding assistants (Aider, Cursor, and Codium Windsurf) successfully built the RAG app, but needed different amounts of hand-holding:
    Codium Windsurf: Only needed 2 fixes
    Aider: Needed 3 fixes
    Cursor: Needed 4 fixes
    They all hit similar bumps (mostly API and config stuff) and ended up with working apps.

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +3

      Yep @puremajik! Now, honestly, I wouldn't read too much into the number of fixes per assistant. Maybe I should've have said that. The reason is: because part of the performance of these tools is how their developers have designed them, but a major part is the backend LLMs. Since LLM output is totally non-deterministic, I likely could have repeated this exact exercise with each assistant 5 times and during some runs, one assistant would outperform another one that previously outperformed it.
      I think the big takeaway is that ALL 3 are quite capable of this kind of task. Right now at least, the choice of which to use is likely based on your personal preferences and style of working.

    • @hayder978
      @hayder978 2 месяца назад +1

      ​@@CodingtheFuture-jg1heI found your video after seeing half a dozen that tried to convince me this IDE is better than an another one. I was skeptical because they all use the same LLM model. And I can see they all seems converging to provide the same features, especially now with cursor agent. You've just confirmed with your experiments what I was thinking. Very reasonable conclusion without any hype. Thank you so much! You've got one more subscriber.
      P.S.: Have you tried Cline before? I guess it will be the same too though. Real breakthrough comes with new LLMs and not IDEs.

  • @SoloJetMan
    @SoloJetMan 2 месяца назад +4

    was kinda routing for windsurf, but the lack of vision (attachment) is a dealbreaker for me

    • @olegdragora2557
      @olegdragora2557 2 месяца назад +5

      I would expect them to add it soon, since the rest of their competitors already have it and it doesn't seem like it requires some sort of engineering breakthrough.

    • @aculz
      @aculz 2 месяца назад +4

      chill bro, it just release couple days ago. they sure will add some more. but currently its good already with the initialization

    • @ItsUforia
      @ItsUforia 2 месяца назад +1

      The flows work well on small interactions, but I just wrote a library that puts all interactions into a DB, as well as the success or failure of the request. I kept on having a chat go gray and having to reload windows. I lost interest in a few hours, but I was also growing tired of vs code forks...

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +3

      You know, that was one of my initial reactions @SoloJetMan! Then I reminded myself that this, as well as the ability to add reference docs like aider and Cursor allow, are far simpler features for the Codeium team to add than the core Cascade capabilities (which are a considerably higher bar). If they can achieve what we're seeing in Cascade so far, it should be trivial for them to add the other features.

  • @MTNtf2
    @MTNtf2 2 месяца назад

    Would you say developers will be obsolite? If so when?

    • @Baltazar-l9n
      @Baltazar-l9n 2 месяца назад +3

      juniors will be obosolete .

  • @regalx1
    @regalx1 2 месяца назад

    Yes even though Aider, Cursor, and Windsurf all completed the task successfully.
    Cursor and Windsurf are way cheaper than Aider, and Windsurf is half the price of of even Cursor.
    So Windsurf is the winner here if you want the most bang for your buck.

    • @joeking5211
      @joeking5211 24 дня назад

      Am a newbe looking to start this AI coding. I thought Aider was free, naturally you have to add you LLM, but you say it is the most expensive, i am very lost here in this sea of AI coders after only been looking in the last 7 days and there seems to be so many contradictions everywhere you look. But Aider most expensive, i may have looked at the work thing then, wow, this won't be easy I guess. Rgds.

    • @regalx1
      @regalx1 24 дня назад

      @@joeking5211 Well if there's any consolation I still stand by what I said. I'm relatively new myself and Windsurf does what I need it to do. But if you are super new
      You can also use something called "Google AI Studio" that allows you to also share your screen with an LLM and it can explain to you what how to use the other LLMs you're looking at.

  • @AaronBlox-h2t
    @AaronBlox-h2t 2 месяца назад

    Well then....I will go with free. haha. Thanks for the video!

  • @GregDowns
    @GregDowns 2 месяца назад +3

    There is not that much between all the big assistants ('the many faces of Claude'), so it'll be interesting to see which one(s) emerge from the Darwinian stew they're all in at the moment. They surely won't all survive. Many will merge into others. A few will disappear.
    I've tried them all and could feasibly go with any of them and be happy, but Windsurf has had the least amount of friction for me, as was seen in this test, and their pricing is the most competitive. Cursor has massive brand awareness though. Aider witha large Local LLM would be best of all, but my potato machines don't like that.

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад

      I really appreciate your feedback @GregDowns! I agree. There are far too many tools, with more and more coming out on a daily basis. The market will eventually winnow them down to a few - and the rest will be minor niche players.
      The one we don't talk much about - mainly because from a quality perspective, it's way behind these other coding assistants so far - is GitHub Copilot. Even though it's unlikely to be as top-notch as the Cursors and Codeiums and aiders of the world, there's just this tradition of C-levels and engineering leads at companies buying such "corporate-friendly" solutions. They see them as lower risk. In the long run, Copilot will likely win the day within the enterprise. That will take a bite out of the market as well. In my experience, it's the very rare CTO who asks the engineers "have you experimented with tools to do XYZ? Which have you found to be the most productive? ☹

  • @TyBohannon
    @TyBohannon 2 месяца назад

    I read the description. What LLM was used with Aider? Assume the fine tuned models from Cursor and Codium were used respectively.

  • @tobitege
    @tobitege 2 месяца назад +1

    For the Cursor test I'd say the first 2 issues were with Sonnet, not Cursor. Cursor did all the edits correctly, but Sonnet made the actual errors in coding. Which was a bit surprising to me since you mentioned it was supposed to be the same model as used in Aider?

    • @MeowEngineer
      @MeowEngineer 2 месяца назад +1

      Well if you knew anything mr Keyboard warrior... cursor is responsoble for setting up the system prompt.
      Which greatly affects sonnets logic..

    • @aculz
      @aculz 2 месяца назад +3

      AI Agent is the one who has the responsibility to setting up the system prompt. thats why there is a different between them. so its cursor fault not sonnet, cause they all using sonnet with the exactly same prompts which will enhanced by the AI Agent itself. so its cursor fault

    • @ontheruntonowhere
      @ontheruntonowhere 2 месяца назад +1

      @@MeowEngineer don't be a dick. everyone's level of expertise is different

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +1

      I really appreciate your feedback @tobitege! IMHO, you're partially correct. Although as some others have stated, you do have to take into account that the performance of any coding assistant is the combination of the backend LLM's capabilities AND the way the coding assistant devs have designed and implemented the assistant. BTW, it's not just the prompts they've designed - it's also all the CODE they've written around all this stuff. If you'd like to get a sense of what I mean, browse aider's codebase: github.com/Aider-AI/aider. You'll find it's WAY more than aider just taking your prompts, wrapping them with aider's prompts and passing it all through to the LLM.
      Also, if you pay attention to the errors each assistant made (other than the one API key error with Cursor, which I clarified was my fault not Cursor's), you'll notice a pattern: the errors were related to either a ChromaDB API issue or a BeautifulSoup API error. Now, I used Claude 3.5 Sonnet with all 3 assistants.
      What does that tell you? Well, it's pretty clear that the root cause of those errors was in fact Claude. That's why I say that you're kinda correct. But it's important to also note that the exact same can be said of both aider and Windsurf.

    • @antoniofuller2331
      @antoniofuller2331 2 месяца назад

      Ok Mr. Keyboard Warrior

  • @antoniofuller2331
    @antoniofuller2331 2 месяца назад

    You sound extremely intelligent, even if you're reading from a teleprompter. Amazing

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад

      Thank you for that @antoniofuller2331! Teleprompter?! What's a teleprompter 😉

  • @habibthephantom7040
    @habibthephantom7040 2 месяца назад

    Which one is free to use ?

  • @lancemarchetti8673
    @lancemarchetti8673 2 месяца назад

    Looks like there's a new kid on the block. Val Town has built a coder...called Townie.

    • @antoniofuller2331
      @antoniofuller2331 2 месяца назад

      it's not impressive as tho. New, but underwhelming.

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +1

      For some time, we're going to have some "new kid" every couple of weeks. Have you tried Townie? And other coding assistants? If so, what are your impressions so far?

  • @berserkerrxii5776
    @berserkerrxii5776 2 месяца назад

    Imagine actually learning how to code

    • @ontheruntonowhere
      @ontheruntonowhere 2 месяца назад +3

      I imagine spending two minutes creating this app but charging for two days, which I spend with your mom

    • @CodingtheFuture-jg1he
      @CodingtheFuture-jg1he  2 месяца назад +1

      I really appreciate your feedback @berserkerrxii5776! Oh, you GOTTA learn to code. And... you gotta still write some code and keep those skills sharp.
      I didn't get into that in this video (every video must have a core message or two), but if you watch my other videos on this channel, you'll see that I STRESS this point. Just can't revisit every concept in every video.