Introduction to CPU Pipelining

Поделиться
HTML-код
  • Опубликовано: 1 июн 2024
  • This video motivates a simple, four stage CPU pipeline and demonstrates how instructions flow through it. It shows how a conditional jump can disrupt the pipeline's function because we need to flush the pipeline. In addition, it shows how choices that software engineers make can affect how often the pipeline gets flushed.
    Please subscribe to see more videos about how to write high quality code. If you have questions, email me at wizard.craft.code@gmail.com!

Комментарии • 86

  • @donaldwright2426
    @donaldwright2426 9 месяцев назад +2

    Breaking down technical topics like micro-electronics is not for everyone, but you've done a great job of translating something complex like a CPU pipeline into a simple, easy-to-grasp and understanding. Just thank you!

  • @eliasrezaei3931
    @eliasrezaei3931 4 месяца назад +2

    Best explanation ever! Never understood pipelining very well but your explanation made it so clear. Thanks!

  • @mtushar
    @mtushar 2 года назад +5

    Thank you, I never could before but this high level overview has helped me understand how conditional branching introduces inefficiency!

  • @brian_kirk
    @brian_kirk 2 года назад +1

    Thanks for making this! This was very helpful. You also have a great voice for narration and excellent pace! May have to check out all your other videos :)

  • @Sviskebisk
    @Sviskebisk Год назад

    Thank you for this. Structured, precise and to the point. I gained +5 intelligence from just this video.

    • @wizardcraftcode
      @wizardcraftcode  Год назад

      Thanks! That's a fun comment and I'm glad it helped!

  • @amsonyoutube
    @amsonyoutube 2 года назад +1

    Hey Merlin, you did fantastic job !!

  • @xedover
    @xedover 2 года назад

    Very nice explanation... I've been hearing about Pipelining CPUs, but until now I had no idea what they were talking about.

  • @azd1752
    @azd1752 2 года назад

    Great job ! Thank you for the clear explanation, it is very helpful !

  • @subhajitchatterjee1637
    @subhajitchatterjee1637 3 месяца назад +1

    I love how easy to understand this video is. Subscriber++

  • @favourmokwenye6387
    @favourmokwenye6387 Год назад

    super helpful!
    Really love how i understood it in no time

  • @JJJ-ee5dc
    @JJJ-ee5dc 2 года назад +2

    Omg so much quality content. blessed to found out this channel

    • @boitshepojoy6925
      @boitshepojoy6925 2 года назад

      Tell me abt it
      Quality information for free!

    • @wizardcraftcode
      @wizardcraftcode  2 года назад

      Thank you, both! I love it when my videos are helpful!

  • @AlbertDongler
    @AlbertDongler 4 месяца назад

    Superbly explained, thank you for taking the time to put this together! 😁

  • @michaelcrainiciuc
    @michaelcrainiciuc Год назад

    Great explained and easy to understand!

  • @berthor0m1
    @berthor0m1 2 года назад

    Thanks so much for this video! Very clear insight of cpu pipelining for the noob I am 😊

  • @sushmithaguggella1782
    @sushmithaguggella1782 2 года назад +1

    such beautiful explanation

  • @bowthunell
    @bowthunell Год назад

    Thank you for this well explanation. It's very helpful.

  • @piecucci
    @piecucci Месяц назад

    what a gen! 💎 instant subscribe!

  • @THERaikami1
    @THERaikami1 2 года назад +1

    Thanks so much for this video. I have an interview and I'm sure they will ask me about pipelining!

  • @dougm275
    @dougm275 2 года назад

    I'm learning scripting for now but I enjoyed this.

  • @suuuken4977
    @suuuken4977 2 года назад

    good work!

  • @jpmorallo
    @jpmorallo 16 дней назад

    This is helpful thank you

  • @HenTrademark
    @HenTrademark Год назад

    Love your vids xx♥️♥️♥️♥️♥️

  • @fantasypvp
    @fantasypvp 2 года назад

    now im wondering if cpu makers could improve the performance even more with parallelism on a singlethreaded application.
    so for example, if there are two paths of instructions being executed in a single code function, and some instructions didnt have much to do with others, maybe that instruction could be loaded into the cpu way before it is actually ready to be executed, and if the instruction after it doesnt rely on the result of that data, they could be processed in parallel by the same core.
    something like
    fn main() {
    let mut x = 0
    let mut y = 0
    for i in (0..1000).rev() {
    x *= 2
    y *= 2
    }
    }
    so for this code (rust), x and y are separate and dont depend on each other. would it be feasible to separate this into multiple threads purely using the cpu without any kind of explicit code to make it parallel?

    • @wizardcraftcode
      @wizardcraftcode  2 года назад

      Actually, there are already optimizing compiler that will know to put those on different cores in your CPU. Essentially, the compiler can control the parallelism on the cores within a CPU without the programmer having to explicitly code the multi-threading.

    • @mikafoxx2717
      @mikafoxx2717 4 месяца назад

      Even single cores have had multiple execution units and do batches, they also are smart about doing things in a new order so that things don't conflict, they also use unnamed registers to start new operations that other registers haven't yet been cleared for since it's out of order. Pipelining gets you to scalar, or one operation max per clock cycle, since the Pentium 3 or something, we've done out of order super-scalar processing

    • @wizardcraftcode
      @wizardcraftcode  16 дней назад

      That's so cool! So much magic underlying everything we do!

  • @nourghazal296
    @nourghazal296 Год назад

    Thank you alot, lets say that we keep all the stages the same but we add an extra execution stage. Would it not affect the speed of the CPU at all?

    • @wizardcraftcode
      @wizardcraftcode  Год назад

      Just duplicating the existing execution stage doesn't help because you can't get two instructions there fast enough (without duplicating everything). However, all of the stages have to take the same amount of time for the pipeline to work. If you had a really long execution stage and split it into two, then the "tick" of your pipeline (how fast things go from one stage to the next) would be doubled. When your pipeline was completely full, you'd pump out finished instructions at twice the speed.

  • @lanternarasu6061
    @lanternarasu6061 2 года назад +1

    Yo thanks , i have a very silly doubt here we saw 4 stage instructions and the first stage is fetch cycle , right? Correct me if i am wrong... In the fetch cycle the address of the instruction fetched is given to the address reg. And now the contents of the address reg is passed on the bus, then the instruction is fetched from memory right? Is the whole thing happens only at the first stage? For more clear view i will write like this
    Add reg.

    • @wizardcraftcode
      @wizardcraftcode  2 года назад +1

      The stages of the pipeline all have to take the same amount of time, so they all have to be as long as the store stage. So, there's time for the fetch of an instruction in the fetch stage. The adding to the PC is done in a separate path, in parallel. That's what the Addr in that phase is doing while the fetch from memory of the instruction is happening. So, the things you wrote happen, but not sequentially. I hope that helps!

    • @lanternarasu6061
      @lanternarasu6061 2 года назад +1

      @@wizardcraftcode thanks for taking your time to respond me👍

  • @pavanchytanya1283
    @pavanchytanya1283 2 года назад +1

    Thank u mam you taught it so well , please suggest a book for processor architecture( computer architecture in general)

    • @wizardcraftcode
      @wizardcraftcode  2 года назад

      I'm glad you liked it. For a book, I like Computer Organization and Design by Patterson and Hennessy. It's the standard textbook for lots of schools. Maybe other people can chime in with other suggestions!

  • @nipunmihimal9092
    @nipunmihimal9092 2 года назад

    Thanks lot❤

  • @rambazamba4164
    @rambazamba4164 2 года назад

    oooommmg fuck yes the little animation is so helpful and motivating for adhd its such a relief every time hahaha

  • @masbro1901
    @masbro1901 2 года назад

    hi, what is stall and flush and branch penalty in this concept, and what is its impact to pipeline. thank you.

    • @wizardcraftcode
      @wizardcraftcode  2 года назад +1

      I didn't cover stall in this video. Some pipelines are more complicated than the simple ones I talked about here and can recognize that two sequential instructions interfere with each other. Then they are smart enough to "stall" one until the data it needs is ready. Branch penalty is what I showed here - the cost of taking a conditional branch and how that empties the pipeline. That emptying of the pipeline is what "flush" means.

  • @JJJ-ee5dc
    @JJJ-ee5dc 2 года назад

    Thanks from india

  • @BlueNSour
    @BlueNSour Год назад

    for compiled languages, wouldnt the compilers optimise the conditions for the conditional branching?

    • @wizardcraftcode
      @wizardcraftcode  Месяц назад +1

      The compiler will, and the CPU will. My goal with this video was just to give a high level view of what is happening and why you should understand it

    • @BlueNSour
      @BlueNSour Месяц назад

      @@wizardcraftcode valuable insight you're sharing. Would love to see how the CPU might be able to do this for itself? I hadn't realised that was an option

    • @wizardcraftcode
      @wizardcraftcode  Месяц назад

      @@BlueNSour I'll put that on the list of topics I should make videos about. Thanks!

  • @nurlanimanov9503
    @nurlanimanov9503 Год назад

    Thanks for nice video and explanation. my question is "How many stages can CPU maximum does in one cycle, I mean what is the limit? I don' think that it is unlimited right?"

    • @wizardcraftcode
      @wizardcraftcode  Год назад

      The stages of a pipeline run in parallel meaning they all execute every cycle. In theory, you could build an N stage pipeline for any N, but, in reality, there's a limit to how many are useful.

  • @nourghazal296
    @nourghazal296 Год назад +1

    And what if I have 2 pipelines. Will the speed of the CPU double?

    • @wizardcraftcode
      @wizardcraftcode  Год назад +2

      That would essentially be a two core CPU. If your compiler distributes things across those cores well, yes, the speed will double.

  • @mahseratokpas631
    @mahseratokpas631 2 года назад

    It's a great explanation. Are you a professor?

    • @wizardcraftcode
      @wizardcraftcode  2 года назад +1

      I'm glad you like it. Yes! I am a Software Engineering professor at Shippensburg University. Don't hold that against me! :)

  • @semibiotic
    @semibiotic 2 месяца назад

    ALU does not "execute" instruction, it only do the math commanded by CU. It is CU, which does execute the instruction.

    • @wizardcraftcode
      @wizardcraftcode  2 месяца назад

      You are correct. It executes the math necessary for the instruction. The rest of what I talked about is the high level view of the rest of what you are talking about. This video is intentionally high level, but I could have been more precise with my language at that point.

  • @artnevka779
    @artnevka779 2 года назад

    robotic sound could improve

    • @wizardcraftcode
      @wizardcraftcode  2 года назад

      I'll try to work on that! Thanks for the feedback

  • @sreerajchundayil588
    @sreerajchundayil588 2 года назад +5

    Overall explanation is good. But that robotic voice part is irritating.

    • @wizardcraftcode
      @wizardcraftcode  2 года назад +1

      I'll have to work on being a bit more animated. Thanks for the feedback!

    • @mtushar
      @mtushar 2 года назад +3

      @@wizardcraftcode don't believe that, I love the animated part!