Intro to CUDA (part 5): Memory Model

  • Published: 24 Jan 2025

Comments • 30

  • @MichaelB-yw8tl 5 years ago +28

    WOW !!! THE BEST EXPLANATION EVER !!! SO CLEAR !!!

  • @HWei-eh3qg 4 years ago +8

    Excellent explanation of GPU memory hierarchy. Good job!

  • @devanantMe 3 years ago +2

    This is the best CUDA tutorial on the internet!!!

  • @seanmylaw 3 months ago

    Thank you! The comment made at 10:42 is probably one of the most important parts! Essentially, one thread (within a given block) will create/allocate the shared array, and all other threads within the same block will ignore that line of code and assume that it has been handled by the first thread.
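
A `__shared__` variable exists once per thread block (not once per thread, and not once per grid), and every thread of that block sees the same copy. The following is a minimal sketch of that behaviour, not code from the video: the kernel name `tag_by_block` and the host helper `run_tag_by_block` are hypothetical, and the helper assumes `n > 0` and skips CUDA error checking for brevity.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Thread 0 of each block writes the block's own shared variable; after the
// barrier, every thread of that block reads the same value back.
__global__ void tag_by_block(int *out, int n) {
    __shared__ int block_tag;                 // one instance per block
    if (threadIdx.x == 0) {
        block_tag = blockIdx.x;               // written by a single thread
    }
    __syncthreads();                          // make the write visible block-wide
    int i = threadIdx.x + blockDim.x * blockIdx.x;
    if (i < n) {
        out[i] = block_tag;                   // read by every thread of the block
    }
}

// Hypothetical host helper (assumes n > 0; no error checking).
std::vector<int> run_tag_by_block(int n, int threads_per_block) {
    std::vector<int> host(n, -1);
    int *dev = nullptr;
    cudaMalloc(&dev, n * sizeof(int));
    int blocks = (n + threads_per_block - 1) / threads_per_block;
    tag_by_block<<<blocks, threads_per_block>>>(dev, n);
    // cudaMemcpy on the default stream waits for the kernel to finish.
    cudaMemcpy(host.data(), dev, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(dev);
    return host;
}
```

Calling `run_tag_by_block(10, 4)` launches three blocks; each output element holds the index of the block that wrote it, which shows that threads in different blocks read different copies of `block_tag`.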

  • @RPD_ps 3 years ago +4

    This was astoundingly good. Thank you so much for making these lectures and for releasing them to the public.

  • @potreschmotre1118 4 years ago +3

    I'm gonna give you a subscribe. Keep up the good work. Sometimes I am really surprised how rare it is to find an actually good introduction to computer-science-related topics.

  • @rakhekhanna 1 year ago

    Extremely good explanation of a complex topic in an easy way. You are awesome!

  • @mauriciovanzulli1545 2 years ago

    Extremely clear explanation of the different kinds of GPU memory, clearing away the spurious dust around GPUs. Really useful info for better GPU memory management. THX from UY!

  • @TheAIEpiphany 1 year ago +3

    Great stuff! Everything I needed to form a decent mental model of a GPU

  • @linmartin1841 1 year ago +1

    The best CUDA tutorial !!!

  • @AndrewCodeDev 4 years ago

    Fantastic explanation! Looking forward to future content if you decide to produce more. Much appreciated so far!

  • @azheng9090 3 years ago

    This is much better than reading the CUDA C book...

  • @pranavraj4855 11 months ago

    This was so helpful, I feel like I need to do more than just subscribe 😀

  • @samarthtandale9121 9 months ago

    Great content indeed!!! I'm willing to pay for such content 🔥

  • @Gruby7C1h 3 years ago +1

    This series is great! How come such a video has only 4K views after over 2 years? Is everybody on a Python hype-train these days? ;)

    • @azheng9090 3 years ago +1

      Maybe not many people use CUDA.

  • @whydoineedausername1386 3 years ago

    These are great.
    And they're pre-covid. What other good lectures were released online pre-pandemic?!

  • @kjkszpjab1510 4 years ago +1

    Brilliant!

  • @ayoubaarabi4296 3 years ago +1

    Thank you

  • @samarthtandale9121 1 year ago

    Hello Sir! I have a question: wouldn't the transfer of data from CPU to GPU (global memory) be the bottleneck in the entire architecture? You teach great! Thank you!!!

  • @OmarSalahTawaba 1 year ago

    Thank you, sir, for the great lectures. How can we access the rest of the course? Are there more than these 6 parts?

    • @taivas7216 1 year ago +1

      Look up the books he's showing at the end of the video.

  • @MultiNeurons 3 years ago

    Thanks for this video; the content is very well explained.
    One issue with the __shared__ memory access: I think there is a mistake in the proposed program. The index i = threadIdx.x + blockDim.x * blockIdx.x is right for accessing the "IN" array stored in device memory, but shared_array should be indexed by "int j = threadIdx.x" instead of i. Otherwise N (the size of the array) must be no larger than the block size (at most 1024 threads). If N is larger than the block size, out-of-bounds access errors could arise. Do you agree?
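
The indexing this comment proposes can be sketched as follows. This is not the program from the video: the kernel name `scale_via_shared`, the constant `BLOCK_SIZE`, the doubling operation, and the host helper `run_scale` are all assumptions made for illustration, and the helper assumes a non-empty input and omits CUDA error checking. The point is only that global memory is indexed by the grid-wide index `i`, while the per-block shared array is indexed by the block-local index `j`, so it never needs more than `BLOCK_SIZE` slots regardless of N.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr int BLOCK_SIZE = 256;  // assumed threads per block

__global__ void scale_via_shared(const float *in, float *out, int n) {
    __shared__ float tile[BLOCK_SIZE];                // one tile per block
    int i = threadIdx.x + blockDim.x * blockIdx.x;    // grid-wide index (global memory)
    int j = threadIdx.x;                              // block-local index (shared memory)
    if (i < n) {
        tile[j] = in[i];      // tile[i] would run past the tile for every block after the first
    }
    // Not strictly needed here, since each thread reads only its own slot;
    // it becomes necessary once threads read slots written by other threads.
    __syncthreads();
    if (i < n) {
        out[i] = 2.0f * tile[j];
    }
}

// Hypothetical host helper (assumes a non-empty input; no error checking).
std::vector<float> run_scale(const std::vector<float> &in) {
    int n = static_cast<int>(in.size());
    std::vector<float> result(n);
    float *d_in = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, n * sizeof(float));
    cudaMalloc(&d_out, n * sizeof(float));
    cudaMemcpy(d_in, in.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    int blocks = (n + BLOCK_SIZE - 1) / BLOCK_SIZE;   // round up to cover all n elements
    scale_via_shared<<<blocks, BLOCK_SIZE>>>(d_in, d_out, n);
    cudaMemcpy(result.data(), d_out, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    return result;
}
```

With an input of 1000 elements the helper launches four blocks of 256 threads; the `i < n` guards keep the last, partially filled block from touching memory past the end of the arrays.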

  • @coder3652 3 years ago

    Concise and Precise

  • @myelinsheathxd 4 years ago +1

    thx!

  • @Jax9835 2 years ago

    What's the difference between a CUDA core and a thread?

  • @joymatrix576 2 years ago

    Very clear, learned a lot.

  • @Os-el9hz 11 months ago

    CUDA 101

  • @yupp_harish3936 1 year ago

    Can't understand your English accent.. 😢

  • @戴志远-z3q 8 days ago

    This is the best CUDA tutorial on the internet!!!