Lovely tutorial. Far superior to sifting through all the spaghetti that is the CUDA documentation.
These are some high quality tutorials! I really hope you’ll get more traffic to your channel soon!
Glad you are enjoying them!
This series of tutorials is very helpful for me, a novice who is just learning CUDA! Keep it up!
Love this high-quality tutorial. You have saved me so much time, thanks!
Thanks for the CUDA videos! They are very helpful!
Thank you. This is a high-quality tutorial. I have checked out your channel and it is great. Keep it up!
Thanks! Glad you have enjoyed the content!
Hi, I'm from Argentina. I would love to see the next episode of this series!
I'm new to the CUDA architecture, so I'm getting started with these videos and the book CUDA by Example.
Thanks a lot for these videos!
Any book you'd recommend for me?
Greetings!
Great video. I learned a lot. Thank-you sir.
Hi, Nick; amazing tutorial; I was looking for something like this; it gets down to coding right away.
nvprof is no longer supported on devices with compute capability 8.0 or higher. To profile on those, one can use, for example: nsys profile --stats=true -t cuda ./vector_add_unified_memory.out
Also, will there be a version 2 of your previous sum reduction videos? Thanks a ton for sharing!
Please make more CUDA videos!
Quick question, sir: if the size of the memory is bigger than the GPU's max size, what will happen? For instance, if you have 3 GB of data allocated on the host, how does unified memory deal with it? Will it send it over in chunks, or what?
Good question - it somewhat depends on the GPU arch you're working with. For some, you'll be limited by the max capacity of the GPU (i.e., you can't reserve a unified memory region larger than what is available on the GPU). Newer GPUs (Pascal and later, and only on Linux) support memory oversubscription, where you can reserve more than what is available on the GPU (and data will be paged back and forth as needed).
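To illustrate the reply above, here is a minimal sketch (not from the video; the kernel and sizes are placeholders) of allocating a managed region with cudaMallocManaged. On Pascal-or-later GPUs under Linux, such a region may exceed device memory and pages migrate on demand; on older GPUs or other platforms, an oversized allocation simply fails.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative kernel: doubles each element of a managed buffer.
__global__ void scale(float *a, size_t n) {
    size_t i = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
    if (i < n) a[i] *= 2.0f;
}

int main() {
    const size_t n = 3ULL << 28;  // ~3 GB of floats (example size only)
    float *a = nullptr;

    // cudaMallocManaged reserves a unified region visible to host and device.
    // With oversubscription support, n * sizeof(float) may exceed GPU memory.
    cudaError_t err = cudaMallocManaged(&a, n * sizeof(float));
    if (err != cudaSuccess) {  // without oversubscription, a too-large request fails here
        printf("alloc failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (size_t i = 0; i < n; i++) a[i] = 1.0f;  // pages first touched on the host
    scale<<<(unsigned)((n + 255) / 256), 256>>>(a, n);  // pages fault over to the GPU as accessed
    cudaDeviceSynchronize();

    cudaFree(a);
    return 0;
}
```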
@@NotesByNick Thank you for your quick reply, sir. BTW, good videos - hope to see more.
Thank you!
Happy to help!
Hi, thanks so much for all the videos, which have been very helpful. If you don't mind, I have a question. For some reason my device has significantly more transfer counts than yours, even when using the prefetch code. Do you know what might be the issue?
==25060== Unified Memory profiling result:
Device "GeForce RTX 2080 Ti (0)"
Count  Avg Size  Min Size  Max Size  Total Size  Total Time  Name
   16  32.000KB  32.000KB  32.000KB  512.0000KB  292.5000us  Host To Device
   36  35.555KB  32.000KB  128.00KB  1.250000MB  6.072400ms  Device To Host
Different GPUs and driver versions will likely behave differently. I had to play around with my hints to get the results I did in the video. Playing around with the hints on your system will likely be your best bet. Hope this helps!
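For anyone tuning these hints on their own system, the calls in question look roughly like this. This is a sketch, not the video's code; the buffer size is a placeholder, and the advice flags worth experimenting with vary by system.

```cuda
#include <cuda_runtime.h>

int main() {
    const size_t bytes = 1 << 20;  // placeholder size
    int device = 0;
    cudaGetDevice(&device);

    float *a = nullptr;
    cudaMallocManaged(&a, bytes);

    // Optional hint: tell the driver where the data will mostly live.
    cudaMemAdvise(a, bytes, cudaMemAdviseSetPreferredLocation, device);

    // Prefetch to the GPU before launching kernels, to avoid
    // on-demand page faults during the kernel itself...
    cudaMemPrefetchAsync(a, bytes, device);
    // ... kernel launches would go here ...

    // ...and back to the host before reading results on the CPU.
    cudaMemPrefetchAsync(a, bytes, cudaCpuDeviceId);
    cudaDeviceSynchronize();

    cudaFree(a);
    return 0;
}
```

Note that these prefetch/advise calls require concurrent managed access support (Pascal or later on Linux); elsewhere they are effectively no-ops or unsupported, which is one reason results differ so much between systems.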
@@NotesByNick Hi, thanks for the reply. I believe it's because I'm on Windows, and the prefetch commands don't work there, hahaha.
Ah, makes sense! Haha