OpenMP: Reduction

HPC Education

15K views

Published: 7 Feb 2025
Hey guys! Welcome to HPC Education. Today we are looking at the reduction clause.
Let's go back to our example from the parallel for video. We are trying to find the sum of the first 100 natural numbers in parallel. Here's the code. Basically, we use the parallel for construct to add up 25 numbers in each of the four threads and then add these partial sums. thread_sum[ID] holds each thread's individual sum, while sum holds the combined total. We know this method causes false sharing: the elements of thread_sum sit next to each other in memory, typically in the same cache line, so every update by one thread invalidates that line for the others.
To tackle this issue, OpenMP provides a clause called reduction, which solves our problem more efficiently and avoids false sharing. Now let's look at the same problem with reduction incorporated into the code. Reduction is a clause that adds specific functionality to the for loop here. '+' is the operation we want to perform and sum is the reduction variable. Reduction basically performs all the steps of initializing a local variable in each thread, accumulating that thread's sum in it, and then combining these local variables into a single shared variable, in just a couple of lines of code. Internally, reduction is implemented by creating a private copy of each list item for every implicit task, as if the private clause had been used. Each copy is initialized to the identity value of the operator (0 for '+').
The sum += i we saw in the code is a shorthand (compound) assignment. Reduction accepts several equivalent ways of writing the update: sum += a[i]; sum = sum + a[i]; and sum = a[i] + sum; all express the same operation. The same goes for subtraction, multiplication and the other supported operators, except that for subtraction the reduction variable must stay on the left of the operator. Division is not one of OpenMP's predefined reduction operators. When operators have the same precedence, the associativity of shorthand operators is always right to left, just like plain assignment.
In mathematics, addition and multiplication of real numbers are associative. In floating-point arithmetic, however, addition and multiplication are not associative because of rounding errors. OpenMP's reduction does not compensate for this, since the order in which the partial results are combined is unspecified, and leaves it to the programmer. This is why you might see a difference in results between a serial and a parallel implementation of the same calculation.
Reduction is useful when we need to perform some operation on a large number of operands and save the result in a single variable. Here are some reduction use cases. We can use reduction to add, subtract or multiply a large number of operands. For multiplication, each thread's private copy of the reduction variable is automatically initialized to 1. Reduction can also be applied to logical and bitwise operations. We can also calculate the minimum and maximum, where the private copies are initialized to the largest representable value and the smallest (most negative) representable value respectively.
That’s all for this video. See you again in the next one!

Comments • 3

@mohammedimran1280 3 years ago (+7)
after watching reduction, my syllabus also got reduced:))
@dummyaccount8328 3 years ago
Thank you for this series, really helping alot
@anlmaral1762 2 years ago
Thank you!
