- Видео 40
- Просмотров 624 481
Introduction to Parallel Programming in OpenMP
Добавлен 1 июн 2017
Видео
Locks
Просмотров 5 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
The OpenMP Shared Memory Consistency Model
Просмотров 4,2 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Advanced Task handling
Просмотров 2,8 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Matrix Multiplication using tasks
Просмотров 4,7 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Parallel LU Factorization
Просмотров 8 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Understanding LU Factorization
Просмотров 4,6 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Recursive task spawning and pitfalls
Просмотров 3,8 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Completion of tasks and scoping variables in tasks
Просмотров 2,9 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Accessing variables in tasks
Просмотров 3,3 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Task queues and task execution
Просмотров 5 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Introduction to tasks
Просмотров 9 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Matrix-Matrix operations (Matrix-Matrix Multiply)
Просмотров 16 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Matrix-Vector operations (Matrix-Vector Multiply)
Просмотров 14 тыс.7 лет назад
To access the translated content: 1. The translated content of this course is available in regional languages. For details please visit nptel.ac.in/translation The video course content can be accessed in the form of regional language text transcripts, books which can be accessed under downloads of each course, subtitles in the video and Video Text Track below the video. Your feedback is highly ...
Vector-Vector operations (Dot product)
Просмотров 9 тыс.7 лет назад
Vector-Vector operations (Dot product)
Distributing for loops and reduction
Просмотров 11 тыс.7 лет назад
Distributing for loops and reduction
Manual distribution of work and critical sections
Просмотров 8 тыс.7 лет назад
Manual distribution of work and critical sections
Computing sum: first attempt at parallelization
Просмотров 12 тыс.7 лет назад
Computing sum: first attempt at parallelization
Shared Memory Consistency Models and the Sequential Consistency Model
Просмотров 22 тыс.7 лет назад
Shared Memory Consistency Models and the Sequential Consistency Model
OpenMP: thread private variables and more constructs
Просмотров 12 тыс.7 лет назад
OpenMP: thread private variables and more constructs
OpenMP: Scoping variables and some race conditions
Просмотров 10 тыс.7 лет назад
OpenMP: Scoping variables and some race conditions
Program Memory with Multiple threads and Multi-tasking
Просмотров 14 тыс.7 лет назад
Program Memory with Multiple threads and Multi-tasking
OpenMP: A parallel Hello World Program
Просмотров 45 тыс.7 лет назад
OpenMP: A parallel Hello World Program
Interconnection networks in Distributed Memory architectures
Просмотров 23 тыс.7 лет назад
Interconnection networks in Distributed Memory architectures
Are there systems in real life that present, from programming point of view, a Distributed Memory system as if it were a Shared Memory system.
Thank u sir
This is an amazing course!
Great job explaining the concepts clearly!
Smart Aliens is a project with great potential feelings excited to be part of this community #ETH, #ETHEREUM, #ARBITRUM, #ARB, and #ALTCOIN.
Turn down the decibels pls.
Amazing thanks
So what is the solution if sequential consistency model dictate so much constraints on the compiler? Will we just live with the decrease on performance?
what about the cache data that is needed in a certain tast? when tast is free to be executed at any time, the caches might not have required data.
sir my question is can we avoid a for loop and call the omp_get_num_threads() just once? , I was trying to do it, but i failed, or it is explained in upcoming videos?
Thank you sir , the video was very informative
which language is this?
Very helpful 👍
Really helpful!!! Thanks a lot.
what is the intro music its so addictive !
ruclips.net/channel/UCdBufqFbamF48hdZ-dmSgnw 🍁🚀Make education easy learn more ❤️
Impressive video!
Nice lecture!
Thank You sir
please share the source code
#include <omp.h> #define ARR_SIZE 600 #define STEP_SIZE 100 #include <stdio.h> int main() { /* Computing Array sum using tasks */ int i; int sum = 0; #pragma omp parallel { int a[ARR_SIZE]; #pragma omp for for (i = 0; i < ARR_SIZE; i += STEP_SIZE) { int j, start = i, end = i + STEP_SIZE - 1; printf("Computing Sum(%d,%d) in thread %d of %d ", start, end, omp_get_thread_num(), omp_get_num_threads()); #pragma omp task { int psum = 0; printf("Task computing Sum(%d,%d) in thread %d of %d ", start, end, omp_get_thread_num(), omp_get_num_threads()); for (j = start; j <= end; j++) { psum += a[j]; } #pragma omp critical sum += psum; } printf("Sum=%d ", sum); } } }
Does this you tube channel has all subjects of cse in gate exam???
Thanks Sir.
Superb !!
Where do we find the slides for the course?
I thought it would be difficult but it is damn easy.
I have a doubt, dot is a shared variable so I don't think the program will produce the required output as there will be race condition involved, If somebody could clear this, that would be very greatful
Yeah. reduction(+:dot) should be put up as a clause for the code to work.
Explained well
Same thumbnail colors represent same week lectures.
yes
This is how concurrenthashmap is implemented in java by having lock per segment.
Amazing Lecture series. IF you could provide more details on cache coherence. IT would be cherry on the top.
And thanks to all students for asking good questions. That answers many of my questions too.
How to have different number of threads across different parallel region. If i have 4 threads in first parallel region and 3 threads in second parallel region, which one going to be killed (first or fourth one).
Didnt get why it took 100 ns for first 4 bytes but just 5 ns for next 4 bytes. Should not it be same for all bytes ?
You are pipelining the data transfer through the bus so essentially all the memory 4 byte memory transfers took 100ns but 2nd memory transfer occurs 5ns after the 1st and so on
wow this is amazing.
again great lecture!
Excellent lecture Sreries, If you have knowledgw of OS, Computer Architecture then learning Prallel programming is very intresting
Conteudo incrivel. Nao entendo o idioma, mas o codigo e muito show.
One of the most interesting lectures I've seen in a while. I could only imagine how lucky your students are. Kudos!!
really awesome
Just a word of caution! #pragma omp master does not have an implied barrier either on entry to, or exit from, the master construct.
thank you for the subtitles!
Very good lecture. Thank you very much.
What is a branch instruction?
Branching statements are like if statement, for, while loop etc. Think of it as if(flag1) {flag2 = true}. The second statement "flag2=true" is dependent on the execution of flag1 instruction. if flag2 is in pipeline and flag1 turns out to be false, second statement will be thrown away. Which basically means wastage of cpu cycles spent on processing flag=true.
Really helpful explanation.
you are the Best teacher sir , make some more videos on computer organization and architecture please sir
Great intro, thanks!
@Micheal Kleyman You look like an ad bot.
@Aaron Ronald And you are even worse.
Great explanation sir 💯
Thank you sir, this really helps me a lot.
Awesome way of teaching , Thanks a lot !!
The problem is with recursion. How brilliant the student must be. I thought all the 3 same problems that might occur but recursion was no where in my mind.
Isn't the bisection bandwidth of a 2D Torus sqrt n ?