What Is RDD In Spark | Session 1 | RDD Basics | Resilient Distributed Dataset
HTML-код
- Опубликовано: 17 окт 2024
- This session will make you learn basics of RDD (Resilient Distributed Dataset) in spark. This covers the following topics-
1) What is RDD?
2) How to create RDD?
3) Operations supported by RDD in spark
4) Lazy evaluation concept in RDD
Here is complete RDD tutorial playlist-
• Playlist
Honestly, you are one of the best one that knows how to teach, because most of the video clips on RUclips have no idea of fundamental principles of teaching! Please do more of this kind of work especially in the area of massively confusing materials about Spark, we need people like you on RUclips. Thanks so much, and please let me know how to find more of the clips have you down so far. Unfortunately the link that you provided does not work and it is broken, would you please leave a connecting link to get in touch with you.
Thanks
nice , i listen to many videos i did not understand but the way explain the topic very clear and simple,thank you so much make me easy RDD
Really good explanation. Thank you for simplifying this concept for me.
Could you please help me in below query.
.suppose I need to create a application where I need to load multiple files (CSV) using DF , if any of the file structure ids different from our defined structure(we defined in beginning) then we need to redirect those files into some error folder and load the file only with correct structure..how will we achieve this in spark.
Ex . file columns should be id,name,roll but in some files id,name,city,subject.. i need to load the files which are having id,name,roll columns only
I really appreciate what you explained it's very easy to understand please share data frame Vs dataset 👍
Can please post data frame Vs dataset,ur explained Cristal clear
Best lecture on spark
Thanks for the explanation! Can a RDD be stored as a file or stored on a hard drive (and not just in a memory)? If yes, then a link towards an article describing it will be very helpful.
What is RDD first ?
You have explained how the data is stored in RDD but what is the architecture of RDD and why does we need that ?
spark.apache.org/docs/latest/rdd-programming-guide.html#resilient-distributed-datasets-rdds
very good explanation with example. Thank you bro
Hi sir, please,, plz plz ... upload scala tutorials for the beginners
Brilliant!
Well done.
nice
You should first explain what exactly RDD is rather than explaining how it is made and other things.Please clear the basics of the viewer first.Thank you :)