- 97 videos
- 442,264 views
BigData Thoughts
India
Joined Dec 7, 2015
Bigdata Thoughts is a channel focused on Bigdata technologies and cloud solutions.
I will be sharing my experience of building different use cases on the cloud, the challenges faced, and the different solutions. I will also talk about the technology options, architecture, and design principles for each solution.
What is AI and data science
What is AI
Evolution of AI
What is ML
What is Data Science
Real world application
50 views
Videos
All you need to know about Spark Monitoring
467 views · 2 months ago
All you need to know about Spark Monitoring - Ways to Monitor - WebUI - History Server - REST API - External Instrumentation
What is generative AI
229 views · 5 months ago
What is AI - What is generative AI - Large language models (LLMs) - Use cases - Challenges
Stream Processing Fundamentals
248 views · 5 months ago
Stream Processing Fundamentals - What is stream processing - Stream and batch combination - Benefits - Challenges - Design considerations
Evolution of Data Architectures in last 40 years
433 views · 6 months ago
Evolution of Data Architectures - The Landscape - RDBMS - Data warehouse - Data lake - Why data lakes? - Data lakehouse
Spark low level API Distributed variables
372 views · 9 months ago
Different APIs offered by Spark - What are low level APIs? - Why are they needed? - Types of low level API - What are distributed variables? - Distributed variable types - Broadcast variables - Why are broadcast variables better? - Accumulators
Spark low level API - RDDs
447 views · 9 months ago
Different APIs offered by Spark - What are low level APIs? - Why are they needed? - Types of low level API - What is RDD? - Internals of RDD - RDD API - Types of RDD - Creating RDDs - Transformations on RDD - Actions on RDD
Spark structured API - Dataframe and Datasets
908 views · 10 months ago
Spark structured API - Dataframe and Datasets - Structured and unstructured APIs - Dataframe and Datasets - Row Object - Schema - Column - Column as logical tree - Dataset - when to use Dataset
Spark structured API - Dataframe
857 views · 11 months ago
This video explains: - High level structured API DataFrame - How Spark executes user code - All the steps needed to create a DAG
Spark Architecture in Depth Part2
2.2K views · 11 months ago
Spark Architecture in Depth Part 2 - Spark Architecture - Spark APIs - Transformations vs actions with examples - End to end example to explain Spark execution
Spark Architecture in Depth Part1
3.8K views · 1 year ago
Spark Architecture in Depth - Driver - Executor - Cluster Manager - Data frame - Partition - Transformations - Narrow - Wide
Top 3 file formats frequently used in bigdata world
672 views · 1 year ago
Top 3 file formats frequently used in bigdata world
What are Metadata Driven Architectures ?
1.8K views · 1 year ago
What are Metadata Driven Architectures ?
How to crack Bigdata Engineer Interviews
1.8K views · 1 year ago
How to crack Bigdata Engineer Interviews
Wonderful explanation
Really appreciate your hard work. Thank you for the great explanation.
thanks
Hello Shreya, can you make a hands-on video on data ingestion into AWS S3?
Can a node/thread have more partitions than the number of executors? If yes, where is the partition count information stored?
Very good session, ma'am. If it were shown practically it would be even more useful. Thank you for your efforts.
Thanks
Thank you for this video.
Thanks
I'm getting confidence in Spark because of you. Thanks so so much!
Thanks
To summarize: what data marts are to a data warehouse, data mesh is to a data lake.
How did you make such a good visual explanation? Which tool did you use to draw the sketches? Please guide 🙏
Are data mesh and Snowflake the same? Are data mesh and Microsoft Fabric the same?
Thanks, appreciate it. Is there a plan to post practical videos around Spark performance tuning?
Thank you for sharing your thoughts.
This is not exactly an answer.
Please do videos with sample data sets so that it would help for hands on
Seems it is PaaS, as mentioned on the Microsoft website.
Really, thanks for the good and in-depth explanation.
Thanks
Let's say there is no change in the records the next day. Does the data get overwritten again with the same records?
No, we only take the new differential data when we do CDC.
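As a toy illustration of the CDC idea in the reply above (plain Python with hypothetical records and a hypothetical `id` key; the actual pipelines would use Spark or a CDC tool):

```python
# Toy change-data-capture: keep only the records that are new or changed
# compared to the previous day's snapshot, keyed by record id.
def differential(previous, current):
    prev_by_id = {r["id"]: r for r in previous}
    return [r for r in current if prev_by_id.get(r["id"]) != r]

yesterday = [{"id": 1, "val": "a"}, {"id": 2, "val": "b"}]
today = [{"id": 1, "val": "a"}, {"id": 2, "val": "B"}, {"id": 3, "val": "c"}]

print(differential(yesterday, today))      # changed id=2 and new id=3
print(differential(yesterday, yesterday))  # no changes -> []
```

When nothing changed, the differential is empty, so nothing is rewritten — which matches the answer above.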
This is excellent and valuable knowledge sharing. One can easily tell these trainings come from deep personal hands-on experience and not mere theory. Great work.
thanks
Thank you, please also post some practical videos around the same topic.
Thank you for sharing thoughts
First one to monitor the notification from you
Thanks
How do I join a small table with a big table when I want to fetch all the data from the small table? The small table has ~100k records and the large table has ~1 million records. `df = smalldf.join(largedf, smalldf.id == largedf.id, how='left_outer')` runs out of memory, and I can't broadcast the small df, I don't know why. What is the best approach here? Please help.
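Not an authoritative answer to the question above, but the broadcast-hash-join idea it touches on can be sketched in plain Python (hypothetical rows and column names; in PySpark the analogous hint is `pyspark.sql.functions.broadcast`):

```python
# Toy broadcast-style hash join with left-outer semantics: build a hash map
# from the small table (as if it were broadcast to every worker), then stream
# the large table once, never materializing it in memory.
def left_outer_join_small(small, large):
    small_by_id = {row["id"]: row for row in small}  # the "broadcast" side
    matches = {}  # small id -> list of joined rows
    for big_row in large:
        s = small_by_id.get(big_row["id"])
        if s is not None:
            matches.setdefault(s["id"], []).append({**s, "big_val": big_row["val"]})
    out = []
    for s in small:  # left-outer: keep unmatched small rows with a null value
        out.extend(matches.get(s["id"], [{**s, "big_val": None}]))
    return out

small = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}]
large = [{"id": 1, "val": 10}, {"id": 3, "val": 30}]
print(left_outer_join_small(small, large))
```

One possible reason the broadcast was refused (an assumption, not verified against the poster's job): in a left outer join the preserved (left) side generally cannot be the broadcast/build side, so hinting `broadcast(smalldf)` on the left of a left outer join may be ignored or fail.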
18/april/2024
your videos on spark are hidden gems
Thanks
❤
❤
Nice explanation
Thanks
At the start of the video I was so happy seeing all the diagrams, but later I got fully confused, it felt complicated, and I didn't understand well 😢
I wish I could give 1000 likes. You’re an excellent teacher!
Thanks
Nice explanation
Thanks
Found it helpful. You could go a bit slower though; I had to stop and rewind a few times.
What a wonderful explanation, to the point... thank you.
Thanks
Good playlist for Spark ruclips.net/p/PL1RS9FR9qIPEAtSWX3rKLVcRWoaBDqVBV
Thanks
Just wow, a very simple explanation of a complex cluster overview. Thanks.
Thanks
Best explanation I have ever come across on RUclips. Watching all the parts... Thank you for explaining it so smoothly.
Thank you for sharing thoughts!
That was very well explained. Thank you for putting this together. One question though: do you really think data modelling should be done on the Gold layer? I don't think so, because Gold datasets are just business-level aggregates suited to particular business consumption needs, whereas the Silver layer is the warehouse in the Lakehouse. That is where modelling should be done, if needed.
Thank you so much. All the videos are very clear and effective.
Thanks
Thank you for sharing your thoughts.
Thanks
Finally it became clear to me after reading here and there. Thank you.
One of the most helpful sessions!
Thanks
Nicely explained, thank you. Looking forward to learning more around this topic.
Thanks
well explained!!!
Well explained!!!
well explained!!
Thanks
well explained
Thanks
Nicely explained, thanks. It's helping a lot.
Kindly do a similar simple series for Dataproc and also BigQuery.
Thank you for the detailed explanation. However, the problem I faced with reading dates prior to 1900 does not resolve even after setting all the mentioned properties. Does anyone have a working example that solves the issue of reading dates prior to 1900? Below is the code I added, but it did not work:

```python
conf = sparkContext.getConf()
conf.set("spark.sql.legacy.parquet.datetimeRebaseModeInRead", "CORRECTED")
conf.set("spark.sql.legacy.parquet.datetimeRebaseModeInWrite", "CORRECTED")
conf.set("spark.sql.datetime.java8API.enabled", "true")
```
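Not a verified fix for the comment above, but one thing worth checking: `SparkContext.getConf()` returns a copy of the configuration, so `set` calls on it after the context exists have no effect on the running session. A sketch that applies the same properties when the session is built instead (path hypothetical, pyspark assumed installed; this is a config fragment, not a tested solution):

```python
from pyspark.sql import SparkSession

# Set the rebase confs at session-build time so they apply to every read,
# rather than mutating the copy returned by SparkContext.getConf().
spark = (
    SparkSession.builder
    .config("spark.sql.legacy.parquet.datetimeRebaseModeInRead", "CORRECTED")
    .config("spark.sql.legacy.parquet.datetimeRebaseModeInWrite", "CORRECTED")
    .getOrCreate()
)
df = spark.read.parquet("path/to/pre-1900-dates.parquet")
```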
Very good information 🎉
Thanks