note to the editor: please stop cutting away from the code so quickly. we're trying to follow along in the code based on what she's saying. at that moment, we don't need to cut back to the shot of her face. we can still hear her voice in the voiceover.
I think the time the code was displayed while she went through each line was quite sufficient. The code is very readable (except for the typo where "words" suddenly became "splitLines"), and reading the code while she explains would most likely only distract you from the explanation she is giving, IMHO. If you are looking for a more practical solution, I would recommend just pausing the video and reading the code before she explains it step by step.
Fully agree. The quick switching was very annoying when trying to read the code. It would also be helpful if the editor could highlight the active line she is talking about.
Yeah thank you.
The RDD API is outmoded as of Spark 2.0 and in almost every use case you should be using the Dataset API. You lose out on a lot of improvements and optimizations using RDDs instead of Datasets.
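For anyone curious, here's a minimal sketch of the same word count on the Dataset API (the input path is a placeholder; "value" is the default column name textFile gives you):

```scala
import org.apache.spark.sql.SparkSession

object DatasetWordCount {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder.appName("DatasetWordCount").getOrCreate()
    import spark.implicits._

    val counts = spark.read.textFile("input.txt")   // Dataset[String], one row per line
      .flatMap(_.split(" "))                        // one row per word
      .groupBy("value")                             // textFile's default column name
      .count()                                      // rows of (value, count)

    counts.show()
    spark.stop()
  }
}
```

Unlike RDD transformations, these operations go through the Catalyst optimizer, which is where most of the improvements you mention come from.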
I understood some of those words.
ahh.. so refreshing after taking a week break from dev work and staying away from dev topics. Lol, I love our field. Like music to my ears
Can you do Apache Kafka next? How do they compare?
Pretty sure there's a typo in that code. "splitLines" doesn't exist and is probably supposed to be words.map(...) instead.
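For reference, a sketch of what the corrected program presumably looks like (the file name and variable names are guesses from what's on screen):

```scala
import org.apache.spark.{SparkConf, SparkContext}

object WordCount {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("WordCount"))

    val lines  = sc.textFile("input.txt")               // RDD[String], one element per line
    val words  = lines.flatMap(line => line.split(" ")) // RDD[String], one element per word
    val pairs  = words.map(word => (word, 1))           // where the "splitLines" typo was
    val counts = pairs.reduceByKey(_ + _)               // sum the 1s per distinct word

    counts.collect().foreach(println)
    sc.stop()
  }
}
```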
Thank you for teaching an old man new things.
Wow congrats on the content. You were able to explain it in a concise, yet logical and detailed way. nice
A great example of how programming languages are a reasonably efficient mechanism to communicate sections of program and how natural language really is not.
Computerphile will be excited to learn that tripods exist.
These data ones are really good! Keep them coming!
Is there any meta analysis on the usefulness of bigdata analysis? How often do jobs get run that either produce no meaningful data or don't produce any statistically significant data?
Brady Please make a video on Kubernetes
feels like this video is four years too late ... :-/
For anyone interested, although the documentation is awful for Apache Flink and it doesn't support Java versions beyond 8, it at least lets you do setup on each node. Spark does not have any functionality for running one-time setup on each node, which makes it infeasible for many use cases. These distributed processing frameworks are quite opinionated and if you're not doing word count or streaming data from one input stream to another with very simple stateless transformations in between you'll find little in the documentation or functionality. They're not really designed for use cases where you have a parallel program with a fixed size data source known in advance and want to scale it up as you would by adding more threads, but more for continuous data processing.
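For what it's worth, Flink's hook for this is RichFunction.open(), which runs once per parallel task instance before any records arrive, so heavyweight setup happens on every worker that runs the task. A minimal sketch (the model class and path are made-up placeholders, not real Flink API):

```scala
import org.apache.flink.api.common.functions.RichMapFunction
import org.apache.flink.configuration.Configuration

// Made-up stand-in for whatever heavyweight resource needs per-worker setup.
class SomeModel(path: String) {
  def score(s: String): String = s.toUpperCase
}

class ScoringMapper extends RichMapFunction[String, String] {
  @transient private var model: SomeModel = _

  // Runs once per parallel task instance, before any records arrive.
  override def open(parameters: Configuration): Unit = {
    model = new SomeModel("/opt/model.bin") // expensive one-time setup
  }

  override def map(value: String): String = model.score(value)
}
```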
It's so clear and easy after the explanation! I will be waiting for more vids about clustering and distributed computing)
More of these, please. More big data.
She refers to an early example. Did I miss that video? Otherwise, nicely done. Love learning about distributed computing.
Search for MapReduce on Computerphile
Was so excited to see this posted :) I'm a Cassandra professional.
She's damn good at explaining and easy to listen to, any plans of having her host other episodes?
(sorry for "her" I don't know her name).
I wish she also talked a little about Spark's ability to deal with data streams
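For anyone curious, Structured Streaming reuses essentially the same code for unbounded input. A minimal sketch based on the standard socket example (host and port are placeholders):

```scala
import org.apache.spark.sql.SparkSession

object StreamingWordCount {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder.appName("StreamingWordCount").getOrCreate()
    import spark.implicits._

    // Read an unbounded stream of lines from a socket.
    val lines = spark.readStream
      .format("socket")
      .option("host", "localhost")
      .option("port", 9999)
      .load()

    // Same word-count logic as the batch version, applied to a stream.
    val counts = lines.as[String]
      .flatMap(_.split(" "))
      .groupBy("value")
      .count()

    // Continuously print the running totals as new data arrives.
    val query = counts.writeStream
      .outputMode("complete")
      .format("console")
      .start()

    query.awaitTermination()
  }
}
```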
It's a bit silly, but I can't understand 100% because English isn't my first language. I hope someone could add English subtitles to every video on this channel, because I find Computerphile videos easy to understand thanks to the excellent explanations.
Thank you so much. This was an incredible explanation
woohooo rebecca is back
"RDD is basically an array distributed across the cluster" - genius
Typo at line 32: `splitLines` should presumably be `words`?
Good old Scala.
Really interesting video! I have done some MapReduce before, but I haven't come across Apache Spark.
Great explanations. Of course there are many things going on behind the scenes, but good overview.
Yeah, horizontal scaling and modular data handling, similar to the Hadoop and Hive framework libraries.
Please give time measurements comparing single node with multi node execution. What is the overhead?
Sorry for redundancy, just verifying my understanding. Do I understand it correctly that (when running this example in a cluster) collect runs the 'reduceByKey' against the results on each node, and then reduces to a final result. Say on Node 1 I have count of word 'something' = 5 , on Node 2 I have count of word 'something' = 3, then collect combines from those two nodes into a count of 'something' = 8, And so on...?
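Almost: the per-key merging is actually done by reduceByKey itself (with a partial, map-side combine on each node before the shuffle); collect() just ships the already-merged results back to the driver. A spark-shell sketch of the merge you describe:

```scala
// Two partitions standing in for your two nodes.
val pairs = sc.parallelize(
  Seq(("something", 5),  // "node 1" partial count
      ("something", 3)), // "node 2" partial count
  numSlices = 2)

// reduceByKey combines within each partition, then shuffles and merges
// the partials across partitions: 5 + 3 = 8.
val merged = pairs.reduceByKey(_ + _)

// collect() only ships the finished (word, count) pairs to the driver.
merged.collect().foreach(println) // (something,8)
```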
Where are the extra bits?
can anyone please suggest books to learn about distributed systems?
What is the architectural difference between Spark and MapReduce?
This was very helpful
What programming language is she using??
Do a video explaining AES!
The first time I learned about Apache Spark, I was looking up documentation for another framework named Spark.
Thank you for the great summary.
1:19 Floppy drives? xD LOL
Really good summary, thank you!
Would have liked it to be a bit more in-depth and technical, was too high level.
I really love your videos. I would like to know if it is possible to watch them in French, or at least with subtitles, so that we can follow.
Looks like you could do a search engine in that.
Good video :)
Great video
Interesting video!
Ohhh, she is using VSCode! I love VS Code :D
Apache Flink next please
@3:16 line 12 is wrong. Great review otherwise! 👍
Please show some drawings or animations of the data going back and forth between the nodes.
thanks
Thanks, nice vid.
More like this!!!!!!
Hi friends!
I study bioinformatics, handling txt files many gigabytes in size, and this could be so handy.
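It is. A minimal sketch for that use case: scan a multi-gigabyte text file without loading it into memory (the path and length filter are made-up placeholders):

```scala
val lines     = sc.textFile("/data/reads.txt")     // lazily read, one partition per block
val longLines = lines.filter(_.length > 100)       // keep only long lines
println(s"long lines: ${longLines.count()}")       // triggers the actual scan
longLines.saveAsTextFile("/data/long_lines_out")   // writes one part-file per partition
```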
The content is nice and well explained, BUT the camera work and editing are bad. We are not here for a documentary; the over-the-shoulder computer shot is completely useless and distracting. If you want to use your cuts, use something like picture-in-picture, but please let us focus on the code!!
First? Does this matter? No. Go build a cluster and be happier!
What a useless video: slow down, explain slowly, assume the audience doesn't know much.
First? sorry, I've never watched a video when it said it was posted "25 seconds" ago, and so it would be weird if I were actually first.
Good Video, I feel like I stink at data analysis, but I'm more experienced than most in my organization so...
totally lost me 3 min into this video.
00000001
She's mumbling in the beginning... can't really hear her (American-born English speaker)
21st!!!
first!
First 😂