How to Automate Anything with Python and LLMs | TDE Workshop

What polars does for you - Ritchie Vink

EuroSciPy 2023 - Keynote: Polars

Florida rapper Julio Foolio shot, killed after posting birthday party location on Instagram

CRAWFISH MUKBANG w/ SANTEA & JAYCO | JUICY STORY TIMES & MORE

RED ONE | Official Trailer

Why I Switched From Pandas to Polars | TDE Workshop

The Data Entrepreneurs

Просмотров 9 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 26 июн 2024
In this workshop, Ben breaks down the 3 reasons he has permanently switched from Pandas to Polars and goes through a demo with code.
Led by: Ben Feifke
More Resources
TDS Article: / the-3-reasons-why-i-ha...
Example Code: github.com/bfeif/personal-web...
Udemy Course: www.udemy.com/course/data-ana...
Podcast Episode: www.inspiringcomputing.com/21...
Connect with Ben
LinkedIn: / benjamin-feifke
Medium: / benfeifke
Twitter/X: / benfeifke
Homepage: benfeifke.com/
Keep up with The Data Entrepreneurs!
🎥 RUclips: / @thedataentrepreneurs
👉 Discord: / discord
📰 Medium: / the-data
📅 Events: lu.ma/tde
🗞️ Newsletter: the-data-entrepreneurs.ck.pag...
Overview - 0:00
About Ben - 0:29
What's Polars - 3:19
Reason 1: .list namespace - 5:30
Reason 2: .scan_parquet() and .sink_parquet() - 11:48
Reason 3: Data-oriented Programming - 16:16
Demo - 20:58
Some Questions - 34:52
What’s Next? - 36:48
Q&A - 39:00

Комментарии • 19

@user-iz5rp4fl2q 9 месяцев назад ⁺²
Great Job Ben! 👍
@feifa13 9 месяцев назад
Thanks Ilia!
@smellypunks Месяц назад ⁺¹
It is a shame that the lazy API is so entangled into the API. Might be nice to write generic code which then has the option to switch on the lazy API with one single change. I don't like the idea of having to rewrite the whole codebase to switch between lazy and eager. I question if that was a good design decision from polars. - Side note please always upload videos in 1080p
@ShawhinTalebi Месяц назад ⁺¹
Here's my solution: cmd+f "scan_" replace with "read_" 😂
P.S. I'm on Mac
@spikeydude114 8 месяцев назад ⁺²
Although I see the benefits of Polars. I haven't had enough obstacle with Pandas for my workflows. I don't deal datasets that exceed memory and I think currently I can extend my memory limit using Dask ... but looking forward to the development of Polars and will likely adopt once it has more support!
@virushk 8 месяцев назад
Same situation here. I find Pandas and Dask to be sufficient tools for my workflows
@JOHNSMITH-ve3rq 7 месяцев назад ⁺²
Chatgpt knows pandas much better. For exploratory work probably not an issue. But if shopping something to prod and want to keep it very fast and minimise system resource then polars seems a better choice.
@samuelswatson 7 месяцев назад ⁺¹
To me the appeal is the coherence of the API and the superior execution model. But the ecosystem disadvantages associated with using a much less popular library are substantial.
@signoc1964 7 месяцев назад ⁺¹
@@samuelswatson but polars has a to_pandas() method, so the disadvantages is easily overcome, so its more like if you your doing simple things, then its unneccesary to bring in polars. We replaced a lot advanced elt(not etl) with polars. 16 000 lines of sql code done with the main transfroms done in polars instead, for this task it's excellent and translated really well, and a lot of stuff is easier to to in polars than in sql for example. Doing the same in pandas is a nightmare. Translating advanced sql code to pandas is a hard job.
@samuelswatson 7 месяцев назад ⁺¹
@@signoc1964 That seems to me to be the best use case for Polars (replacing complex SQL in transformation pipelines, especially because of its composability), so it's cool to hear another testimonial for its success in that context.
@ravishmahajan9314 7 месяцев назад
Can Polars replace pyspark Or hadoop?
@TheDataEntrepreneurs 7 месяцев назад
Good question. Here’s a response from Ben.
“I’m not entirely sure tbh. i'm pretty sure pyspark is more scalable (e.g. > 1 TB data), but polars is better for data processing on your local machine (e.g. < 1 TB). i don't think Polars has so much stuff yet like pyspark does for distributed computing, whereas that is pretty much what pyspark was built for afaik.”
@user-yj3mf1dk7b 2 месяца назад
man, we can read.
why read everything on the screen?
@DarrenSaw 8 месяцев назад ⁺⁶
Pandas is a massive mess. It's very easy to write very poor code in Pandas but to write it well is not that intuitive, Matt Harrison has written some great stuff, but it's not that easy to learn.
Polars is way better and improving all the time. It's much easier to write and way quicker. The lazy API is a thing of beauty.
@TheDataEntrepreneurs 8 месяцев назад ⁺¹
I'm looking forward to using Polars more in my own workflow -Shaw
@MartyAckerman310 7 месяцев назад
I agree, Pandas' learning curve was steeper for me than R. But I've kind of settled on a consistent workflow(.loc[:,['col']] instead of ['col'], and dotchaining) that minimizes the surprises.
@signoc1964 7 месяцев назад ⁺¹
One problem with polars though is that "pandas" developers then to write "polars" code like they write pandas code, and to some extent it is possible which gives people a bad example, since a couple of those. Polars becomes like pandas then executing in serial instead of parallell.

Следующие

Автовоспроизведение

How to Automate Anything with Python and LLMs | TDE Workshop

How to Automate Anything with Python and LLMs | TDE Workshop

What polars does for you - Ritchie Vink

What polars does for you — Ritchie Vink

EuroSciPy 2023 - Keynote: Polars

EuroSciPy 2023 - Keynote: Polars

Florida rapper Julio Foolio shot, killed after posting birthday party location on Instagram

Florida rapper Julio Foolio shot, killed after posting birthday party location on Instagram

CRAWFISH MUKBANG w/ SANTEA & JAYCO | JUICY STORY TIMES & MORE

CRAWFISH MUKBANG w/ SANTEA & JAYCO | JUICY STORY TIMES & MORE

RED ONE | Official Trailer

RED ONE | Official Trailer

we tried to make a carousel cafe with the new kit in the sims 4 | cozy bistro kit

we tried to make a carousel cafe with the new kit in the sims 4 | cozy bistro kit

Giles Weaver & Ian Ozsvald - Pandas 2, Dask or Polars? Tackling larger data on a single machine

Giles Weaver & Ian Ozsvald - Pandas 2, Dask or Polars? Tackling larger data on a single machine

DuckDB vs Pandas vs Polars For Python devs

DuckDB vs Pandas vs Polars For Python devs

Why I chose Python & Polars for Data Analysis

Why I chose Python & Polars for Data Analysis

Stanford's FREE data science book and course are the best yet

Stanford's FREE data science book and course are the best yet

Thomas Bierhance: Polars - make the switch to lightning-fast dataframes

Thomas Bierhance: Polars - make the switch to lightning-fast dataframes

From Pandas to Production: Best Practices for using "Effective Pandas" with Matt Harrison

From Pandas to Production: Best Practices for using "Effective Pandas" with Matt Harrison

Do these Pandas Alternatives actually work?

Do these Pandas Alternatives actually work?

Nico Kreiling: Raised by Pandas, striving for more: An opinionated introduction to Polars

Nico Kreiling: Raised by Pandas, striving for more: An opinionated introduction to Polars

DuckDB & Python | End-To-End Data Engineering Project (1/3)

DuckDB & Python | End-To-End Data Engineering Project (1/3)

100 Дней в ОБНОВЛЕНИИ 1.21 "ХИТРЫЕ ИСПЫТАНИЯ" в Майнкрафт ХАРДКОР

100 Дней в ОБНОВЛЕНИИ 1.21 "ХИТРЫЕ ИСПЫТАНИЯ" в Майнкрафт ХАРДКОР

Суд над Бишимбаевым: прямая трансляция из зала суда. 26 июня 2024 года

Суд над Бишимбаевым: прямая трансляция из зала суда. 26 июня 2024 года

МАЛЫШКА И ГИТАРИСТ притворились НОВИЧКАМИ на ПРОСЛУШИВАНИИ в ХОР | ПРАНК #4

МАЛЫШКА И ГИТАРИСТ притворились НОВИЧКАМИ на ПРОСЛУШИВАНИИ в ХОР | ПРАНК #4

Новый выпуск «ПОЗАВЧЕРАШНИХ НОВОСТЕЙ» уже на канале

Новый выпуск «ПОЗАВЧЕРАШНИХ НОВОСТЕЙ» уже на канале

LISA - ROCKSTAR (MV Teaser)

LISA - ROCKSTAR (MV Teaser)

как видит мама vs что происходит на самом деле ( я тебя не буду ругать )

как видит мама vs что происходит на самом деле ( я тебя не буду ругать )

Вот так и снимай с ним милые видео 😂 #шортс #аняпокров #shorts #pokrov

Вот так и снимай с ним милые видео 😂 #шортс #аняпокров #shorts #pokrov