DuckDB: Hi-performance SQL queries on pandas dataframe (Python)

  • Published: 27 Oct 2024

Comments • 26

  • @pritamdodeja 1 year ago +1

    Excellent content! Your channel is bound to grow if you keep putting out stuff like this!

    • @SamuelChan 1 year ago

      Hey Pritam, thank you! So kind of you!

  • @GabsFerreira 1 year ago

    Thank you so much for this content, Samuel! I'm getting started with DuckDB and it helped me a lot.

    • @SamuelChan 1 year ago

      Really glad to hear! Thank you!

  • @uoitauz3296 1 year ago

    Very nice video. I didn't expect it from such a small channel. I really liked the summary and visualisation of the other resources at the beginning.

    • @SamuelChan 1 year ago

      Thank you for taking the time out to leave a really encouraging comment! 🙏🏼

  • @torque6389 9 months ago

    Great video! Thank you for the timestamps and sql against pandas!

  • @AdamCysec 1 year ago

    Very insightful and informative video! I plan to use DuckDB in my next project.

    • @SamuelChan 1 year ago

      Yeah go for it! Been using it for a few of my hobby projects and very pleased with the user experience so far. If you like the idea of SQLite you’ll see where DuckDB fits!

  • @Davidkiania 1 year ago

    This is amazing. I learned so much, and it clarified a lot of the fundamental knowledge. Much appreciated.

  • @mo-ec6wu 1 year ago

    Excellent video Samuel - thanks

  • @buggingbee1 2 years ago +1

    This is what I have been looking for. As a Python user, I have always wondered why you would need SQL when pandas can do all the work in fewer lines.
    I think DuckDB is the point where both sides can get the best of both worlds.
    I watched your video to the end and wonder if there is a script to turn a dataframe into a SQL table? Would love to see that work.

    • @SamuelChan 2 years ago

      Thank you Bedy, I knew it was always going to be a question, so I spent some time explaining the mental models before diving into the code. Pandas can do a lot with fewer lines, until it can’t. Imagine doing a few left joins, for example. A DBMS will do it with an optimised query plan and can even execute it on disk for you. Pandas on its own won’t.
      For your follow-up question on data frame to SQL table, pd.DataFrame.to_sql() may be what you’re looking for?
      pandas.pydata.org/docs/reference/api/pandas.DataFrame.to_sql.html

  • @noahwinter4780 1 year ago

    Great Vid - thanks a lot!

  • @nnnnnnnnnnnoable 2 years ago

    Is there a way to connect DuckDB to a database, so that the pandas functions 'pd.read_sql()' and 'pd.to_sql()' are not needed, to improve execution times in Jupyter notebooks?

    • @SamuelChan 2 years ago +1

      I think you might be thinking of ODBC’s use cases and not DuckDB’s. If you have an external db somewhere and want to execute SQL queries against it while staying in SQL (no conversion back and forth between pandas dataframes, no serialization), then you’d want something like ODBC.
      Because with DuckDB the point is to not have this external database at all, yes?

  • @30Y 1 year ago +1

    Hella cool video

  • @AbdelhalimhafedhDAHOU 1 year ago +1

    Thanks a lot!

  • @miltondiaz2531 6 months ago

    Excellent. A+1. Can a FoxPro database be connected to DuckDB with Python?

  • @ularkadutdotnet 1 year ago +1

    nice 👍👍👍

  • @ardgbani 3 months ago

    😅 the pens