Apache Iceberg Overview (Jan 2024 Edition) - Architecture, Ecosystem, and more!

  • Published: 10 Feb 2025
  • The latest version of our Apache Iceberg overview. Here are some relevant links:
    Make a Data Lakehouse on Your Laptop Tutorial
    bit.ly/am-drem...
    Apache Iceberg 101 Article
    bit.ly/am-iceb...
    Getting Started with Dremio
    bit.ly/am-drem...

Comments • 10

  • @anandsharma213 10 months ago +1

    Lovely presentation!

  • @abdullahomar9041 1 year ago +2

    This is very interesting and informative

  • @esmob4140 9 months ago +1

    Great explanation!

    • @Dremio 9 months ago

      Thank you

  • @Gfghb-u7w 11 months ago +2

    Looks like this cannot be used with e.g. image files?

    • @TusharChoudhary-mf8df 10 months ago

      What is your use case? This assumes that the data is of the relational kind.

  • @emonymph6911 10 months ago

    Why did Dremio go with Iceberg over Hudi? Hudi seems more intuitive and flexible with the timeline approach.

    • @Dremio 10 months ago

      www.dremio.com/blog/exploring-the-architecture-of-apache-iceberg-delta-lake-and-apache-hudi/

    • @emonymph6911 10 months ago +1

      @Dremio It's exactly that article which made me ask the question. ^.^
      Don't get me wrong, I'm trying Dremio right now in local Docker and it looks amazing. But I still thought Hudi with its timeline is more suitable for BI, considering that dates tie in well with graphs, event streams, and Data Vault methodology. Going to watch the XTable presentation at Subsurface, looking forward to it!
      PS: Alex, your customer care videos and docs are the best in the world for a software application. I like how you guys go at a moderate pace and cover terminology in the tutorial before showing the ropes. It makes for a low barrier to entry. Please keep that up. 10/10 waves!

    • @Dremio 10 months ago

      @emonymph6911 I think this may be answering the opposite question, but this article may be helpful too: www.dremio.com/blog/dremios-commitment-to-being-the-ideal-platform-for-apache-iceberg-data-lakehouses/
      I do think there is a tremendous benefit to the reusability of Iceberg's metadata structure, along with its partition evolution and hidden partitioning features, which are unique to the format.