Apache Iceberg Overview (Jan 2024 Edition) - Architecture, Ecosystem, and more!
- Published: Feb 10, 2025
- The latest version of our Apache Iceberg Overview. Here are some relevant links:
Make a Data Lakehouse on Your Laptop Tutorial
bit.ly/am-drem...
Apache Iceberg 101 Article
bit.ly/am-iceb...
Getting Started with Dremio
bit.ly/am-drem...
Lovely presentation!
This is very interesting and informative
Great explanation!
Thank you
Looks like this cannot be used with e.g. image files?
What is your use case? This assumes the data is relational.
Why did Dremio go with Iceberg over Hudi? Hudi seems more intuitive and flexible with the timeline approach.
www.dremio.com/blog/exploring-the-architecture-of-apache-iceberg-delta-lake-and-apache-hudi/
@Dremio It's exactly that article which made me ask the question. ^.^
Don't get me wrong, I'm trying Dremio right now in local Docker and it looks amazing. But I still think Hudi with its timeline is more suitable for BI, considering that dates tie in well with graphs, event streams, and the Data Vault methodology. Going to watch the XTable presentation at Subsurface, looking forward to it!
PS: Alex, your customer care videos and docs are the best in the world for a software application. I like how you guys go at a moderate pace and cover the terminology in the tutorial before showing the ropes. It makes for a low barrier to entry. Please keep that up. 10/10 waves!
@emonymph6911 I think this may be answering the opposite question, but this article may be helpful too: www.dremio.com/blog/dremios-commitment-to-being-the-ideal-platform-for-apache-iceberg-data-lakehouses/
I do think there is a tremendous benefit to the reusability of Iceberg's metadata structure, along with its partition evolution and hidden partitioning features, which are unique to the format.