How to Migrate Your Tables to Apache Iceberg

Поделиться
HTML-код
  • Опубликовано: 14 мар 2023
  • In Software development developers use Continuous Integration and Continuous Deployment as a technique to automate the integration of new code safely and quickly. A challenge in the data world is often integrating new data ingested from batches and streams with the same safety and speed. As the data world adopts more and more of these software best practices in a great trend we like to call ""Data as Code"" how can we begin automating our data integration in ways that are also safe and fast.
    In this talk we'll discuss:
    - How to isolate your data ingestion
    - How to Audit your ingested Data
    - How to integrate your new data
    - How to automate these steps
    ""An Iceberg based data lakehouse has several benefits in not just query performance and cost, but in scalability, consistency and more. Migrating your existing data lake or data lakehouse to Iceberg doesn't have to be difficult, but there are some important decisions and considerations. This presentation hopes to give you a high-level roadmap to planning your successful Apache Iceberg migration journey:
    - Why migrate to Apache Iceberg
    - Which Catalog should I use
    - How to migrate data in-place
    - How to migrate by restating my data
    - How to plan the phases of my migration
  • НаукаНаука

Комментарии •