Efficient CI/CD for dbt, Clearcover

  • Published: 13 Sep 2024

Comments • 8

  • @kshitijpathak8646 • 2 years ago • +1

    Does the cloning time also depend on the number of tables/records? If so, can you give an approximate figure? I am interested in understanding whether this approach (of cloning prod and then dropping) is valid for tables that would potentially hold 1B+ records.

  • @chrism3790 • 3 years ago

    I wonder if the manifest feature was available back when they solved this?

  • @tomhallett7439 • 3 years ago • +1

    At 22:00, Mark mentions the "dbt clone" takes 10 minutes (out of the 20 minutes). I was under the impression that Snowflake zero-copy cloning was "instant". Is this wrong? Or is the Snowflake clone part instant, but then you are spending 9 minutes doing other cleaning/transforms on the cloned data to get it ready for the automated tests?

    • @simianinc • 2 years ago • +2

      It's not instant. We've seen times of 40 minutes and more. I'm guessing it's setting up pointers to all the micro-partitions in the original DB, but that's speculation. We have logged this with Snowflake, and the response is it takes what it takes.
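
For context on the clone step being discussed: the clone-then-drop CI pattern described in the talk presumably reduces to something like the Snowflake SQL below. The database names are illustrative, not taken from the talk, and the comments reflect the (speculative) explanation above that the duration comes from the metadata operation itself.

```sql
-- A minimal sketch of the clone-then-drop CI pattern discussed above.
-- Database names are illustrative, not taken from the talk.

-- Zero-copy clone: copies metadata pointers to existing micro-partitions
-- rather than the data itself, so storage cost is near zero, but the
-- metadata operation is not instantaneous for large databases.
CREATE OR REPLACE DATABASE ci_build_123 CLONE analytics_prod;

-- ... dbt builds and automated tests run against ci_build_123 here ...

-- Tear down the clone once CI finishes, leaving prod untouched.
DROP DATABASE IF EXISTS ci_build_123;
```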

  • @franckleveneur676 • 3 years ago • +1

    800 models!? That does not make sense. It reminds me of PeriscopeData reports built by each analyst. You probably need to look into building fact and dimension tables.

    • @arcadia485 • 3 years ago

      If you have hundreds of source tables and you build staging models for each of them, it's not implausible to end up with 800 models.

    • @chrism3790 • 3 years ago • +2

      They probably already do; they just break the processing up into sub-models. That's what we do to keep bigger models understandable. Fact and dimension models commonly have tens of supporting models (see the sketch after this thread).

    • @franckleveneur676 • 3 years ago

      @chrism3790 I don't remember the speaker mentioning facts and dimensions. Maybe he's using dbt to join raw tables and create tables, or maybe each analyst can create their own models. By doing so, they (the analysts) won't be able to unify the data properly and come up with standardized KPIs at the company level.
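
To illustrate the exchange above about model counts: a staging layer that wraps each source table in a thin model, plus fact/dimension marts built on top of it, can plausibly add up to hundreds of models. The file names, sources, and columns below are hypothetical, a minimal dbt/SQL sketch rather than Clearcover's actual project.

```sql
-- models/staging/stg_claims.sql (hypothetical)
-- One thin staging model per source table; with hundreds of sources,
-- this layer alone can account for most of an 800-model count.
select
    id         as claim_id,
    policy_id,
    status,
    created_at
from {{ source('claims_app', 'claims') }}
```

```sql
-- models/marts/fct_claims.sql (hypothetical)
-- A downstream fact model composed from several staging models via ref()
-- (stg_policies is assumed to be another staging model like the one above).
select
    c.claim_id,
    c.status,
    p.policy_number,
    c.created_at
from {{ ref('stg_claims') }} as c
left join {{ ref('stg_policies') }} as p
    on c.policy_id = p.policy_id
```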