Dagster
Dagster
  • Видео 139
  • Просмотров 271 057
Unlocking the Power of Dagster and dbt
Timeline:
0:00 Why Dagster and dbt work well together
1:16 Dagster UI benifits
1:55 How to setup your dbt project in Dagster
2:32 Learning Resources
Unlock the Full Potential of Your dbt Projects with Dagster! 🚀
Discover why Dagster is the ultimate platform for deploying and managing your dbt projects. In this video, we dive into:
How Dagster and dbt create a powerful synergy for your data platform
-The benefits of deploying dbt in Dagster, including rich metadata and minimal vendor lock-in
-Exclusive Dagster Plus features that simplify data platform management
-Step-by-step guide on integrating dbt with Dagster
-How dbt model checks, dependencies, and incremental models work seamlessly with Dagster...
Просмотров: 408

Видео

Orchestrating Flexible Compute for ML with Dagster and Modal
Просмотров 562День назад
Project Repository: github.com/dagster-io/dagster-modal-demo In this Deep Dive, we showcase how Dagster and Modal combine to orchestrate scalable workloads with an intuitive developer experience to build scalable, robust pipelines. We discuss how Dagster and Modal can be used together to accelerate machine learning workflows. Dagster, a data pipeline orchestration tool, manages pipeline state a...
Building a True Data Platform: Beyond the Modern Data Stack - a Dagster Deep Dive with Pedram Navid
Просмотров 758Месяц назад
The Modern Data Stack offers countless tools but often creates hard-to-manage pipelines. Pedram Navid, Head of Data Engineering and Developer Relations at Dagster Labs shows how to transform disjointed tools into a unified, observable data platform with default high-quality data. 00:00 Introduction 00:35 Agenda 00:52: The Unkept Promise of the Modern Data Stack 06:50: The Data Platform Engineer...
Dagster, SDF, & the Evolution of the Data Platform (A Dagster Deep Dive)
Просмотров 901Месяц назад
Explore how the combined strengths of Dagster’s orchestration and SDF’s transformation capabilities can enhance your developer experience, streamline your data pipelines, reduce costs, and enhance data quality and reliability. Key Takeaways: - Unified Workflow Management: Seamlessly integrate and manage your data workflows. - Enhanced Data Quality: Ensure consistent and reliable data through ad...
Building Reliable Data Platforms (A Dagster Deep Dive)
Просмотров 1,2 тыс.Месяц назад
Data quality issues are the silent killers of today’s data-driven initiatives. Poor data quality leads to incorrect insights, flawed decisions, operational inefficiencies, and ultimately, financial loss. In this Dagster Deep Dive, Colton Padden discusses strategies for integrating comprehensive data quality checks into data pipelines using #Dagster, providing practical insights and real-world e...
Introducing Dagster+ : Beyond traditional orchestration
Просмотров 1,7 тыс.3 месяца назад
The Dagster Labs team introduces you to Dagster , a data orchestration platform that pushes the boundaries of traditional orchestration. With Dagster you can weave data quality, data cataloging, DataOps best practices, and advanced observability into your platform. 00:00 Intro 04:42 Data Catalog 09:57 Data Reliability 16:28 Branch Deployments with Change Tracking 19:40 Dagster Insights 26:06 Pa...
Dagster+ Launch: Welcome
Просмотров 4313 месяца назад
Welcome to the launch of Dagster , a data orchestration solution that aims to redefine the category. Pete Hunt, CEO of Dagster Labs shares the new features that your team can leverage to accelerate your data engineering work: - Data Cataloging - Data Reliability Checks (Quality, Freshness, Schema) - Branch Deployments with Change Tracking - Dagster Insights
Dagster+ overview
Просмотров 4,8 тыс.5 месяцев назад
Dagster is a powerful orchestration solution for building and running modern data platforms. It goes well beyond traditional orchestration tools, building on the rich metadata collected during runs to build high-order logic and observability into your data pipelines. Unlike workflow-oriented orchestrators (Airflow, Prefect) Dagster's framework is compatible with the other asset-oriented tools i...
Dagster+ implementation partner update
Просмотров 2025 месяцев назад
Eric Chernoff, Head of Partnerships at Dagster Labs, provides an update on the growth of the professional services support network and the hyperscaler integrations for the Dagster ecosystem.
Dagster+ Insights
Просмотров 1805 месяцев назад
When data platforms are small and manageable, it's easy to track the cost and performance related to the system. As the platform scales, Dagster provides you with the operational observability to track cost and performance metrics such as the deterioration of data quality over time.
Dagster+ Branch Deployments with Change Tracking
Просмотров 3155 месяцев назад
To speed up innovation, data engineering teams need to embrace software engineering best practices. Dagster provides a unique capability for data teams to rapidly iterate, deploy, and test changes in ephemeral staging environments. Jamie DeMaria, software engineer at Dagster Labs, walks us through how Branch Deployments can benefit your team, and how the new Change Tracking feature lets you zer...
Dagster+ Data Reliability
Просмотров 2615 месяцев назад
Delivering data assets that your organization can trust is the main priority of your data platform. With the introduction of data reliability checks in Dagster, your team can weave quality, schema, and freshness checks right into the operations of your pipelines, and build in alerts and operational logic based on the outcomes of these tests.
Dagster+ Data Catalog
Просмотров 6055 месяцев назад
Dagster Data Catalog
Dagster+ launch event: wrapping up
Просмотров 1155 месяцев назад
Dagster launch event: wrapping up
Introducing Dagster+
Просмотров 5 тыс.5 месяцев назад
Introducing Dagster
Dagster+ - Join the launch event on April 17th
Просмотров 7856 месяцев назад
Dagster - Join the launch event on April 17th
Dagster and the Data Mesh (A Dagster Deep Dive)
Просмотров 1,4 тыс.6 месяцев назад
Dagster and the Data Mesh (A Dagster Deep Dive)
Pipeline Sandboxing: Boosting developer productivity with LakeFS & Dagster
Просмотров 6036 месяцев назад
Pipeline Sandboxing: Boosting developer productivity with LakeFS & Dagster
Thinking in Partitions (A Dagster Deep Dive)
Просмотров 1,9 тыс.7 месяцев назад
Thinking in Partitions (A Dagster Deep Dive)
Dagster Demo Recording
Просмотров 1,2 тыс.7 месяцев назад
Dagster Demo Recording
Configuration & Resources (A Dagster Deep Dive)
Просмотров 2,4 тыс.7 месяцев назад
Configuration & Resources (A Dagster Deep Dive)
Flexible Scheduling with Automation in Data Engineering (A Dagster Deep Dive)
Просмотров 2,2 тыс.7 месяцев назад
Flexible Scheduling with Automation in Data Engineering (A Dagster Deep Dive)
Exploring the new Dagster UI
Просмотров 2,3 тыс.8 месяцев назад
Exploring the new Dagster UI
Dagster Shorts: Thinking in Partitions
Просмотров 1,5 тыс.9 месяцев назад
Dagster Shorts: Thinking in Partitions
Data Quality as part of the Data Pipeline
Просмотров 2,4 тыс.9 месяцев назад
Data Quality as part of the Data Pipeline
Dagster's run UI & debugging features
Просмотров 6059 месяцев назад
Dagster's run UI & debugging features
Why data engineers are moving their data pipelines off Airflow and onto Dagster
Просмотров 7639 месяцев назад
Why data engineers are moving their data pipelines off Airflow and onto Dagster
Building a trusted and productive data platform with Software-defined Assets - a fireside chat.
Просмотров 7589 месяцев назад
Building a trusted and productive data platform with Software-defined Assets - a fireside chat.
Embedded ELT: Save your budget and simplify your data platform with Dagster Embedded ELT.
Просмотров 2,7 тыс.10 месяцев назад
Embedded ELT: Save your budget and simplify your data platform with Dagster Embedded ELT.
How sanas.ai runs neural network inference on millions of audio files.
Просмотров 87310 месяцев назад
How sanas.ai runs neural network inference on millions of audio files.

Комментарии

  • @adityanjsg99
    @adityanjsg99 6 дней назад

    please tell us how to add dbt to existing pipeline.

    • @dagsterio
      @dagsterio 6 дней назад

      Thanks for asking. We have an entire (free) online course dedicated to this topic: dagster.io/blog/dagster-university-presents-dagster-and-dbt

  • @adityanjsg99
    @adityanjsg99 7 дней назад

    Please tell us how to insert with an existing pipeline.. DBT for me is a second step , first step being ingestion.

    • @dagsterio
      @dagsterio 6 дней назад

      Hi, we suggest running through the Dagster University courses. These will provide you with a complete example of a pipeline, incorporating dbt: courses.dagster.io/

  • @seanpool9125
    @seanpool9125 10 дней назад

    This is so cool, excited to tinker with this design pattern!

  • @Mandalorx10
    @Mandalorx10 10 дней назад

    This was a real treat to watch gang, you wouldn't happen to be able to post the repo in the description... and maybe next time have a higher contrast theme on the editor for us with poor eyesight watching on our phones :D

    • @dagsterio
      @dagsterio 7 дней назад

      Thanks, I'm glad you enjoyed this deep dive. Good call on the contrast of the editor's theme, this is something we'll improve for next time. If you are still looking for the link to the code repository, you can find it here: github.com/dagster-io/dagster-modal-demo

  • @AbdiqaniM015
    @AbdiqaniM015 19 дней назад

    but sir i have a dagster-dbt project and when i test my dbt runs it's working perfectly, but the integration is where i have a problem, could you help me?

  • @rtuin123
    @rtuin123 26 дней назад

    "Obviously we have to have code. It's 2024, we're doing code. It's over, no more no code, no low code. It's just not going to work." 15:55

  • @jpanduezadlf
    @jpanduezadlf Месяц назад

    When you write ACCESS_TOKEN, what does that stand for? Is it your password? Is it the end of a link, why is it so bad to put it in your source code?-

  • @JohannesMüller-d3k
    @JohannesMüller-d3k Месяц назад

    If I understand this correctly this is a very impressive framework. I'll try it out with our dbt project since it seems to integrate with dbt as well and consider it for dagster at a later point. Is there a Slack/discord community as well? If you include a link to your website in the description it saves us the detour via google :D

  • @MarvinAsamoah-z1i
    @MarvinAsamoah-z1i Месяц назад

    Hello Ampcus Inc. This is IBU agency a managing and consulting agency that help businesses manage all their social media handles. So we simply post content, handle clients interactions across all platforms and a bonus package that we will give more details about when we schedule a meeting. Please let us know if you have any questions or concerns.

  • @Nobrezando
    @Nobrezando Месяц назад

    I'm always looking forward to these Dagster videos. The fastest way to get the new things that they did. I was also a lot curious about this SDF

  • @adrianf.9491
    @adrianf.9491 Месяц назад

    I have to ask, how do you get in the Terminal the full path by just typing the beginning of each folder name?

  • @YEM_
    @YEM_ 2 месяца назад

    Is dlt embedded in Dagster now?

  • @kludgedude
    @kludgedude 2 месяца назад

    Integrate or replace slurm?

  • @alirezaghaffari16
    @alirezaghaffari16 2 месяца назад

    badly explained

  • @alirezaghaffari16
    @alirezaghaffari16 2 месяца назад

    instead of dagit install dagster-webserver from now on

  • @ledinhanhtan
    @ledinhanhtan 2 месяца назад

    I love using Dagster, especially when I can juggle the assets in my mind before writing any code. With Airflow, I have no mental picture of that.

  • @kosmylo
    @kosmylo 3 месяца назад

    What is the difference between dagit and dagster-webserver?

  • @ShaneZarechian
    @ShaneZarechian 3 месяца назад

    Is FreshnessPolicy being deprecated?

  • @JohnCF
    @JohnCF 3 месяца назад

    Came here excited to learn about new features in the latest Dagster version. But it looks like you've decided to widen the feature-gap between the open source offering and the enterprise offering... even though this will be a maintenance burden on your team... causing delays in "backporting" features and bugfixes to the open-source version going forward. Kinda disappointed...

    • @dagsterio
      @dagsterio 3 месяца назад

      Thanks for the comment @JohnCF. If you go through the enhancements introduced with this Dagster+ launch, you will see that many of them (in fact, all of them except for Dagster Insights) benefit both the open-source and the commercial offerings. The data cataloging capability is a good example of that. From our perspective, these new additions are moving us forward on both the OSS and the Dagster+ roadmaps. In addition, by providing more value to those organizations that adopt Dagster+ we are able to guarantee the longevity and accelerated development of Dagster Open-Source.

    • @JohnCF
      @JohnCF 3 месяца назад

      @@dagsterio Does that mean what's mentioned at 7:15 about column lineage is available in open-source too? The phrasing definitely sounded like it's only available for Enterprise users...

    • @dagsterio
      @dagsterio 3 месяца назад

      @@JohnCF Correct. Column level lineage is a Dagster+ feature and is not available in Dagster Open-Source.

  • @jordanfox470
    @jordanfox470 3 месяца назад

    Is there native support for mapping time based partitions to static partitions defined like "today", "rest of month", "rest of year", "rest of history"? This is a common setup for power bi datasets, which can be represented as assets in dagster. Would be nice to take advantage of auto materialize policies.

    • @dagsterio
      @dagsterio 3 месяца назад

      Dagster does not natively support mapping time-based partitions to static partitions like "today," "rest of month," "rest of year," and "rest of history" directly out of the box. However, you can achieve similar functionality by defining custom partitioning schemes and using the appropriate partition mappings. You can define custom partitions using StaticPartitionsDefinition for static and TimeWindowPartitionsDefinition for time-based partitions.

  • @tobiaspucher9597
    @tobiaspucher9597 4 месяца назад

    Awesome!!! Please more!

  • @flogzer0
    @flogzer0 4 месяца назад

    I'm fairly sure this sales guy never used Airflow

  • @AbhishekAgrawal-dv1id
    @AbhishekAgrawal-dv1id 4 месяца назад

    If the requirement is to get the data from S3 files into a BQ table but perform some validations on those files before inserting into the table, how would we do it with Embedded ELT? We are using Dagster OSS heavily and looking to use embedded-elt for getting data from files, tables and APIs..

    • @AbhishekAgrawal-dv1id
      @AbhishekAgrawal-dv1id 4 месяца назад

      @tim-at-elementl Thanks, Tim. How would you rate dlt for my use-case? I see dlt is far more mature..

    • @AbhishekAgrawal-dv1id
      @AbhishekAgrawal-dv1id 4 месяца назад

      Yeah, I am also leaning towards doing something like this. Thanks for this, Tim. Would you suggest using a similar approach to pull data from a different database? We'd still need to run minor validations on the incoming data, though. Would dlt help here at all?

  • @vikramtatke5930
    @vikramtatke5930 5 месяцев назад

    As a person with just 2 years of experience my mind was blown watching this. I am a single person writing code in my department so I don't have any seniors to learn from but I'm leading a data engineering project that deals with terabytes of data and each request is multiple times larger than the server's RAM and multiple such requests need to be processed in parallel to complete stuff in time. Also, we have the tiniest possible budget to aggregate 25 to 30 columns and billions of rows every day. Also, we need to cut down on costs. This was super helpful.

  • @atulverma7783
    @atulverma7783 5 месяцев назад

    anyone notice silicon valley reference in screenshots

    • @dagsterio
      @dagsterio 5 месяцев назад

      Yep. We are big fans. Enjoy the Easter eggs! ;-)

  • @JohnoScott
    @JohnoScott 5 месяцев назад

    Seems to be an alternative to dbt docs and dbt Cloud Explorer?

    • @jarredcolli-k5d
      @jarredcolli-k5d 5 месяцев назад

      For some teams, definitely, although it can be complementary to dbt docs, because it sucks in some of the data via the dbt integration. Essentially becomes a super set of documentation

  • @armanuki38911
    @armanuki38911 5 месяцев назад

    You lost me at "cloud".

  • @JohnoScott
    @JohnoScott 5 месяцев назад

    Where is Nick Shrock ?

    • @dagsterio
      @dagsterio 5 месяцев назад

      Behind the camera, helping out with the teleprompter while recovering from an injury.

    • @JohnoScott
      @JohnoScott 5 месяцев назад

      Wishing him a speedy recovery then. We miss him on RUclips !

    • @schrockn
      @schrockn 5 месяцев назад

      👋 Right here! I just happened to be unable to participate in the recording session for this. Team killed it!

    • @JohnoScott
      @JohnoScott 5 месяцев назад

      @@schrockn yes they did. Keen to hear your take on all this Nick ; video from you soon ?

  • @fredguth1315
    @fredguth1315 5 месяцев назад

    What does Dagster+ mean for the open source version?

    • @dagsterio
      @dagsterio 5 месяцев назад

      Many of the enhancements in the 1.7 release benefit all users (Open-source and paid Dagster+ users). In general, the open-source solution gains more capabilities with each release both to support open-source users and to unlock more capabilities in Dagster+ which are built on top of core.

  • @minimapai
    @minimapai 5 месяцев назад

    Exciting

  • @Robay146
    @Robay146 5 месяцев назад

    Great presentation.

  • @Amapramaadhy
    @Amapramaadhy 6 месяцев назад

    Please update the gh repo when possible with the data mesh example. Multiple code locations seem super useful. Thanks

    • @dagsterio
      @dagsterio 6 месяцев назад

      Hi! I had to put it in a different repo to accommodate for running multiple code locations and not breaking our existing setup for the deep dive projects. The dedicated repo for the data mesh example can be found here! github.com/dagster-io/data-mesh-demo

    • @Amapramaadhy
      @Amapramaadhy 6 месяцев назад

      @@dagsterio much appreciated

  • @Jesufemi_O
    @Jesufemi_O 6 месяцев назад

    Hi Dagster team, great stuff here! I really enjoyed watching this!! is the demo code available in github?

  • @atkinsonr
    @atkinsonr 6 месяцев назад

    nice talk but the slides are hard to follow on here. Would be better if recorded with autofocus off, and white balanced to the projector screen.

  • @danielbartley516
    @danielbartley516 6 месяцев назад

    100%

  • @maxisqt
    @maxisqt 6 месяцев назад

    This is the coolest tech demo I've ever seen. I have wanted for so long to see an end-to-end analytics stack demo, or tutorial, and never found it. You just did it in 15 minutes, using free, open source tools I can run locally on my laptop. Absolutely incredible!

    • @dagsterio
      @dagsterio 6 месяцев назад

      Thanks. The Dagster capabilities are expanding with each new release.

  • @cornstarch4575
    @cornstarch4575 6 месяцев назад

    At around 8:20 you mention it's vulnerable to SQL injection - could I get more detail on that?

  • @zuesbenz
    @zuesbenz 6 месяцев назад

    fk the learning curve on this shit.

    • @krtmlry2719
      @krtmlry2719 5 месяцев назад

      is it really hard? Im planning to learn this too. lmao

  • @quinnherden
    @quinnherden 6 месяцев назад

    That's interesting. Do you expand on this somewhere?

    • @dagsterio
      @dagsterio 6 месяцев назад

      You might find this blog by Sandy interesting: dagster.io/blog/dagster-ml-pipelines. - Otherwise you can listen to the entire Podcast featuring Sandy here: datastackshow.com/podcast/machine-learning-pipelines-are-still-data-pipelines-with-sandy-ryza-of-dagster/

  • @rembautimes8808
    @rembautimes8808 6 месяцев назад

    I work in a financial institution and there is definitely a need for a reliable and resilient data process. Look forward to finding out more about Dagster. I also agree, no point building something flaky and have it barf 🤢

  • @rembautimes8808
    @rembautimes8808 6 месяцев назад

    Yes I’m excited. Thanks

  • @hungnguyenthanh4101
    @hungnguyenthanh4101 6 месяцев назад

    link repo please.

    • @dagsterio
      @dagsterio 6 месяцев назад

      Sorry, one of our redirects got broken - here is the link: github.com/dagster-io/devrel-project-demos/tree/main

    • @dagsterio
      @dagsterio 6 месяцев назад

      More specifically for this session: github.com/dagster-io/devrel-project-demos/tree/main/dagster-deep-dives/dagster_deep_dives/resources_and_configurations

  • @jakobullmann7586
    @jakobullmann7586 6 месяцев назад

    I don’t know… this video is one year old, but still uses the legacy DAG syntax from Airflow 1, rather than the TaskFlow API from Airflow 2. So the syntax doesn’t make a difference anymore. Regarding the coupling to environment: Airflow has different executors. The KubernetesPodOperator is not the only way to run on a Kubernetes environment. The rest may or may not be true. Probably there are many things that Dagster does better than Airflow. But I’m disappointed that you would publish such a biased comparison.

  • @SinaJahan-e1w
    @SinaJahan-e1w 7 месяцев назад

    @dagsterio Do you have the source of the demo avaialble somewhere?

    • @dagsterio
      @dagsterio 7 месяцев назад

      All the code for the demos from the deep dives are in this repository ( github.com/dagster-io/devrel-project-demos )! This one in particular is in the partitions directory.

    • @Jahaniam
      @Jahaniam 6 месяцев назад

      @@dagsterio unfortunately it is private/ link is broken.

    • @dagsterio
      @dagsterio 3 месяца назад

      @@Jahaniam Sorry, the final parenthesis got included by RUclips in the URL - try this: github.com/dagster-io/devrel-project-demos

  • @shaounakn
    @shaounakn 7 месяцев назад

    I like what you folks have done with this product.

    • @dagsterio
      @dagsterio 7 месяцев назад

      Thanks - there is a lot more in store coming next month!

    • @quinnherden
      @quinnherden 7 месяцев назад

      +1 I am rooting for you guys. Thank you for all of your hard work

    • @dagsterio
      @dagsterio 7 месяцев назад

      We appreciate it - thanks @@quinnherden !

    • @shaounakn
      @shaounakn 7 месяцев назад

      @@dagsterio Sure, thanks for making these sessions, these are really helpful.

  • @hungnguyenthanh4101
    @hungnguyenthanh4101 7 месяцев назад

    Please send me link Git repo on video

    • @dagsterio
      @dagsterio 7 месяцев назад

      Try: github.com/dagster-io/devrel-project-demos

  •  7 месяцев назад

    Joining other comments, I'd love to see more step-by-step tutorials and use cases. It took a few videos to grasp the concepts, and this one is a good one to start with. Docs are good, but videos are even better. I would love to see more of duckdb / dagster and ingestion cases.

  • @ChristopherFlesher
    @ChristopherFlesher 7 месяцев назад

    In 7:47 of the video you show using the Launchpad to configure assets... I can't figure out how to access this page?

    • @colton-dagster
      @colton-dagster 7 месяцев назад

      Hi @user-hs9lo5gh3r, the most common way to bring up this menu is to select an asset from the global asset lineage, and then in the top right where it says "Materialize selected...", open the dropdown menu and select "Open launchpad". Hope this helps!

  • @Amapramaadhy
    @Amapramaadhy 7 месяцев назад

    What’s with these shorts? Feels like a kid got hold of your social account! Stick to real content