Code along - build an ELT Pipeline in 1 Hour (dbt, Snowflake, Airflow)

Поделиться
HTML-код
  • Опубликовано: 28 дек 2024

Комментарии • 192

  • @twickAttack
    @twickAttack 10 месяцев назад +53

    Hey Jay, thank you for the video. I'd be happy to see you doing more ELT pipelines and focus on your thought's process ( I can watch longer format 1-2 hours) - why you do things in that way, why is it important and whatnot; and you can throw some explainers to anything else you do and the reason behind it. I think senior DE and others with experience do things bit automatically and it takes time for the newbies to pick up on those skills. So, your thought process for doing things instead of just doing the things is priceless for anyone watching, including me. Appreciate your video, dude :)

    • @jayzern
      @jayzern  10 месяцев назад +10

      Thank you! Will try to create more useful content

    • @miguelhermar
      @miguelhermar 9 месяцев назад

      Completely agree 😊

  • @Dev_Zyoom
    @Dev_Zyoom 10 месяцев назад +15

    honestly never knew about dbt and glad to learn it here thank you

  • @PhiNguyen-iz9go
    @PhiNguyen-iz9go 9 месяцев назад +10

    This code along session (starting from scratch with environment setup, codebase structure ...) is soooooooo helpful. Hope to see more examples like this. Keep up the work my man

    • @PhiNguyen-iz9go
      @PhiNguyen-iz9go 9 месяцев назад

      I watched the video "How you start learning Data Engineering..." and wondering that can you do a live coding that step through all those aspects (from SQL, command lines... to Kafka...) in 1 project? I think it would help a lot...

    • @jayzern
      @jayzern  9 месяцев назад +1

      Glad to hear it's helpful! 👍
      It's great to hear feedback on what type of live coding videos you find insightful. Will keep note on Kafka and Command lines

  • @vaibs2312
    @vaibs2312 13 дней назад

    I was struggling to simplify airflow and DBT integration and this tutorial really helped me get through the finish line. Thank you!

  • @diaconescutiberiu7535
    @diaconescutiberiu7535 9 месяцев назад +5

    Awesome video! I already recommended this to my entire team. Please make more like this, they are extremely helpful.
    Idea for next video: dbt for Snowflake (again) but with Data Vault 2.0 modeling. I would love to see the logic behind creating dim and fact tables, how you define the stg files for creating the hubs/satellites/links.

    • @jayzern
      @jayzern  9 месяцев назад

      Oof yea I did consider doing a Data Vault model where we showcase how hubs, satellites and links work but didn't think ppl would be interested. Thanks for raising 👍

  • @nguyenphan_0101
    @nguyenphan_0101 9 месяцев назад +4

    thank you so much for this tutorial. hope you have more videos in the future

    • @jayzern
      @jayzern  9 месяцев назад

      Thanks man!

  • @EddieVanWilder
    @EddieVanWilder Месяц назад +1

    This video has the exact answer to my questions as I'm diving into data modeling for analytics. I'm sure everyone doing this for their first time that they will find this video super helpful.
    Would be cool to see dbt with Cosmos for smoother operation 👌
    EDIT: I was literally just getting into the Deployment part of the video, and there you introduce using Cosmos for Airflow. Kudos!!

  • @peekknuf
    @peekknuf 10 месяцев назад +2

    Extremely useful content, i especially liked live googling and debugging parts

    • @jayzern
      @jayzern  10 месяцев назад +1

      Thank you for the support! Hope other people find it useful too.

  • @nothan_nah
    @nothan_nah 10 месяцев назад +4

    Thanks for sharing this dbt tutorial! It’s definitely super hot rn and useful to learn. 🎉

  • @ChrisUK70
    @ChrisUK70 Месяц назад +2

    When ETL came about the Cloud did not exist, I was writing shell scripts and SQL almost 30 years ago to do ETL. Useful video thanks!

  • @aritra1414
    @aritra1414 Месяц назад

    Concise and to the point. It was very helpful. Thanks, please show more end to end complex projects like this

  • @Neeraj-NN
    @Neeraj-NN Месяц назад

    amazing vedio very clear to explain How snowflake,dbt,airflow and cosmos are all linked together to provide data transformation and the orchestration.

  • @kmonish9119
    @kmonish9119 7 месяцев назад +7

    I have been struggling with dbt and airflow for a long time. For some reason I could not connect the dots. Having some mixture of knowledge - I landed on this tutorial and it just glued all my scattered dots well. Thanks Jayzern!!! Really appreciate the efforts :)

  • @heck0782
    @heck0782 10 месяцев назад +2

    Great video and explanation. we need more videos from you.

  • @rileylee2866
    @rileylee2866 2 месяца назад

    very good session, helped me get a much more concrete idea about how those tools look like and how they work together

  • @mohitupadhayay1439
    @mohitupadhayay1439 4 месяца назад +8

    AN ABSOLUTE GOLDMINE OF AN INFORMATION WHICH NOT AY UDEMY OR RUclips TUTOR HAS PROVIDED YET!

  • @dominicaleung7329
    @dominicaleung7329 3 месяца назад +1

    Thank you very much. This is very nice and concise tutorial, exactly what I need.

  • @HandsomeSmells
    @HandsomeSmells 5 дней назад

    brilliant tutorial, thanks for this!

  • @DivineSam-w6m
    @DivineSam-w6m 5 месяцев назад +2

    This video is like a gold mine for building a portfolio especially for someone starting out as a Data Engineer like me!... Manny Thanks and Kudos to you!.. Love from India

    • @adityakulkarni3798
      @adityakulkarni3798 3 месяца назад +1

      Hey how did you use snowflake? Did you buy it because it shows me that it is a paid software

    • @leabensoussan4178
      @leabensoussan4178 2 месяца назад

      @@adityakulkarni3798 I am wondering the same thing

  • @anikethdeshpande8336
    @anikethdeshpande8336 7 месяцев назад +5

    i'm new to snowflake, dbt and airflow,
    this is awesome tutorial, got to learn a lot
    thank you jayzern

  • @christophercampo9099
    @christophercampo9099 3 месяца назад

    Thank you, thank you THANK YOU! This was so helpful, easy to follow and made perfect sense.

  • @fun2badult
    @fun2badult 9 месяцев назад +1

    Can you please post more videos like this? Really appreciate it. Helps me understand the Dbt/Snowflake/Airflow a lot

    • @jayzern
      @jayzern  9 месяцев назад

      Yes sir am working on future videos right now!

  • @popalex
    @popalex 9 месяцев назад +1

    Great video.
    I would love to see a complex ETL pipelines.

  • @GeorgeNyamao
    @GeorgeNyamao 4 месяца назад +2

    Thanks @jayzern. This tutorial is awesome. I will be recommending it to folks who struggle with connecting dbt with any database engine.

  • @CosmicNomad
    @CosmicNomad 4 месяца назад

    This is such an amazing video @jayzern! The project taken was not overly complex but also not barebones and covered a lot of important stuff! Thanks for being thoughtful and including the code along link (else some of formatting issues would have bugged many newbies)!
    I think you should keep creating more videos as you are a good teacher. Only suggestion I have is may be include a bit more explanation, which will help beginners even more! Kudos!

  • @giovannimaia9652
    @giovannimaia9652 6 месяцев назад +1

    Please post more videos, your videos are awesome and very instructive

  • @Lhtokbgkmvfknv
    @Lhtokbgkmvfknv 2 месяца назад

    It's beautiful! Thx man!

  • @pavankumard5276
    @pavankumard5276 9 месяцев назад

    Need more content like this!!! Really amazing video. Just one suggestion I would like to make before diving into the coding part it would be better if you could provide a real world scenario and reference that while writing you code. Thanks

    • @jayzern
      @jayzern  9 месяцев назад +1

      Appreciate the feedback man 🙏 will try to incorporate more real-world context before and during the live coding part, that's a great idea

    • @pavankumard5276
      @pavankumard5276 9 месяцев назад

      @@jayzern thanks a lot, waiting for some more tutorials😃

  • @JohnS-er7jh
    @JohnS-er7jh 5 месяцев назад

    Thanks very much for posting this! Definately earned another subscriber/viewer

  • @rakshavishwakarma1811
    @rakshavishwakarma1811 8 месяцев назад +3

    thank you so much it is 100% worth and useful... expect some more videos in detail.... like prod deployment through git and git interation with airflow

  • @kenneth1691
    @kenneth1691 9 месяцев назад +1

    Thank you so much for this, I've been trying to learn how to do this and you helped me solve this
    Do you have trainings!!

    • @jayzern
      @jayzern  9 месяцев назад

      Thanks man! Yea I'm working on live trainings too so stay tuned 🙌

  • @benlahcensoufiane1589
    @benlahcensoufiane1589 21 день назад

    Thank you for this content

  • @santoshkumarchirra5895
    @santoshkumarchirra5895 26 дней назад

    Hi @jayzern, thanks for video. Is the airflow running singular tests as well? Where did we mentioned "dbt test" in the airflow ?

  • @mayconpires.oficial
    @mayconpires.oficial 7 дней назад

    Thank for rich content!

  • @melvin9993
    @melvin9993 6 месяцев назад +1

    Dude this is so good :)

  • @dogenature4801
    @dogenature4801 2 месяца назад

    Hi! really enjoy your tutorial, would like to see a tutorial how to create data CI/CD pipeline starting from pulling latest branch, running data test on staging, and deploy changes to production after test is complete since not lot of youtuber explaining this

    • @jayzern
      @jayzern  2 месяца назад

      This is actually a brilliant idea, thanks for the rec!

  • @uppinder
    @uppinder 4 месяца назад

    26:00 item_discount_amount is supposed to be negative because the macro defined it as such. I also checked the data on snowflake and they're all negative amounts. Did I miss something?

  •  2 месяца назад +2

    Thank you for the video jayzern. When I push code into Git, should I push code of dbt only, or I need to push all code of dbt-dag ?

  • @maikerodrigo4249
    @maikerodrigo4249 5 месяцев назад

    Great tutorial, i've learning a lot thanks!

  • @anduamlaktadesse9284
    @anduamlaktadesse9284 7 месяцев назад +2

    so supportive and completing the project .

  • @prajnaaddagarla9085
    @prajnaaddagarla9085 8 дней назад

    Jay good job 🎉

  • @ahmednasr3811
    @ahmednasr3811 7 месяцев назад +2

    Thanks bro for your efforts ❤

  • @oludelehalleluyah6723
    @oludelehalleluyah6723 7 месяцев назад

    I haven't a lot from this tutorial...
    Thank you

  • @梁喬萍
    @梁喬萍 6 месяцев назад

    love this! thanks for sharing this tutorial, very useful

  • @MalvinSiew
    @MalvinSiew 8 месяцев назад +2

    Hi Jay, thanks for the video. I'm having an issue connecting to Snowflake backend at the stage you first perform 'dbt run' @ 14:50 .
    This is the error I get:
    15:17:54 Encountered an error:
    Runtime Error
    Database error while listing schemas in database "dbt_db"
    Database Error
    250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting
    I've checked the profiles.yml file and all details are correct. Please help!

    • @parag2039
      @parag2039 8 месяцев назад +1

      facing the same issue!!!!!! can anyone please help I've restarted and tried everything possible to figure out but failed

    • @parag2039
      @parag2039 8 месяцев назад

      @MalvinSiew
      I solved one of the two errors I was facing. I did not have Git installed in my system. You can simply ask AI for prompts to guide you through the installation process.

    • @vitorcavalcante8495
      @vitorcavalcante8495 7 месяцев назад

      Had the same problem, when passing the account_value with 'dbt init' I wasn't able to connecto using the ccount url value, only with the second option which was the - value

    • @oreschz
      @oreschz 5 месяцев назад

      did u solve it? I have the same problem. what is the solution?

    • @AakashKumarDhal
      @AakashKumarDhal 5 месяцев назад

      @@oreschz could you solve it?

  • @DagStylez
    @DagStylez 6 месяцев назад +1

    Excellent tutorial!!!

  • @AkashKandarkar
    @AkashKandarkar 6 месяцев назад +1

    Amazingly explained 👌

  • @digitalnaturediaries
    @digitalnaturediaries 13 дней назад

    amazing tutorial

  • @thainguyenbalamquang386
    @thainguyenbalamquang386 6 месяцев назад +1

    Thank you, love your work

  • @i-see-right-through-you
    @i-see-right-through-you 5 месяцев назад

    well done! great tutorial!

  • @reneporto-ai
    @reneporto-ai 7 месяцев назад

    WOW! That's is amazing tutorial, thanks a lot.

  • @prasadatluri
    @prasadatluri 10 месяцев назад

    Great video Jay

    • @RohithPatelKanchukatla
      @RohithPatelKanchukatla 9 месяцев назад

      Hii mr.prasad garu are you data engineer too?

    • @prasadatluri
      @prasadatluri 9 месяцев назад +1

      @@RohithPatelKanchukatla Hi there. I am a Data Scientist

  • @AwakenByMe
    @AwakenByMe 5 месяцев назад +1

    WOW!! ,Thank you so much for this wonderful video, Please keep making dbt + airflow videos,
    I have one doubt, I can see that one task in airflow which is stg_tpch_orders have run + test in your dag, But it is not showing up in mine,
    Have you added any tests on stg_tpch_orders ? but maybe missed to show it into the video ?

    • @jayzern
      @jayzern  5 месяцев назад

      Hmm it's hard to tell without looking at ur code, but there is a generic test for stg_tpch_orders that looks at the relationship between fct_orders and stg_tpch_orders. Check your generic_tests.yml file to confirm
      Thanks for the support man!

  • @srikantaghosh2386
    @srikantaghosh2386 5 месяцев назад

    At 32:21, how did you copy the dbt folders to airflow project?

  • @KhanhLe-yv2gg
    @KhanhLe-yv2gg 20 дней назад

    Your guideline is a gem. But the airflow part is not very clear, i deep dive so many times to fix hahaha

  • @montoyescoful
    @montoyescoful 6 месяцев назад +1

    Hi Jay. Question: Once you have created the Fact table, how does this process work if I run it again? Is it going to append new records and update the existing ones? Or is it going to drop and create the Fact table over again?

  • @ozland7172
    @ozland7172 7 месяцев назад

    Hello.. thanks for the tutorial.
    I know airflow runs the tasks/dags however I cannot follow one thing; how do we determine the order of the action items at 35:36 within dbt (I believe it is determined on dbt side) since we have only one dag running on this example? I appreciate if anyone replies.

  • @ShreyasSureshDhamore
    @ShreyasSureshDhamore Месяц назад

    Hi I am trying your proect and got stuk here can you here
    21:32:24 Unable to do partial parsing because saved manifest not found. Starting full parse.
    21:32:25 Encountered an error:
    Compilation Error
    Model 'model.DATA_PIPELINE.stg_tpch_orders' (models/staging/stg_tpch_orders.sql) depends on a source named 'tpch.orders' which was not found

  • @albertcampillo
    @albertcampillo 4 месяца назад

    Hi @jayzern, thanks a lot for your video, really valuable content!

  • @wilcity
    @wilcity Месяц назад

    Great video! What text editor are you using?

  • @ThiagoSilva-vh9fy
    @ThiagoSilva-vh9fy 4 месяца назад

    couldn't run int_order_items.sql because it ruturns a strange error: it says: "The selection criterion 'int_order_items.sql' does not match any enabled nodes". And if aI run "dbt run" it says: " unexpected '.' in line 1" at 20:22

  • @southafricangamer7174
    @southafricangamer7174 3 месяца назад

    So to my understanding, the singular tests really mean to check if nothing is the result of the query been tested.
    If the test is true, then nothing equates to the query been tested - Great your data is fine.
    If false, you should run that query to see what exactly are those rows.
    Confusing at first but makes sense now.

  • @nadhasthirundhitan
    @nadhasthirundhitan 7 месяцев назад

    excellent video, thank you

  • @corbanb
    @corbanb 4 месяца назад

    Jay! Thanks for the video and content very cool to see. Curious why Airflow over something like FiveTran besides the ability to self host? Any gotchas?

    • @jayzern
      @jayzern  4 месяца назад

      FiveTran is not really an orchestration tool - it's really meant for the "Extract Load" part only. It's great because of Unix philosophy, i.e. "do one thing, do one thing well only", whereas Airflow is more of a generalist, task-based orchestrator. Another thing is FiveTran is super expensive, unless you're working on something enterprise-y

  • @miguelhermar
    @miguelhermar 10 месяцев назад +1

    Thanks Jay! Could you also upload into the Notion document the code for the dbt_dag.py file for the Airflow deployment? That's still missing 🙏🏻

    • @jayzern
      @jayzern  10 месяцев назад +1

      Totally forgot about that, thanks for the reminder!

    • @miguelhermar
      @miguelhermar 10 месяцев назад

      No worries, I realized you used it from the Cosmos github repo so I managed to find it there and finally was able to wire up everyhing and deploy it. 🤓 Thanks Jay. It's a super helpful tutorial. @@jayzern

  • @mohammedvahid5099
    @mohammedvahid5099 9 месяцев назад

    Pleas make complete videos on DBT WITH snowflake migration project with real time scenario videos bro thnk u❤ nice explaind

    • @jayzern
      @jayzern  9 месяцев назад +1

      Thank you man! Will take that into consideration

  • @AaronAsherRandall
    @AaronAsherRandall 5 месяцев назад

    This is great! At what point would you need to dockerize the files though? Sorry, new to data engineering. Thank you!

    • @jayzern
      @jayzern  5 месяцев назад

      You can Dockerize it at the beginning, or once you have a baseline model working. I've seen cases where Data engineers start with Docker, or Dockerize it halfway! I personally prefer the latter

  • @saurabhpandey1882
    @saurabhpandey1882 6 месяцев назад

    Thanks Jayzern,! if I can be of some help for your next video let me know!

  • @jeevankumarkondasingu34
    @jeevankumarkondasingu34 7 месяцев назад

    Nice Explaination

  • @StanleySI
    @StanleySI 7 месяцев назад +1

    Just wonder in the real world scenario, where are all raw data stored? In AWS s3?

  • @JoseR-ui9vn
    @JoseR-ui9vn 6 месяцев назад

    Thanks Jayzern

  • @CSK-Studios
    @CSK-Studios 10 месяцев назад

    Hi Jay, good one..am trying same way but getting below error " 1 of 1 ERROR creating view model dbt_schema.stg_tpch_line_items................. [ERROR in 0.04s]
    06:17:33
    06:17:33 Finished running 1 view model in 2.02s.
    06:17:33
    06:17:33 Completed with 1 error and 0 warnings:
    06:17:33
    06:17:33 Compilation Error in model stg_tpch_line_items (models\staging\stg_tpch_line_items.sql)
    06:17:33 'dict object' has no attribute 'type_string'
    06:17:33
    06:17:33 > in macro generate_surrogate_key (macros\sql\generate_surrogate_key.sql)
    06:17:33 > called by macro default__generate_surrogate_key (macros\sql\generate_surrogate_key.sql)
    06:17:33 > called by model stg_tpch_line_items (models\staging\stg_tpch_line_items.sql)"

    • @jayzern
      @jayzern  10 месяцев назад

      Try checking if your dbt_utils version is correct. There seems to be a compile time error with calling generate surrogate key. The code is available in notion page.

    • @szhao2864
      @szhao2864 9 месяцев назад

      I got the same error. How did you solve it?

  • @dataengineermatheusbudin7011
    @dataengineermatheusbudin7011 5 месяцев назад

    Hey, thanks for the project tutorial. i was wondering if there is the best way to deploy airflow on a cloud enviroment... I see a lot of Ec2 or EKS (kubernetes). But maybe i could work on ECS + Fargate? Which deploy method would you please recomend regarding a production scenario? (like beyond studies, thinking about a daily job task). Thank you mate

    • @jayzern
      @jayzern  5 месяцев назад

      Airflow + EKS is probably the most common in the industry because of cost reasons and vertical scaling. You could use ECS + Fargate too, but fargate is really expensive!
      I don't have any recs atm, but will try to create more examples on production DAGs next time. Check out ruclips.net/video/Xe8wYYC2gWQ/видео.html in the meantime!

  • @truongnguyen813
    @truongnguyen813 7 месяцев назад +1

    I'm struggling within the step to load dbt data_pipeline, it did not show in the airflow dag. How could I be wrong, can you support?

  • @BishalKarki-pe8hs
    @BishalKarki-pe8hs 7 месяцев назад

    100% worth it

  • @bazi15
    @bazi15 Месяц назад

    make video related star and dimension modeling

  • @maryam4071
    @maryam4071 8 месяцев назад

    hi, I would like to know about singular test, we want to check negative value in test, why we use the condition as positive?

  • @thetrangia1091
    @thetrangia1091 10 месяцев назад

    thank you very much

  • @saikoundinya9913
    @saikoundinya9913 6 месяцев назад

    Dude, where did you even mention about dbt_project.yml file, in part 2 of the video, you directly jump to vscode
    what are the details ??

  • @pythonmathui3057
    @pythonmathui3057 5 месяцев назад

    I'm struggling with airflow connection to snowflake, can you make another video to elaborate it more?

    • @jayzern
      @jayzern  5 месяцев назад

      For sure, I didn't explain the airflow integration with snowflake as much as I wanted to

  • @khushikunte254
    @khushikunte254 2 месяца назад

    any prerequisites for this

  • @aiviet5497
    @aiviet5497 5 месяцев назад

    I need a longer video. Please give me.

  • @OmerNadler
    @OmerNadler 9 месяцев назад

    is do i need to pay on astro ? if i want to use this for prod env

  • @hasnaouiwafae6031
    @hasnaouiwafae6031 2 месяца назад

    I cannot run my dbt project. I’m still a beginner but I do not understand why this happens, considering that my macros directory is empty except for a .gitkeep file:
    Compilation Error
    dbt found two macros named "materialization_table_default" in the project
    "dbt".
    To fix this error, rename or remove one of the following macros:
    - macros/materializations/models/table/table.sql
    - macros/materializations/models/table.sql

  • @hiteshmohite7677
    @hiteshmohite7677 9 месяцев назад

    hey, I have a small request
    can you please make a video on how to make use of pyspark efficiently in low spec system with huge amount of data

    • @jayzern
      @jayzern  9 месяцев назад

      Low compute Spark + high volumes of data is challenging but will take note. Thx for the suggestion

  • @stephenarnold2343
    @stephenarnold2343 6 месяцев назад

    I materialized marts as tables but int_order_items, int_order_items_summary and fct_orders are created as views instead of tables. How do I convert these views to tables?

  • @CybersecYT
    @CybersecYT 8 месяцев назад

    How could i get the project folder structure?

  • @prabhatgupta6415
    @prabhatgupta6415 5 месяцев назад

    can u tell why have we used airflow since dbt cloud has feature to schedule the jobs?

    • @jayzern
      @jayzern  5 месяцев назад

      If your company only uses dbt and no other tooling, dbt cloud works too
      However in the real world, it's hard to control your CRON schedule when you have many tools in your stack. Orchestrators job is to focus on scheduling. Linux philosophy of do one thing, do one thing well TLDR

  • @SteynGun-n2u
    @SteynGun-n2u 4 месяца назад

    hi guys kindly help me out, does only snowflakes and dbt is enought are i have to learn hadoop, spark etc i am working as data analyst for last 1 year and planning to switch to de

  • @AakashKumarDhal
    @AakashKumarDhal 5 месяцев назад +4

    Error solved!!!!
    for anyone facing this error:
    Runtime Error
    Database error while listing schemas in database "dbt_db"
    Database Error
    250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting
    Try the second method to update account name for your project inside profile.yml file.
    account_locator-account_name

    • @Abhishek-jp6ib
      @Abhishek-jp6ib 4 месяца назад

      Thank you !

    • @albertcampillo
      @albertcampillo 4 месяца назад

      Thank you!

    • @juventuskid2
      @juventuskid2 2 месяца назад

      Hey! How did you go about updating the account name(or resolving the error)? I can't find the profile.yml file.

  • @rvmnet2112
    @rvmnet2112 8 месяцев назад

    One question here, As we have dbt jobs feature available in dbt cloud and it is very easy to create job here then why it is need to use airflow?

    • @jayzern
      @jayzern  8 месяцев назад

      Yea that's great question! In theory dbt cloud can trigger jobs too, but in practice you'd want to decouple your orchestration tool away from your transformation tool for a myriad of reasons: ability to orchestrate other tools together with dbt, avoid vendor lock from dbt, many companies are comfortable with Airflow etc. It really depends on your tech stack

  • @sanaomar2182
    @sanaomar2182 7 месяцев назад

    How did he stat? did he create a wroksheet? I tried it but it di not work, the very first steps ?? what arethey?

    • @Rajdeep6452
      @Rajdeep6452 7 месяцев назад

      yes you need to write the queries in a worksheet

  • @shloktalhar3981
    @shloktalhar3981 Месяц назад

    Tell me one thing , is data engineering good job profile for freshers

  • @anggipermanaharianja6122
    @anggipermanaharianja6122 2 месяца назад

    nice

  • @abdullahsiddique7787
    @abdullahsiddique7787 3 месяца назад

    Is data engineering dead with advent of AI ? What is the future of data engineering careers in your opinion ?

  • @neosmith009
    @neosmith009 8 месяцев назад

    Overall great, the airflow orchestration felt a bit clunky especially given that the source code had to be kept in the same directory.

    • @jayzern
      @jayzern  8 месяцев назад

      Thx for the feedback 👍 ideally should wrap this in a container image, but for simplicity decided to keep it as code

    • @neosmith009
      @neosmith009 8 месяцев назад

      @@jayzern Makes sense, any good resources on self hosting dbt core?

  • @StephenRayner
    @StephenRayner 5 месяцев назад

    You should check out Meltano

    • @jayzern
      @jayzern  5 месяцев назад

      I've heard great things about Meltano!

  • @SakshiGowda-vl1ke
    @SakshiGowda-vl1ke 2 месяца назад

    hi ! I'm having trouble connecting to snowflake. can someone please help me resolve it . I just started learning dbt and snowflake .
    Runtime Error
    Database error while listing schemas in database "dbt_db"
    Database Error
    250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting

    • @krishuynh3337
      @krishuynh3337 11 дней назад

      worth checking your snowflake credentials again, I got the same error due to an incorrect account id

  • @okkwok1753
    @okkwok1753 9 месяцев назад

    I am not sure why I cannot open the notes, can anyone help?

    • @jayzern
      @jayzern  9 месяцев назад +2

      I double checked the link and it's working, try this
      bittersweet-mall-f00.notion.site/Code-along-build-an-ELT-Pipeline-in-1-Hour-dbt-Snowflake-Airflow-cffab118a21b40b8acd3d595a4db7c15?pvs=74
      Let me know what error you see

  • @tianhockwoo3025
    @tianhockwoo3025 4 месяца назад

    Hello did anyone else face this error at Airflow after @32:50
    Broken DAG: [/usr/local/airflow/dags/dbt-dag.py]
    Traceback (most recent call last):
    File "/usr/local/lib/python3.12/site-packages/cosmos/operators/base.py", line 361, in __init__
    self.full_refresh = full_refresh
    ^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.12/site-packages/airflow/models/baseoperator.py", line 1198, in __setattr__
    if key in self.__init_kwargs:
    ^^^^^^^^^^^^^^^^^^
    AttributeError: 'DbtRunLocalOperator' object has no attribute '_BaseOperator__init_kwargs'. Did you mean: '_BaseOperator__instantiated'?
    please send help

    • @CosmicNomad
      @CosmicNomad 4 месяца назад +1

      I am facing the exact same error. Please post a reply, if you were able to figure out the fix. I'll do the same if I find a solution.

    • @CosmicNomad
      @CosmicNomad 4 месяца назад

      Ok, so I think I was able to find the thread related to this issue.. Its still open as of 8/18/2024 11pm PT..
      github.com/astronomer/astronomer-cosmos/issues/1161