Merging your data in a modern lakehouse data warehouse

Поделиться
HTML-код
  • Опубликовано: 15 окт 2024
  • Learn how you can move your data through different tiers using MERGE either with pySpark or SQL in Azure Synapse Analytics. Stijn Wynants walks you through the different steps.
    Stijn Wynants
    / sqlstijn
    pyspark.sql.SparkSession.createDataFrame
    spark.apache.o...
    Table deletes, updates, and merges
    docs.delta.io/...
    Delta Lake Documentation
    docs.delta.io/...
    📢 Become a member: guyinacu.be/me...
    *******************
    Want to take your Power BI skills to the next level? We have training courses available to help you with your journey.
    🎓 Guy in a Cube courses: guyinacu.be/co...
    *******************
    LET'S CONNECT!
    *******************
    -- / guyinacube
    -- / awsaxton
    -- / patrickdba
    -- / guyinacube
    -- / guyinacube
    -- guyinacube.com
    **Gear**
    🛠 Check out my Tools page - guyinacube.com...
    #AzureSynapse #merge #GuyInACube

Комментарии • 16

  • @craigbryden3747
    @craigbryden3747 2 года назад +9

    Great Video. I'd love to see a future one addressing how to handle type 2 SCDs.

  • @arthurcsp
    @arthurcsp 2 года назад +2

    Great video! That's exactly what I was searching for early.
    Thanks a lot!

  • @BooranJohnsonRishka
    @BooranJohnsonRishka 2 года назад +4

    Thank you for the video, this is amazing! I've been following this playlist of "Building a modern data lakehouse in Azure Synapse". Up to to this video demonstrates Inserts and Updates, but what about Deletes from the Bronze layer to Silver Layer? Can you please make a video about deletes in Synapse SQL, thanks.

  • @gabrielmorais7312
    @gabrielmorais7312 2 года назад +1

    Great vid, Thanks guys!
    Let me ask you, is there any difference in the cost of execution when you run a notebook using PySpark or SQL?

    • @stijnwynants7307
      @stijnwynants7307 2 года назад +1

      Hi Gabriel, both use the same SparkAPI in the background so they should cost/perform about the same. The cost is the amount of time your cluster runs!

    • @gabrielmorais7312
      @gabrielmorais7312 2 года назад

      @@stijnwynants7307 thank you very much Stijn!!

  • @EmmanuelAguilar
    @EmmanuelAguilar Год назад

    Question?. When my data merge with new data I need to re-build the data base in the workspace?, or the data base in the workspace read the delta file and have the changes?, please help me

  • @LandscapeInMotion
    @LandscapeInMotion Год назад

    Hi Stijn - how do you keep running the merge statement so that you get live data in your SILVER layer?

  • @pini22ki
    @pini22ki Год назад

    Why delta is not supported in azure synapse dedicated sequel pool?

  • @germanareta7267
    @germanareta7267 2 года назад

    Great video.

  • @nickleshkevich6153
    @nickleshkevich6153 7 месяцев назад

    Supercool!

  • @bunnihilator
    @bunnihilator Год назад

    It's not clear how this tie up to the first time data load to bronze.

  • @avinashkanche2993
    @avinashkanche2993 2 года назад

    Nice 👍

  • @prateekraina2781
    @prateekraina2781 2 года назад

    Cool 😎

  • @aamoody81
    @aamoody81 2 года назад

    This is great stuff