Microsoft Fabric: Data Warehouse vs Lakehouse vs KQL Database

Поделиться
HTML-код
  • Опубликовано: 24 дек 2024

Комментарии • 115

  • @LearnMicrosoftFabric
    @LearnMicrosoftFabric  9 месяцев назад +17

    Hey Fabricators! If you found this video helpful, please LIKE, SUBSCRIBE and SHARE with colleagues to get the word out - THANK YOU! 🙌🙌👍👍

    • @_TheDataTrain
      @_TheDataTrain 9 месяцев назад +3

      Like the pragmatic approach. You're walking the tour in a naturally calm way enabling viewers to follow your thinking very easily.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      ah thanks for the feedback! were you talking about the first half in Fabric or on the tablet? Just curious to hear which style people prefer, or a mixture of the two? Thanks for watching!

    • @LiamMcCauley
      @LiamMcCauley 9 месяцев назад +3

      @@LearnMicrosoftFabric I liked the mix of both - the tablet was useful for methodically going through each point, but I also like being able to relate that to seeing the tool on screen.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      Good to know, thanks!!

    • @mattroberts9665
      @mattroberts9665 9 месяцев назад +1

      @@LiamMcCauley Agreed. A mix of both was great in this video. But divvying out each section into more digestible sized videos maybe the way to go. As much as i watched this video all the way through. 30 minutes may be too long for others.

  • @pimentelallanap
    @pimentelallanap 9 месяцев назад +10

    This content is pure gold. Uncredible quality!

  • @brianszeto3418
    @brianszeto3418 9 месяцев назад +7

    Great video! The additional data security options for data warehouse is another reason to implement the gold tier using data warehouse instead of data lakehouse, for example if you want to have a unified data warehouse within your organization but you want to control who can access what dimensions/facts based on criteria such as departments, job roles, etc.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      Yes exactly! Thanks for watching!! Have you been building things in Fabric yet Brian?

    • @brianszeto3418
      @brianszeto3418 9 месяцев назад

      @@LearnMicrosoftFabric not yet but looking at getting the DP-600 cert in the next few months hopefully!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +5

      @@brianszeto3418 nice, good luck with that! I will probably be moving to DP-600 specific content after this series (April time) which might be useful for you 👍👍

  • @mattroberts9665
    @mattroberts9665 9 месяцев назад +2

    I've got to say your videos are great and your delivery style is excellent. Remarkably personable.
    Keep doing what your doing. Distilling down this huge transition that some of us are going through, in that moving from the traditional Azure data services (ADF, Databricks, Synapse etc) into Fabric. Other RUclipsrs show similar material, but you're diving just that little bit deeper. Nice one.
    Matt

  • @stevenross3577
    @stevenross3577 Месяц назад +1

    The whole series is great but this video, and in particular your comparison of data stores and review of architectures, is pure gold. Amazing work...thank you for sharing your knowledge.

  • @tyronefrielinghaus3467
    @tyronefrielinghaus3467 9 месяцев назад +1

    Will, that was a a really great video : nice and comprehensive , and the flow at the end was great : thanks!!!

  • @karolk880
    @karolk880 5 месяцев назад

    One of the most valuable channels on YT, specially for data nerds! Definitelly one of my favorites! Thanks your your work.

  • @deepakgupta4479
    @deepakgupta4479 9 месяцев назад +2

    Superb Explanations...! and Great Video

  • @pandit26
    @pandit26 5 месяцев назад

    Fantastic video ! specially like the handwritten explanation about the differences between Lakehouse , warehouse and KQL. Well done !

  • @hansvetters8026
    @hansvetters8026 9 месяцев назад

    Great content indeed! It helps me to clear the mist in my head, many thanks for the work you are putting in this series! I am pretty much an on prem DWH architect making my feet wet to build out an ETL architecture in fabric.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      That's good to hear! Glad the videos are helping :) hopefully it's making sense for you with your background? Lots of exciting opportunities in Fabric over an on-prem DWH 😊

  • @AnkitSingh-gw6fj
    @AnkitSingh-gw6fj 5 месяцев назад

    Great video. One question: currently we have kusto clusters in which we have the raw data stored. I want to store the summarised form in either one lake house, or kql database in fabric but not sure what would be the most efficient solution. Need something which can store huge amount of data and also able to run kql on the aggregated data.

  • @pandit26
    @pandit26 5 месяцев назад

    It is very different way of explanation than other youtube learning videos. It keeps you engaged till the end that is the best quality good job Will !

  • @VigneshKiswanth
    @VigneshKiswanth 6 месяцев назад

    Thank you for the excellent video. I have a question: Which is preferable in terms of Audit Framework and Watermarking? We encountered update conflicts in the warehouse table when attempting to update audit tables while ingest multiple tables simultaneously, which updates watermark and audit values. To circumvent this issue, we've adopted an insert-only approach for both Audit and Watermark frameworks, which has proven effective. There's also discussion about using a KQL database, but we're uncertain about its complexity.

  • @Unbox747
    @Unbox747 2 месяца назад

    For table cloning, thought it'd be possible to create another mirror of a table in the lakehouse that does the same?🤔

  • @azwarmzafar
    @azwarmzafar 9 месяцев назад

    your content is really straightforward, to the point, well designed and perfectly communicated. thanks a million.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      Thanks a lot for your kind words! Glad you're finding it helpful 🙌

  • @rameshpaskarathas6512
    @rameshpaskarathas6512 8 месяцев назад

    Thanks! Keep up the great work.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  8 месяцев назад

      very generous, thanks a lot Ramesh 🙌🏽 i will do, just getting started ☺️

  • @04nimmot
    @04nimmot 6 месяцев назад

    Preparing for DP-600, my first Microsoft exam, but KQL wasn't covered in the learning material, thanks!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  6 месяцев назад

      KQL is not assessed in the DP-600 exam

    • @04nimmot
      @04nimmot 6 месяцев назад

      @@LearnMicrosoftFabric strange as its in the practise exam

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  6 месяцев назад +1

      @@04nimmot oh really? that's surprising to me... I'm 98% sure it's not in the exam. I didn't get any KQL questions in my exam. And it's not mentioned in the Study Guide: learn.microsoft.com/en-us/credentials/certifications/resources/study-guides/dp-600

    • @04nimmot
      @04nimmot 6 месяцев назад

      @@LearnMicrosoftFabric you're definitely correct its not in the learning guide, had a question on ALM Toolkit too. Bit of looking up XMLA and should be okay for the exam in two weeks, I think..

  • @LavanyaVijay-k7x
    @LavanyaVijay-k7x 22 дня назад

    Amazing video. Is there any video how can we do migration from Azure sql datawarehouse to Fabric datawarehouse?

  • @fabricioferreiraz
    @fabricioferreiraz 8 месяцев назад +1

    Simple amazing! I believe this is the best video that I’ve seen to a end to end solution for fabric.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  8 месяцев назад

      Hey thanks for watching and for the comment! Glad you enjoyed 🙌

  • @AdrianMcGavock
    @AdrianMcGavock 9 месяцев назад

    this is a great series, really enjoying the content... very nicely explained without unnecessary jargon or unnecessary complexity - well done sir 🍻

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      Awesome, glad you're enjoying it!! Next video coming out on Friday 😊

  • @rameshpaskarathas6512
    @rameshpaskarathas6512 8 месяцев назад

    Great video Will. Thoughts on having a lakehouse for gold layer as well but perhaps then loading this to warehouse for SQL specific tasks like masking and object or column security. I just like the flexibility of lakehouse languages offer.

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  8 месяцев назад

      yeh I’m pretty sure you can do that stuff on the Lakehouse SQL endpoint (no need to move over to DWH). just be careful that your audience has no more than Viewer role in the workspace otherwise anything you set on the SQL endpoint can be sidestepped (using Spark). hope that makes sense?

  • @ClintElyonson
    @ClintElyonson 9 месяцев назад

    NIce one Will, good to see you dominating the Fabric space. well done on great videos. I'm learning from you :)

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      Hi Clint, great to see you here :) hope you're doing well, and thanks for the support 🙌

  • @TheXSpud
    @TheXSpud 8 месяцев назад

    Very clear and informative video of the different options in Fabric. A related question, admittedly not in the scope of this video, is using semantic models vs data warehouses to share data with customers (effectively the Power BI devs). At my org, we're generally planning to create semantic models to give users access to gold data, how do you stack that up against giving access to a gold data warehouse? Are there use cases for both for this task?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  8 месяцев назад

      Yeh depends on your requirements, and how much access/flexibility you want to give them.

  • @danielkrumov5095
    @danielkrumov5095 9 месяцев назад +1

    Hey, Will, I have a question. In the previous video comparing Data pipeline vs Dataflow.., you mentioned that it is not possible to make a shortcut from a Lakehous to a Data Warehouse. However, in the end-to-end architecture example in this video, the third layer is a Data Warehouse that is getting data from the previous layer - a Lakehouse. How can we set up this workflow, then?
    Otherwise, I find this video and your content extremely helpful and insightful! Keep up the amazing work!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      Hey Daniel, great question, I probably should have clarified that point in one of the videos. Although you can't create a shortcut from a lakehouse into a DWH, you can actually query a lakehouse from a DWH using something like select * from [LakehouseName].[dbo].[TableName] - this makes your 'Silver' layer accessible in your 'Gold' DWH. You can build views from the DWH (from Silver lakehouse data) or if you want to 'materialize' some gold analytical models, you could use stored procedures with an INSERT INTO you gold table. Hope that makes sense? Another option is to use a data pipeline or dataflow to write gold tables into the data warehouse (unfortunately we can write to a data warehouse using a Notebook).

    • @danielkrumov5095
      @danielkrumov5095 9 месяцев назад

      ​@@LearnMicrosoftFabric Thank you so much for your quick and extensive answer! It definitely answered my question and I have more clarity now. Keep crushing it with the amazing content!

  • @andynovak9838
    @andynovak9838 7 месяцев назад

    Greetings Will -- fantastic job! You happen to have your hand drawn diagram available in electronic form that you could share?

  • @FrankLloydTeh
    @FrankLloydTeh 5 месяцев назад

    If I have an microservice built in Java and it has to read from a database in Fabric to use some stats when a user accesses a certain page, which type of database would you recommend ?

  • @kates1736
    @kates1736 8 месяцев назад +1

    Very helpful explanation of the topic! Thank you Will!

  • @AbelGarcia-ki5nd
    @AbelGarcia-ki5nd 7 месяцев назад

    Amazing thanks for the breakdown! I'd love to throw pricing in the comparison, maybe for another video?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  7 месяцев назад

      The cost for each will be similar. In Fabric, most of ‘cost’ comes from Compute (I.e. how you operate on the data), rather than which data store you use 👍

  • @bkundrat
    @bkundrat 9 месяцев назад

    Outstanding. The layout of the content is extremely helpful.

  • @eniolaadekoya5623
    @eniolaadekoya5623 2 месяца назад

    wow ..will thank you this video just cleared a lot of questions around Microsoft fabric

  • @SureshGuddanti
    @SureshGuddanti 8 месяцев назад

    Will, love the way you explained this much discussed topic . Thank you

  • @alexshteinberg1227
    @alexshteinberg1227 9 месяцев назад

    Great Video! really appreciate the breakdown and the possible outcomes.👍

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      Thanks a lot Alex! Glad you enjoyed the breakdown! Have you been building with any of these data stores yet?

  • @FatherLizard
    @FatherLizard 9 месяцев назад

    Another amazing video - thank you. Question: how does one decide on how many lakehouses/warehouses to have? Do you divide by the many functional areas of your business or can you dump all your organization’s data into just a few lakehouses?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      Thanks for watching! Really depends on your requirements and preference as a business! Normally the biggest factors are whether you want git integration and separation of data for security reasons

    • @TheRamiroB
      @TheRamiroB 9 месяцев назад +1

      Hi, my two cents. Depends of the method that you use for organization data, if you came from mesh you focus on owner data, if you came form fabric probably y use funtional aproachh.. if you start from scratch yo divede by responsability grousps and security requeriments. but remember you can change in the future.

  • @krishnakantapanda6835
    @krishnakantapanda6835 4 месяца назад

    Very well explained!

  • @chinmaykajalwa
    @chinmaykajalwa 9 месяцев назад

    Great video Will. I have few questions.
    1. "Materialized views" are supported in warehouse, but not in lakehouse. Please confirm.
    2. In my ongoing project of Power BI to Fabric migration, i am thinking of the following pattern. Bronze and Silver layer in lakehouse and Gold layer spread across LH and WH. To be precise, star schema in LH and materialized views in WH. Your opinion about this ?
    3. There is one more entity "Data Mart". It has good RLS and OLS features. Any idea when to use this and how Data Mart is different from LH and WH?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      1. No, materialised views only in KQL database, not data warehouse. In Lakehouse you can imitate the effect by writing to a different table (which is effectively what a materialised view is)
      2. It’s difficult to give opinion on architecture without context, but in general LAKE-Lake-warehouse is a common pattern.
      3. Don’t have much experience with data mart so can’t really comment. Look at Radacad’s article on the subject I believe he mentions data marts in his comparison 👍

  • @rshekarLan
    @rshekarLan 2 месяца назад

    Very good and helped me a lot. Now I can say I know Fabric. Thank you

  • @GabrieladeAzevedoSampaio
    @GabrieladeAzevedoSampaio 7 месяцев назад

    Hi, this is Gabee from Brazil. Thank you for this amazing lesson! ⌛ I have a question about RLS in the Lakehouse. Does it apply to Power BI as well?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  7 месяцев назад

      Currently, no, you will have to define in the semantic model too. In the future, yes (when they roll-out a feature called OneSecurity, possibly by the end of the year).

  • @sandrojorgeoliveira175
    @sandrojorgeoliveira175 5 месяцев назад

    Will, thank you for share your acknowledgement!

  • @shunlai4218
    @shunlai4218 9 месяцев назад

    Hi Will,
    Thanks for all the insightful video.
    got a question about Power BI DirectLake mode,
    can a DirectLake dataset be created based on Data Warehouse as well?
    Many thanks!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      Yes can be built from both Lakehouse or data warehouse 👍 next video will focus on Direct Lake ☺️

  • @michaeldelaney102
    @michaeldelaney102 9 месяцев назад

    @LearnMicrosoftFabric do you have the side-by-side comparison as a downloadable file? Thank you

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад +1

      Hi Michael, I've added it to the resources section here: www.skool.com/microsoft-fabric/classroom/c75b239c?md=b89a483c2f8146d0adfc7366f714d3d3

  • @peternguynguyen5208
    @peternguynguyen5208 9 месяцев назад

    Very detailed explanations. Thank you

  • @carloschebair3406
    @carloschebair3406 6 месяцев назад

    great info! thanks for sharing!

  • @fabianaltendorfer11
    @fabianaltendorfer11 8 месяцев назад +1

    Insane, thank you so much Will.

  • @PhilcoCup
    @PhilcoCup 8 месяцев назад

    Awesome video man, didn't know your channel, subscribed! thx a lot

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  8 месяцев назад

      Welcome to the channel, lots of video to catch up on 😉 haha thanks for watching!!

  • @relaxplease2472
    @relaxplease2472 6 месяцев назад

    I have one doubt, in this video you are telling we need warehouse for data modelling. can we use warehouse that is automatically created for each lakehouse for data modelling? Instead of creating a new one? Can anyone explain this?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  6 месяцев назад

      I'm not telling people what I think they need, just talking through the characteristics of each to allow people to make their own decisions 👍 every company is different, with different requirements.
      You can use the T-SQL endpoint of a Lakehouse to perform some data modelling, but you will be limited to read-only operations.

    • @relaxplease2472
      @relaxplease2472 6 месяцев назад

      @@LearnMicrosoftFabric thank you for your response man. I got it

  • @mashagalitskaia8642
    @mashagalitskaia8642 4 месяца назад

    This is incredibly useful!

  • @PaulBailey3
    @PaulBailey3 9 месяцев назад

    Hey Will, I was reading today that Lakehouse default function is replace and not append any longer. Will these deltas still apply to lakehouses after that announcement?

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      Hi Paul, not sure I entirely understand your question, but the underlying format for all tables in Lakehouse is delta parquet, so parquet files with a delta log that tracks changes (whether they be through a replace or an append)

  • @TheRamiroB
    @TheRamiroB 9 месяцев назад

    Good video, so really clear...

  • @FrankyLeeuwerck
    @FrankyLeeuwerck 8 месяцев назад

    Top presentation!

  • @datboi6066
    @datboi6066 8 месяцев назад +1

    Great great video, thank you

  • @rabihbadr54
    @rabihbadr54 9 месяцев назад

    Very informative, thank you!

    • @LearnMicrosoftFabric
      @LearnMicrosoftFabric  9 месяцев назад

      Thanks for watching, I'm glad you found it informative 🙌

  • @csenaraths
    @csenaraths 9 месяцев назад

    Good Explanation !

  • @samirvaghasiya9918
    @samirvaghasiya9918 8 месяцев назад

    Thank you very much Will.
    Really useful :)

  • @rodygaf
    @rodygaf Месяц назад

    Amazing video

  • @sabzchetty
    @sabzchetty 7 месяцев назад

    For me. It is always the Lakehouse because everything else is derived from there.

  • @darins
    @darins 7 месяцев назад

    MARK
    ✅ Dynamic FilePath
    ✅ Add Date from Within File (this is good)
    ✅ Flatten Header Rows with fx (ooo la la, add-in's ftw)
    ✅ Remove null Columns (Wow!!)
    ✅ Change Header of Column (I forget to use reference tricks!)
    ✅ Filter by a List with Merge (hyperspeed!)
    ✅ Clean data in 1 step (I really really love fewer steps!)
    CHANDEEP
    ✅ Eliminate Promoted Header Step (love fewer steps!)
    ✅ Create Columns from a Record (I really love fewer steps!!)
    ✅ Add Total Row to Table (wierd, but cool)
    ✅ is Number or Structure (weirder, but cool)
    ✅ Projections (weirdest, but cooler)
    ✅ Apply a function to all columns of a table
    ✅ Group Kind. Local to count stints (🤯)
    It's a tie! Which mean I win! 🙏Thanks guys!!!

  • @dlevans72
    @dlevans72 15 дней назад

    well done

  • @carlosnavia1361
    @carlosnavia1361 7 месяцев назад

  • @pulfredy1
    @pulfredy1 2 месяца назад

  • @patdia2906
    @patdia2906 9 месяцев назад

    I love u

  • @hillwilkinsontraining1057
    @hillwilkinsontraining1057 5 месяцев назад