KNOW the difference between Data Base // Data Warehouse // Data Lake (Easy Explanation👌)

  • Published: 26 Nov 2024

Comments • 514

  • @MinhNguyen-ih5dt
    @MinhNguyen-ih5dt 3 years ago +345

    I have watched or read many explanations of the differences among these 3 terms, but so far this video is the simplest, clearest, and easiest to understand. Thanks a lot!!!

    • @chandoo_
      @chandoo_ 3 years ago +8

      Wow.. thank you for that 😀

    • @udaynarri967
      @udaynarri967 3 years ago +6

      Exactly, this is how I feel. Thanks Chandoo.

    • @theh1ve
      @theh1ve 3 years ago +7

      I came to the comments to say the same thing! Thank you for this simple, illustrative explanation.

    • @hasanwasti7227
      @hasanwasti7227 2 years ago +1

      @chandoo_ qqq

    • @ayasrhan9751
      @ayasrhan9751 2 years ago +2

      I very much agree

  • @RAZREXE
    @RAZREXE 2 years ago +78

    There is no other video on YouTube that explains DB/DW/DL this easily. Really appreciate the time and effort you put into making these videos.

  • @amirmalekahmadi9910
    @amirmalekahmadi9910 3 years ago +18

    Wise men can explain sophisticated things in a way that a 5-year-old kid can easily learn! Congrats Wise Man!

    • @chandoo_
      @chandoo_ 3 years ago +2

      😍 That is a beautiful compliment. Thanks Amir.

    • @abdulrahmanbinillyas5944
      @abdulrahmanbinillyas5944 6 months ago

      I have seen many videos but this explanation is very nice and clear

  • @ericvt
    @ericvt 2 years ago +2

    As a person in this industry, this is the best video ever. Exceptionally clear.

  • @paulrprichard
    @paulrprichard 3 years ago +19

    In a typical database there will be transactions taking place, like an insert, update, or read of a table row, that are in line with a set of business cases.
    In a data warehouse there will be analysis taking place across multiple rows from multiple tables.
    A data lake is where data goes to get drowned.
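
To make the contrast in the comment above concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `sales` table and its rows are hypothetical, and a single table is used for brevity where a real warehouse query would usually span several. The first half touches individual rows, as a transactional database does all day; the second half summarises across rows, which is the kind of query a warehouse is built for.

```python
import sqlite3

# Hypothetical sales table, used for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (id INTEGER PRIMARY KEY, product TEXT, qty INTEGER)")

# Database-style (transactional) work: touch one row at a time.
with conn:  # commits on success, rolls back on error
    conn.execute("INSERT INTO sales (product, qty) VALUES ('chocolate', 2)")
    conn.execute("INSERT INTO sales (product, qty) VALUES ('biscuit', 5)")
    conn.execute("INSERT INTO sales (product, qty) VALUES ('chocolate', 3)")
    conn.execute("UPDATE sales SET qty = 4 WHERE id = 1")

single_row = conn.execute("SELECT product, qty FROM sales WHERE id = 1").fetchone()

# Warehouse-style (analytical) work: summarise across many rows.
totals = conn.execute(
    "SELECT product, SUM(qty) FROM sales GROUP BY product ORDER BY product"
).fetchall()

print(single_row)
print(totals)
```

Both styles run on the same engine here only because the data is tiny; the sketch shows the difference in query shape, not in performance.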

    • @chandoo_
      @chandoo_ 3 years ago +9

      "A data lake is where data goes to get drowned." 😂😂😂

  • @sarago99
    @sarago99 3 years ago +32

    Simple to start with. No PPT slides, just notepad is enough to explain ❤️ Thank you bro. Keep up your good work 👍

  • @aiasaiascon3894
    @aiasaiascon3894 2 years ago +3

    One more comment for me.
    The best, most simple, laconic, yet rich, explanation about the diffs of the terms.

  • @kameshk6188
    @kameshk6188 3 years ago +11

    I don't think any other video on the internet explains this difference as clearly as this video. Thank you brother. Keep posting more videos to educate us.

  • @rajivjani8594
    @rajivjani8594 2 years ago +4

    Super! In just 8 minutes, you have painted such a clear picture of database, data warehouse and data lake that I can never forget it, and in future, any time I deal with these terms, I will have a crystal clear idea of what I am dealing with! You are a GREAT teacher Chandoo and I really appreciate your effort!

  • @jasonirwin5171
    @jasonirwin5171 1 year ago +1

    I love this. This explains perfectly what I've been trying to explain at work. Instead of continuing to argue, I am just going to show this video.

  • @ravinaikwadi9899
    @ravinaikwadi9899 3 years ago +25

    A small correction: a data warehouse is a system and/or DB where hundreds of heterogeneous DBs (e.g. chocolate DB, biscuits DB, candy and ice cream DBs) or file-based sources like Excel and XML are modelled, stored, and streamed together using an ETL tool, for data analytics and downstream applications, and also for data science and AI builds.
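
The extract, transform, load flow described in the comment above can be sketched roughly as follows. This is a toy illustration, not a real ETL tool: the chocolate database, the CSV text standing in for a spreadsheet export, and every table and column name are assumptions made up for the example.

```python
import csv
import io
import sqlite3

# Source 1: a hypothetical operational database for chocolates.
chocolate_db = sqlite3.connect(":memory:")
chocolate_db.execute("CREATE TABLE orders (item TEXT, amount REAL)")
chocolate_db.executemany(
    "INSERT INTO orders VALUES (?, ?)", [("dark bar", 3.5), ("milk bar", 2.0)]
)

# Source 2: a hypothetical file export (CSV text standing in for an Excel sheet).
biscuit_csv = io.StringIO("item,amount\ncookie,1.25\ncracker,0.75\n")

# Extract: pull rows from each source and tag them with where they came from.
rows = [
    ("chocolate", item, amount)
    for item, amount in chocolate_db.execute("SELECT item, amount FROM orders")
]
# Transform: the CSV delivers text, so cast amount to a number.
rows += [("biscuit", r["item"], float(r["amount"])) for r in csv.DictReader(biscuit_csv)]

# Load: write everything into one warehouse table with a common shape.
warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE sales (source TEXT, item TEXT, amount REAL)")
warehouse.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
warehouse.commit()

by_source = warehouse.execute(
    "SELECT source, SUM(amount) FROM sales GROUP BY source ORDER BY source"
).fetchall()
print(by_source)
```

Once the differently shaped sources share one table, a single query can report across all of them, which is the point the comment is making.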

    • @ravinaikwadi9899
      @ravinaikwadi9899 2 years ago +4

      @ChrisSmithFW Yeah, but he forgot to mention so.

    • @Morgue12free
      @Morgue12free 1 year ago +7

      I believe that's what he said. His explanation is just a lot more understandable than yours.

    • @Jishnu_OnTheRocks
      @Jishnu_OnTheRocks 1 year ago +2

      Your answer sounds like it was quoted from an NCERT textbook, and his is more like a next-door tuition teacher's.

    • @Lividbuffalo
      @Lividbuffalo 10 months ago

      Wtf

  • @patrickschardt7724
    @patrickschardt7724 3 years ago +53

    I think because of your clear and concise points and humor, I learn more from you than other Excel tutorial channels.
    Keep up the great work.

    • @chandoo_
      @chandoo_ 3 years ago +1

      Aww.. that means a lot Patrick :)

  • @DemetriPanici
    @DemetriPanici 3 years ago +31

    This video did a great job of helping me learn the distinction between these 3 things. Love it!

    • @chandoo_
      @chandoo_ 3 years ago

      Thank you Demetri... 😍

  • @kiriharanmohan1259
    @kiriharanmohan1259 1 year ago

    Even a person who is at the earliest stages of his data career would understand this. Thanks a lot.

  • @powerbis.1794
    @powerbis.1794 2 years ago

    Without any flattery: the BEST explanation of this topic I have ever encountered on YouTube!

  • @alvaromp1106
    @alvaromp1106 1 year ago +1

    Very good, thanks!
    What I got is that it is more of a conceptual difference than a technical one, though I understand there must be some infrastructure nuances.

  • @joyaljjoy
    @joyaljjoy 2 years ago +1

    There was no PPT, just a sweet and crisp explanation of the topic using a notebook. 👌🏽 Loved it.

    • @chandoo_
      @chandoo_ 2 years ago

      Thank you Martin 😀

  • @krishkam186
    @krishkam186 2 months ago

    I love how you indirectly went into explaining facts and dimensions for the database.
    A suggestion: it would be very helpful for these 3 concepts if you explained NoSQL and file-based vs table-based storage.
    Of course there's a lot more to it, but a simple summary and the benefits will do.

  • @ZAN29
    @ZAN29 1 year ago

    It's magic that I found you. Thank you so much for explaining, in simple words, things that seem difficult at first glance.

  • @BananaChan1010
    @BananaChan1010 1 year ago

    I love how this man explains things

  • @dishaagrawal259
    @dishaagrawal259 2 years ago +1

    This was the best explanatory video that I came across!!

    • @chandoo_
      @chandoo_ 2 years ago

      Glad it was helpful!

  • @mmiltenburg
    @mmiltenburg 1 year ago

    Very nice for non-data specialists. I was searching for a basic explanation and that's what you gave me!

  • @venwhen8567
    @venwhen8567 2 years ago +3

    Data gods have smiled. Thank you Chandu!!!

    • @chandoo_
      @chandoo_ 2 years ago

      Thank you Venwhen.

  • @Wellness-100
    @Wellness-100 1 year ago

    Awesome video of comparison/differences! The explanation was very easy to follow! Thank you! Also the puns are hilarious! Keep the content coming.

  • @The-Right-is-Right
    @The-Right-is-Right 2 years ago +1

    Chandoo...you have a divine gift for explaining things so clearly. The drawings help so much too. I wish I had found your channel sooner.

  • @HariMuppa
    @HariMuppa 2 years ago

    Nice illustration. Since I entered IT, I have struggled a lot to understand the difference between these 3 components. You have cleared all my doubts... Thank you very much.

  • @elishaa2096
    @elishaa2096 3 years ago

    Thank you so very much, sir. Now I can comfortably create my web app.
    The irony is that about 2 months ago I searched for the difference, to no avail. Now YouTube just suggested this to me while I was busy watching comedies.
    Eventually, everything falls into place. It just takes time.

  • @khangnguyendac7184
    @khangnguyendac7184 1 year ago

    Wow, this video explanation is very easy to understand & really helpful. Just by watching this video 2-3 times, I already have a big picture about the data lake, database, data warehouse in my mind. Thank you so much for making the video!!!

    • @chandoo_
      @chandoo_ 1 year ago

      Glad it was helpful!

  • @sohanpatel1998
    @sohanpatel1998 2 years ago

    OMG!!. Never knew this thing could be made this interesting. Great Humour! Subscribed !!

  • @GratefulThird
    @GratefulThird 2 years ago

    FANTASTIC explanation!
    Now I realize I have created all three of these over the years. I wish I had understood these concepts better back in the day.

    • @chandoo_
      @chandoo_ 2 years ago +1

      Glad it was helpful!

  • @mukulrana1616
    @mukulrana1616 2 years ago

    Best Explanation on the internet!

  • @mada881010789
    @mada881010789 1 year ago

    Just wanted to say I love you so much and I appreciate your effort to educate us on this type of stuff.

  • @EternalEvanesce
    @EternalEvanesce 2 years ago +1

    Thanks Chandoo.
    The YouTube algo was brilliant today, suggesting me this goldmine!

  • @sandeep4uin
    @sandeep4uin 1 year ago

    Thanks Chandu for making these concepts so simple to understand. Whenever I get confused I just refer to your videos for quick and accurate understanding of the concepts.

  • @gintomino4136
    @gintomino4136 2 years ago

    This is the clearest and most understandable explanation, in layperson's terms. Thank you!

  • @srinivasansoundararajan8826
    @srinivasansoundararajan8826 1 year ago

    AWESOME, SIMPLIFIED EXPLANATION THANK YOU SO MUCH.

  • @fatimasaleem6463
    @fatimasaleem6463 10 months ago +1

    I really appreciate the effort and time you put into your video. TBH, your video is on point and very interesting. I never thought that someone could explain these topics so easily. May God bless you and give you more power so you can make more videos for us.

  • @asadullahmalik1503
    @asadullahmalik1503 10 months ago +1

    Excellent video, with great and user friendly explanation. Loved it

  • @akshaypatil8155
    @akshaypatil8155 1 year ago

    The best video which explains behind the scenes....simple language and simple example....All the best for ur future endeavors...

  • @zwelimjanepatric1584
    @zwelimjanepatric1584 2 years ago

    I have always struggled to understand what a datawarehouse is but this video made it so simple to understand thank you

  • @osasueghaghe4040
    @osasueghaghe4040 1 year ago

    With your explanation, I am confident that I can get an A in this course. 😊

  • @busyshah
    @busyshah 3 years ago

    Simplicity is the utmost form of sophistication.
    Subscribed

    • @chandoo_
      @chandoo_ 3 years ago +1

      Welcome Shahnawaz... 😀

  • @eshwarsai5027
    @eshwarsai5027 1 year ago +2

    One of the finest explanations. 👍
    Loved it ❤️

    • @chandoo_
      @chandoo_ 1 year ago +1

      Glad you liked it!

  • @rajeshbhosale2008
    @rajeshbhosale2008 1 year ago

    Thanks, Chandoo, for this humorous primer on these database buzzwords! Keep posting such conceptual nuggets in your signature style! 👍🏻

  • @edgarmartinez2710
    @edgarmartinez2710 3 years ago +3

    I had no idea about data warehouse or data lakes. Thanks Chandoo for sharing your knowledge and the great breakdown of each.

  • @victorbegnini5754
    @victorbegnini5754 10 months ago +1

    Best video there is about the topic! 🎉
    Thanks, man

    • @chandoo_
      @chandoo_ 10 months ago

      Glad you liked it!

  • @machinimaaquinix3178
    @machinimaaquinix3178 1 year ago

    Short, sweet and right on point to help quick learning, you got a new sub!

  • @prutwo2
    @prutwo2 3 years ago +2

    I love how you explained these terms so simply! On a side note, BigQuery is more of a Data Warehouse than a Data Lake.

  • @IyadKhuder
    @IyadKhuder 1 year ago

    A super simple and understandable explanation! Thanks man! Thumbs up!

  • @morris5984
    @morris5984 2 years ago +1

    Just found your channel. I’m sharing your videos with my team that is a bit behind on these concepts. Thanks!!

  • @wallysonruan4246
    @wallysonruan4246 1 year ago

    You just won another fan and subscriber. Nice content, Chandoo. Your humor is well dosed too.

  • @rblanche5634
    @rblanche5634 8 months ago

    Brilliant.
    Well explained, clear, simple and concise.
    Thank you very much

  • @juanpimentel5577
    @juanpimentel5577 1 year ago

    Clear, concise, and to the point. Thank you so much for sharing your knowledge!

  • @nagendragitta7310
    @nagendragitta7310 2 years ago

    Simply superb... Great explanation with a general example.

  • @vivekvenugopal7519
    @vivekvenugopal7519 2 years ago +1

    Hi, your video makes the basics very clear... but I still have a question, please clarify.
    1) Why can't ETL take the source and target tables within the database itself to create reporting or historical data tables? Why do we need to load into another database and call it a data warehouse? Is there any significant difference, like performance? Please explain this part... You explained "why we use each type", but I would like you to cover why we can't use one instead of another. E.g., why can't we create historical tables in the database itself, and why do we need a separate data warehouse? What is special about going to a DW instead of a DB? Also, why a DL, and why not a DW?

    • @chandoo_
      @chandoo_ 2 years ago +4

      The answer is more technical.
      While you "CAN" keep both DB & DW kind of tables in the same place, normally people don't do it. Because,
      1) Databases are "designed" so that they can add / change or delete data very quickly and efficiently. They also ensure that your data integrity is maintained (if a customer is deleted, they can no longer transact for ex.)
      2) Data warehouses are "designed" so that they can add data and generate reports (or summaries) quickly. As there are usually no delete or edit operations in DW, the system is optimized to instead focus on piling up data and doing massive calculations quickly.
      3) Hence, Internally the architecture and software / hardware design is different for these two systems. So it makes sense to keep them separate.
      Think of it like this. While both a car & tractor can drive, you won't use them interchangeably as they have their strengths & weaknesses.
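
Point 1 in the reply above mentions data integrity: once a customer is deleted, they can no longer transact. A minimal sketch of that rule with `sqlite3` follows. The `customers` and `orders` tables are hypothetical, and a foreign key is only one way a database might enforce it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when asked
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE orders ("
    " id INTEGER PRIMARY KEY,"
    " customer_id INTEGER NOT NULL REFERENCES customers(id),"
    " amount REAL)"
)

# Create a customer, then delete them again.
conn.execute("INSERT INTO customers (id, name) VALUES (1, 'Asha')")
conn.execute("DELETE FROM customers WHERE id = 1")

# An order for the deleted customer should now be refused by the database.
try:
    conn.execute("INSERT INTO orders (customer_id, amount) VALUES (1, 9.99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print(rejected)
```

Checks like this run on every write, which suits a transactional database; a warehouse that mostly appends and summarises is tuned for different work, as points 2 and 3 describe.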

    • @vivekvenugopal7519
      @vivekvenugopal7519 2 years ago

      @chandoo_ Thanks for your reply. I wanted to know the "design" behind the strengths and weaknesses of each type. I understand it is very technical and cannot be covered in a text reply. Thank you so much for explaining this and helping us understand the differences.

  • @babruvahanaa
    @babruvahanaa 1 year ago

    Very easy and clear explanation of DB/DW/DL :)

  • @sudhakarthati1539
    @sudhakarthati1539 1 year ago

    I liked your video because of your clear and concise points and humor. Keep up the great work.

  • @m_subir
    @m_subir 2 years ago

    Very simple yet effective articulation!!

  • @Ravi-Krishna
    @Ravi-Krishna 2 years ago

    Your teaching style is simple and superb, thank you.

  • @nerelladhanalakshmi7583
    @nerelladhanalakshmi7583 2 years ago

    Good explanation! Expecting more like this..

  • @luanmoreno9863
    @luanmoreno9863 2 years ago

    Thanks a lot! That was one of the best explanations I ever heard. Sometimes I think people want things to be difficult so they seem more intelligent...

  • @keeloraz9452
    @keeloraz9452 2 years ago

    Hey Chandoo. Great video, but also I’m so happy to have found your channel and see you speak as over so many years your website has been solving my excel queries when I Google them.

  • @giridatta1525
    @giridatta1525 2 years ago

    First time watching your video. I am unlucky that I didn't find it until now. You are doing a great job, thank you so much. Keep doing more...

  • @karthiksridharan1282
    @karthiksridharan1282 1 year ago

    Wow. It's easy to understand. You are a genius

  • @bmcseal01
    @bmcseal01 2 years ago

    Wow, I didn't think I'd learn anything, but I learned some more about OLAP (DW) vs OLTP (DB).

  • @mewguy69
    @mewguy69 3 years ago

    Omg u were that guy who owns that website which helped me in my early career.

  • @vittal_rao
    @vittal_rao 1 year ago

    Easy and simple explanation which makes us clear about the concept. Great video..

  • @GR-yy9jn
    @GR-yy9jn 1 year ago

    Love this! Thanks for explaining it in the easiest manner and with such simple words.

  • @rbogomil
    @rbogomil 2 years ago +3

    Great job, clear explanation and I also enjoy your humor. Would be great if you could create a video describing the difference between data scientist, engineer, analyst and architect. Kudos on your excellent work!

    • @ryanshannon6963
      @ryanshannon6963 2 years ago +1

      If you're starting in I.T. doing analysis type work, you'll start as an Analyst. This can be anything from reporting, automated feed maintenance/RCA, and even development. Most of the above 3 (maybe save for Data Scientist) start here.
      Data Engineer is probably the most logical next step from analyst. You'll definitely be doing more development and analytical work as an analyst prior to this. This shifts your scope from retrieving data from a data warehouse/db/lake (lake is quite rare for a run of the mill analyst), to actually designing and some possible light architecting of table/schema structures for data to import into from other sources (typically starting as transactional information into a database from an app, or maybe an external source of some sort). Typically as an engineer you won't start on data warehouse modelling until you've had some experience with general transactional architecting/engineering since the data within a warehouse shouldn't be updated/deleted, only inserted. It will be deleted, possibly if you've archived it in some situation (like data that's over x-years old and based on specific policies), but even then it probably wouldn't be deleted. If the architecture allows, you may just duplicate the tables, or partition them in some way and then archive the older pages. They may also determine certain structural recommendations (rowstore vs columnstore table structures, for example, or using NoSQL vs relational databases), but usually it's in concert with an Architect if the process being designed is large enough, or has significant impact, especially in terms of performance. However, after discussions between Engineers and Architects, the Engineers (and to a lesser extent, Analysts) will IMPLEMENT the requisites of decided Architecture. Engineers are typically more hands on than Architects, but Archs may get their hands dirty if something is largely conceptual and they want to start plugging away earlier in the phase to ensure design solidity.
      Data Architect is anything from designing the schema for your transactional infrastructure (your primary database), data warehouse, or even data lake, as well as helping navigate and determine how to import data into those repositories, as well as even more expansive things such as CI/CD pipelines, *maybe* networking tasks if you're familiar enough with that (usually system administrators do that, though), or even helping implement connection string/authentication against your cloud resource targets originating from nearly any source caller (on premises machine, like a developer computer, a VM hosting an app service, CI/CD agent, or a completely separate cloud service not native to your cloud service, even on a completely different domain or client server).
      An Architect is going to be responsible for HOW disparate system objects are going to interact with each other and any potential issues given certain implementations or design sequences. Typically Architects are going to have some knowledge as to what different approaches are available and determine which makes sense given what's required for the need or problem that needs resolution. As an Architect you're not expected to know how to implement everything as if you were doing all the work yourself. However, having a basic understanding of the limitations of each element in the design will definitely help you determine which is possible and which may not be earlier in design phase, which helps mitigate wasted developer time later during spikes (Proof of Concept phases) and help with further engineering alignment tasks.
      Most people consider scientists the babies in the room, because the data they require should be perfect: it should need no changes to its representation beyond what the algorithmic modelling itself requires. It's entirely possible a Scientist will ask the Engineer to modify schema and data to accommodate some sort of analysis or data modelling they're trying to complete. It's not atypical for an Engineer to work closely with a Scientist, but it is not typical for the Scientist to work with the Architect, aside from the initial standing up of a new Data Warehouse or Data Lake. Typically the Engineer maintains or makes the everyday changes to those structures once the inputs, outputs, and transformational processes have been established. Scientists are typically Statisticians or people from any field of applied mathematics. They will also typically work with code that isn't strictly SQL, such as Python, R, Power BI, DAX (maybe MDX, but I think that's largely fallen by the wayside), etc. Scientists are tasked with supplying the answers to complex problems for the business using quantitative analysis. These are the people who determine which ads you may see given your previous and most recent search history. Something you searched for 3 years ago may not be as relevant as something you searched for yesterday. That would be a typical example of what a Scientist may do. Things like Google Translate will also be developed by the Scientist, but the Architect will design the bridges to source that data, whereas the Engineer will make that design a reality. The Analyst will make sure the data makes sense as it starts trickling through the design process; if there are any issues, the Analyst, maybe working with the Engineer, will troubleshoot the why and how and determine a fix, which either of them may implement to ensure it works as intended.
      If you look at it as a decision tree, it may look something like:
      Analyst > Engineer > Architect
      Analyst > Engineer > Scientist
      Analyst > Scientist (again, typically short cut by a Masters in Statistics or similar)
      Hope that helps!
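
The reply above mentions rowstore vs columnstore table structures in passing. As a rough illustration of the idea only (real engines add compression, indexing, and on-disk layout), the same hypothetical records can be held row by row or column by column in plain Python:

```python
# Hypothetical sales records, used for illustration only.
row_store = [
    {"id": 1, "product": "chocolate", "qty": 2},
    {"id": 2, "product": "biscuit", "qty": 5},
    {"id": 3, "product": "chocolate", "qty": 3},
]

# Column layout: one list per column, aligned by position.
column_store = {
    key: [row[key] for row in row_store] for key in ("id", "product", "qty")
}

# Row layout suits "fetch or change one whole record".
record = row_store[1]

# Column layout suits "aggregate one field over every record":
# only the qty list is read, the other columns are never touched.
total_qty = sum(column_store["qty"])

print(record)
print(total_qty)
```

This is why the choice tends to follow the workload: transactional systems usually lean toward rows, analytical ones toward columns.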

  • @angelinagokhale9309
    @angelinagokhale9309 1 year ago

    Thank you very much for this illustrative explanation! Very easy to comprehend indeed!

  • @thaith6704
    @thaith6704 2 years ago

    Your video is very helpful; it cleared up my confusion about DB, DW, and DL. Thank you very much!

  • @ravikumarkumashi7065
    @ravikumarkumashi7065 2 years ago

    I don't think there are any other videos on the internet that explain database/data warehouse/data lake like you did. Thanks for your explanations.

  • @lilig9239
    @lilig9239 2 years ago

    The best explanation that I heard. Thanks!

  • @JackMcMotivate
    @JackMcMotivate 2 years ago

    Your video is the only one that made this clear to me.. thank you teacher!

  • @JovyIgnatius-g1d
    @JovyIgnatius-g1d 1 year ago

    This was a great explanation in a simple, clear, and concise manner.

  • @ankur1129
    @ankur1129 2 years ago

    This video is a piece of ART! Awesome work :) :)

    • @chandoo_
      @chandoo_ 2 years ago

      Thank you so much 😀

  • @Edwinn100
    @Edwinn100 2 years ago

    Just the way I like it, Barney style! Amazing job!!!

    • @chandoo_
      @chandoo_ 2 years ago

      Thank you so much!

  • @HatCross
    @HatCross 2 years ago

    DL explained so simply - Thank you Chandoo

  • @francksgenlecroyant
    @francksgenlecroyant 2 years ago

    Perfect explanation. I immediately subscribed 👊👊👊

  • @baderalsahli3619
    @baderalsahli3619 3 years ago

    my best mentor ever

  • @naazlyhameed8468
    @naazlyhameed8468 2 years ago

    Awesome.. understood data lake for the first time

  • @maryamfatima5882
    @maryamfatima5882 1 year ago

    very well explained, crystal clear, and superb way of explanation in between videos and images. Amazing.

  • @landifa
    @landifa 2 years ago

    You should add: data hub, delta lake, lake house, data virtualization.... A neverending story :)

  • @VivekKBangaru
    @VivekKBangaru 8 months ago

    Thanks Man, I started learning big data concepts and this video is very useful for me

  • @Dewdrops0010
    @Dewdrops0010 3 years ago

    Excellent Thank you sir - I am non tech from product side but this makes it clear for me - THANKS Again

  • @c0mbat15
    @c0mbat15 2 years ago

    Great explanation. I've personally found data warehouses rarely service multiple areas of an organisation. They are built with the influence of one area in mind and are therefore deemed not fit for purpose for other areas. That means other areas are left running their own jobs on the databases. If data lakes are meant to provide a partial solution for that issue, then great. But they do run into the issue of huge tech debt at that point.

    • @chandoo_
      @chandoo_ 2 years ago +2

      Thanks Carleast. Many organizations also implement "data marts", kind of like topic / theme specific data warehouses. This opens another can of worms where data is duplicated and often inconsistent.

  • @chaitufomo2535
    @chaitufomo2535 2 years ago

    Man that was good way to explain stuff. I like it and understood in a better way.

  • @mayank.kr.30
    @mayank.kr.30 4 months ago

    Great explanation and very easy to understand example.

  • @Ryankubo123
    @Ryankubo123 1 year ago

    Great explanation and awesome video! Very helpful!

  • @abhishekprelog
    @abhishekprelog 2 years ago

    This is the first video I saw on your channel and it made me instantly subscribe. Brilliant explanation.

    • @chandoo_
      @chandoo_ 2 years ago

      Thank you and welcome aboard Abhishek.

  • @aZnPriDe707
    @aZnPriDe707 2 years ago

    Solid explanation. We can always count on Indian uncles for STEM

  • @Testkh1
    @Testkh1 2 years ago

    Excellent explanation... Can't resist myself to appreciate your efforts publicly...

    • @chandoo_
      @chandoo_ 2 years ago +1

      Thank you so much 😀

  • @BooksNtalks2022
    @BooksNtalks2022 2 years ago

    Your videos are wonderful and so easy to understand. Also your sense of humor 😂😂...loving it.

  • @sn5229
    @sn5229 1 year ago

    So well explained! Every second worth watching! Thanks a lot.

  • @KaySchneutzer
    @KaySchneutzer 1 year ago

    Thanks for this very clean and simple explanation - really helpful.

  • @samueltsadiq1222
    @samueltsadiq1222 1 year ago

    Excellent. Thank you. You are a great teacher Sir.

  • @roomuser
    @roomuser 2 years ago

    Thanks a lot.. well depicted.. now it’s clear to me about the grey area.

  • @secretnobody6460
    @secretnobody6460 1 year ago

    Correct me if I'm wrong, but I see the database as the source of live dashboards? It is the representation of the data currently in use.
    A data warehouse is like a storehouse for the historical data that the database has produced. It's like a backup copy of the data?
    And a data lake is like cloud storage where you just store all kinds of data randomly, just for the sake of storage? It may contain data warehouse data, tables, isolated tables, reports, etc.?
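
On the data lake part of the question above, one way to picture "store all kinds of data" is a folder of raw files in mixed formats that are only interpreted when someone reads them, an approach often called schema-on-read. The sketch below is a toy under stated assumptions: a temporary local folder stands in for lake storage, and the file names and the `read_any` helper are hypothetical.

```python
import csv
import json
import tempfile
from pathlib import Path

# A temporary folder stands in for lake storage; file names are hypothetical.
lake = Path(tempfile.mkdtemp())
(lake / "orders.csv").write_text("item,qty\ncookie,2\ncracker,1\n")
(lake / "clicks.json").write_text(json.dumps([{"page": "/home"}, {"page": "/cart"}]))
(lake / "notes.txt").write_text("call supplier on Monday\n")


def read_any(path: Path):
    """Interpret a raw file only when it is read (schema-on-read)."""
    if path.suffix == ".csv":
        with path.open(newline="") as handle:
            return list(csv.DictReader(handle))
    if path.suffix == ".json":
        return json.loads(path.read_text())
    return path.read_text()  # unknown formats stay as raw text


contents = {p.name: read_any(p) for p in sorted(lake.iterdir())}
print(sorted(contents))
print(contents["orders.csv"][0])
```

Nothing is validated or reshaped when the files are written, which makes a lake cheap to fill and puts the work of making sense of the data on whoever reads it later.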

  • @MartinCGaudet
    @MartinCGaudet 1 year ago

    Super clear. Very very well explained.

  • @dominiquez5643
    @dominiquez5643 1 year ago

    Thank you so much for the time put in your videos! extremely helpful!