What is Data Pipeline | How to design Data Pipeline ? - ETL vs Data pipeline (2024)

  • Published: 21 Nov 2024

Comments • 1.2K

  • @sivanagarajugamidi
    @sivanagarajugamidi 3 years ago +425

    Master piece tutorial for data engineering

    • @ITkFunde
      @ITkFunde 3 years ago +8

      Thanks Siva

    • @indexima6517
      @indexima6517 3 years ago

      hey! don't hesitate to follow us and to take a look at our videos which deal with the same topics :)

    • @vijayjayaram606
      @vijayjayaram606 3 years ago

      @@indexima6517 I guess the videos on ur channel deals with more on, what do we do after receiving the data, analytics if I understand correctly.
      Here, its more of pumping the data from one place to a common place, and make it available for interested people down the lane

    • @gautamdeusa
      @gautamdeusa 2 years ago

      @@ITkFunde It's truly one of the finest and easiest video to follow and relate. Many thanks. Will check other videos.

    • @lwhieldon1
      @lwhieldon1 2 years ago

      Thank you for breaking down concepts that are difficult to understand!

  • @nathancarranza9860
    @nathancarranza9860 3 years ago +613

    Something I’ve noticed is that Indians are good teachers and give great illustrations. Good work. Greetings from the US.

    • @ITkFunde
      @ITkFunde 3 years ago +72

      Thanks Nathan for making me feel even more proud of being an Indian thank you for the compliment means a lot brother 🙏😊

    • @fsfernandes20
      @fsfernandes20 3 years ago +24

      Yes Indians like to make difficult concept easy

    • @nathancarranza9860
      @nathancarranza9860 3 years ago +30

      I don’t use my real name online, but I do give real compliments.

    • @AutitsicDysexlia
      @AutitsicDysexlia 3 years ago +13

      @@nathancarranza9860 Plot Twist: His real name was not Nathan. It was always Vladimir Putin.

    • @visionxx8656
      @visionxx8656 3 years ago +14

      Can't believe Putin is from US

  • @ericdasse8174
    @ericdasse8174 3 years ago +62

    That was great! As a data engineer in the making, this is the first time I have understood the concept of data pipelines so clearly. Thank you very much

    • @lukmanaliyu7386
      @lukmanaliyu7386 2 years ago

      Hello Eric, I'd love to know how it's going for you at the moment with the DE track

  • @altamashjawad6691
    @altamashjawad6691 3 years ago +65

    Loved this video, probably the best explanation on advanced data pipeline out there. If in your next videos, maybe create a playlist which can show each of the section of this pipeline in detail with little examples using Python or any language etc. Just an idea, brilliant work!

  • @lcsxwtian
    @lcsxwtian 3 years ago +3

    Simply one of the best videos on data pipeline on YouTube. Deserves so much more attention.

  • @MrBignate12345
    @MrBignate12345 4 years ago +35

    Please continue to create videos like these! So easy to understand. Love your visual teaching style and the examples you give.

    • @ITkFunde
      @ITkFunde 4 years ago +1

      Thank you MrBignate...The aim is to simplify these techie jargons for everyone to correlate and enjoy learning.

  • @alexanderulloaopazo6275
    @alexanderulloaopazo6275 4 years ago +11

    Thank you! I had read a lot of papers about Data Pipeline, but I couldn't get the main idea. However, your video was so easy to understand!! Now I have a better picture of the complete process. Thanks again.

    • @ITkFunde
      @ITkFunde 4 years ago

      Thank you Alexander !!!

  • @bigglesharrumpher4139
    @bigglesharrumpher4139 1 year ago +9

    Great video - it seems that while technology has advanced, the concepts of batch loads and real-time data are actually decades old. Back in the early 2000s we controlled all ETL and real-time loads with Unix, DOS or SQL scripts that provided return codes for success/failure, which triggered alert emails, and we had KPIs for data quality, back-out jobs for failed loads, and many other control systems. It just seems there is more 'out-of-the-box' software to handle these now, as opposed to custom-built solutions. Great presentation!

  • @mardiidking4030
    @mardiidking4030 1 year ago +8

    This topic is so complex as a beginner, but I understand this explanation so well. I didn't even have to go back in the video or rewatch it to understand. This is beautiful.

    • @ITkFunde
      @ITkFunde 1 year ago +1

      Thank you so much for your kind words and support 🙏🙏♥♥

  • @ramakambhampati5094
    @ramakambhampati5094 1 year ago +2

    You are a real "Data Pipeline Spiderman".... fantastic instructor..please share more videos....thanks

    • @ITkFunde
      @ITkFunde 1 year ago

      Thanks Rama ☺️☺️

  • @prabur4027
    @prabur4027 3 years ago +25

    This would be the Best start for the Data Engineers.. A clear precise and short pictorial representation of Data Pipeline (Basics). Best video so far I had seen.. 😊 Thanks.. Much Appreciated.. 👍

    • @ITkFunde
      @ITkFunde 3 years ago +1

      Thanks Prabu 👍☺️🙏

    • @vivekjoshi3769
      @vivekjoshi3769 2 years ago

      Do data analysts also use data pipeline creation in their jobs ? Or are they expected to know it ?
      Asking as some companies write knowledge of ETL in JDs.

    • @prabur4027
      @prabur4027 2 years ago +1

      @@vivekjoshi3769 knowing any of the ETL tools would help in constructing the pipelines and they can visualize data flow from source to target.. Yes mostly it is used..

  • @ramakrishnachimmani7273
    @ramakrishnachimmani7273 4 years ago +7

    Thank you. The best way of explanation. I was looking for this kind of video for long time. As a traditional ETL developer, I questioned my self, why people are using a term called 'Data pipeline' though we have ETL process and what is the exact difference between them. Thanks again.

    • @ITkFunde
      @ITkFunde 4 years ago

      Thanks Rama for your positive feedback !!

  • @juliansihite1289
    @juliansihite1289 2 years ago +8

    This guy really explain everything clearly and simple!
    Good job brother, keep sharing and contributing! You're a great teacher :)

    • @ITkFunde
      @ITkFunde 2 years ago

      Thanks Julian 😊❤️🙏

  • @ravirty8962
    @ravirty8962 3 years ago +1

    Simply superb tutorial with good example.

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Ravi☺️

  • @sy-vf4js
    @sy-vf4js 3 years ago +34

    And again, another easy-to-digest video. Thumbs up!

    • @ITkFunde
      @ITkFunde 3 years ago

      Thank you 🙏🙏☺️

  • @othmanbelmouzouna3893
    @othmanbelmouzouna3893 3 years ago +2

    Very good tutorial with valuable explanations. Thanks.

  • @squarehead6c1
    @squarehead6c1 3 years ago +23

    Great intro, just what I needed. I learned the distinction between ETL and general pipe lines, and Kafka's place in the architecture.

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Ronnie☺️

  • @rajguru1998
    @rajguru1998 3 years ago +1

    Finally understood the pipeline in 10 mints... thank u

  • @kenford3738
    @kenford3738 3 years ago +6

    Great job explaining the difference between Data Pipelines and ETL.

    • @ITkFunde
      @ITkFunde 3 years ago +1

      Thanks Ken 🙏☺️

  • @empressbelless3232
    @empressbelless3232 3 years ago

    This is meant to be a compliment. I appreciate how articulate your English is with each word you speak! Easy to listen to!

  • @MrBignate12345
    @MrBignate12345 4 years ago +30

    Would love to learn more about how to choose the right frameworks/technologies for data pipelines and data warehouses/lakes for differing requirements. It would be nice to see a playlist of you designing or comparing solutions for an analytic stack.

    • @ITkFunde
      @ITkFunde 4 years ago +10

      Thanks MrBignate I have created various playlists one of which is " Crunching Data Series "...I will surely make more videos on similar topic. It is because of encouragement from audience like you which helps me move forward so thanks and really grateful for your positive feedback.

  • @arunachalampalani4321
    @arunachalampalani4321 1 year ago +1

    Couldn't have asked for more. Very well explained, Thank you mate.

  • @KolawoleAdekoya
    @KolawoleAdekoya 1 year ago +3

    Simplified and clear explanation of the concepts. Great diction and presentation. Well done!

  • @mitchelleleeuw2266
    @mitchelleleeuw2266 1 year ago +2

    ☺️I’m new in Data Engineering and man you created a clear picture of what I’ve been learning and trying to understand 🙂love this… definitely subscribing 🤩

  • @K0n5tant
    @K0n5tant 3 years ago +5

    Your way of explaining these concepts is excellent, thank you!

  • @damiiete
    @damiiete 2 years ago +2

    Great explanation for introduction to data pipelines. Thanks for clarifying the distinction between ETL and data Pipelines.

  • @kalyanchakri5258
    @kalyanchakri5258 4 years ago +4

    Love your way of teaching in a simple understandable concepts. Im mad of you..!

    • @ITkFunde
      @ITkFunde 4 years ago

      Thanks Kalyan for your feedback, it helps a lot.

  • @haydarissa9371
    @haydarissa9371 2 years ago +1

    Very elegant way to explain data pipelining and ETL approach. I appreciate the examples given especially the master data management. Well done.

  • @Gridblue
    @Gridblue 2 years ago +3

    Thank you for the video, I learnt what data lake hydration projects are, my previous company had no proper KT, I struggled to grasp what I was doing. This was very nicely explained and cleared the doubts that I had.

  • @hasinirajapaksha333
    @hasinirajapaksha333 1 month ago

    in my life this is the best explanation I ever heard ,PERFECT .keep doing that good luck sir.🙏

  • @brentcos9370
    @brentcos9370 3 years ago +4

    Very informative, especially for a non-computer science guy like myself. Thanks!

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Brent that is the essence of this channel - Making I.T. interesting for everyone.

  • @janbazahmedkhan8023
    @janbazahmedkhan8023 2 years ago

    Brother, nobody has explained this in such a lucid style. Great work!!

  • @chocochipbananasplit
    @chocochipbananasplit 4 years ago +3

    I got more out of your video than reading 5 articles on the matter! Your content is great!

  • @AbelAkeni
    @AbelAkeni 2 years ago

    Succinct, presented with clarity! Beginners, get in here! 👏🏾👏🏾👏🏾

  • @Manoj419419
    @Manoj419419 4 years ago +3

    Great explanation and examples used. Thanks a ton !!

  • @sripathysrinivas4579
    @sripathysrinivas4579 1 year ago +1

    Fantastic!!! Thanks for your time and explaining the basics!!!

  • @ravinduabeygunasekara833
    @ravinduabeygunasekara833 3 years ago +3

    This is superb!. I am very strange to Data Engineering, and this video gave me a super insight! Keep up the good work

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Ravindu ☺️

  • @hasmilaomar5562
    @hasmilaomar5562 2 years ago +1

    It is good that u explain the concept of data pipeline by referring to water pipeline. So much easier to understand and remember. Thank you for your video!!

  • @dhritimanbnrj
    @dhritimanbnrj 3 years ago +12

    best productive 10 minutes of my life.

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Dhritiman for this super comment you made my day 🙏☺️

  • @smitadash8113
    @smitadash8113 2 years ago +1

    Very crisp and clear explanation of data pipeline. Thank you very much for explaining in detail. Much helpful.

  • @JibrilLamai
    @JibrilLamai 3 years ago +3

    This is a very good explanation and the best I have seen so far in my quest to understand this concept. Thank you very much. Now I can confidently visualize and explain the same concept with ease and a great understanding of it.

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Jibril glad it helped 🙏☺️

  • @sabysreya
    @sabysreya 3 years ago +2

    Simple and super easy to understand 👌👍👏👏

  • @knorth2386
    @knorth2386 4 years ago +3

    Hi Anshul, your video was helpful. I have experience with ETL but didn't know that it was a specific type of data pipeline. Thanks for showing the different type of systems and technologies used for the concept visual that you explained with.

    • @ITkFunde
      @ITkFunde 4 years ago +1

      Thank you Kyle coming from an experienced guy means a lot. Hoping for continued support !!

  • @asthagoel3000
    @asthagoel3000 2 years ago

    Big concepts explained very quickly in an easy to understand manner. Thanks!

  • @hussamcheema
    @hussamcheema 4 years ago +3

    Excellent Explanation. Keep making more videos regarding Data Engineering, AI, and Data Science.

    • @ITkFunde
      @ITkFunde 4 years ago

      Thanks a lot mate for your feedback and suggestion!!

  • @joshuaabok3329
    @joshuaabok3329 2 years ago

    Amazing analogy. Amazing explanation of data pipeline. This is just awesome.

  • @sid1r
    @sid1r 3 years ago +3

    Thank you so much for a great and easy to understand data pipeline introduction. I love how you focus on the concepts and not jargons, as it allows for people to understand the essence of data pipeline.

  • @elifylmaz7940
    @elifylmaz7940 1 year ago +2

    Wow, I think I just watched one of the best explanation videos of my life. You did an amazing job! The way you structure the details and use cases, and the real-world examples you give, made a lot of sense to me. Thank you so much!

    • @ITkFunde
      @ITkFunde 1 year ago

      Thanks Elif for your kind words means a lot ☺️🙏

  • @FIRE_EVERYTHING
    @FIRE_EVERYTHING 1 year ago +4

    Excellent high level overview Anshul, I appreciate that you differentiated between batch data and real time data with the Lambda Architecture as it seems most applicable to modern organizations. Your explanation of dashboards as consumers was also very realistic. Your video helped me better understand the general steps in the process. +1 Subscriber.

    • @ITkFunde
      @ITkFunde 1 year ago

      Thanks Matthew for supporting ❤️

  • @Aditya-zv5et
    @Aditya-zv5et 4 months ago

    really a great video for someone who is trying to understand data pipeline

  • @maelherbert321
    @maelherbert321 3 years ago +4

    Really content. Bravo from France 👏👏👏

  • @diemuino
    @diemuino 10 months ago +1

    Excellent explanation and examples, Anshul. Thank you for the video!

    • @ITkFunde
      @ITkFunde 10 months ago

      Thanks Dear

  • @jamesmcmurtry5351
    @jamesmcmurtry5351 3 years ago +6

    Great visual layout. Would love to see this applied to an ELT model with Snowflake and its advantages/disadvantages. Possibly a suggestion on ML complementary tools like Looker and Kraken.

  • @DevOps-AWS55
    @DevOps-AWS55 3 years ago +1

    Awesome Explanation of Data Pipeline

  • @mikebrooks4182
    @mikebrooks4182 3 years ago +3

    Thanks for a great overview of how the Lambda architecture can expedite the delivery of data to data consumers. For future videos, it would be helpful to map this to the roles, responsibilities, and skill requirements needed to manage this environment.

    • @ITkFunde
      @ITkFunde 3 years ago +1

      Thanks Mike for suggestion will try to add this

  • @sunderdase3511
    @sunderdase3511 1 year ago

    A simple and superb explanation about Data pipeline structure. Thanks a lot. Really appreciate!

  • @sourabhsuri8812
    @sourabhsuri8812 3 years ago +12

    Thank you so much brother, for clarifying some of the concepts.. Truly appreciate it. Can you suggest - Which way is the Tech Heading now - Data Warehouse Vs. Data Lake? Are DWH a thing of past?

    • @ITkFunde
      @ITkFunde 3 years ago +13

      Thanks Sourabh. DWH is here to stay; it's not going anywhere. Today the data world has become enormously large, and there is space for DWH and data lakes to coexist. A data lake also cannot solve every business problem. There is a hybrid approach coming up wherein you have your DWH on top of your data lake.

    • @vivek1joshi
      @vivek1joshi 2 years ago +1

      Data Mesh

  • @edsonsabino
    @edsonsabino 2 years ago +1

    Great! The part that I liked the most was the one in which he explained the difference between ETL and data pipeline

  • @pallavimondal2655
    @pallavimondal2655 3 months ago

    I am a newbie to this ETL process, confused with all jargons! This definitely helped to get the picture of it. Keep up the good work

  • @ankesh251
    @ankesh251 3 years ago +1

    Best Video about Data Pipeline. haven't thought its this simple

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Ankesh 🙏☺️

  • @vasudevp1703
    @vasudevp1703 2 years ago

    You are amazing......One of the best explanation....Desperately in need of your full course on this subject...God Bless you

  • @NathJones11
    @NathJones11 2 years ago +2

    A very clear explanation of the differences between the two methods. Often I see everything lumped under an ETL umbrella, when that may not be accurate.

  • @aryanj2435
    @aryanj2435 3 years ago +1

    Awesome tutorial!

  • @KESHA1982
    @KESHA1982 2 years ago +1

    So glad that I've found your channel on YouTube. Thanks a lot!

  • @GonzaloArangoF
    @GonzaloArangoF 2 years ago

    It is a clear presentation for common people. Thanks!!

  • @MGKA-vr8si
    @MGKA-vr8si 2 years ago

    One of the best presentation to know more about data pipeline. thanks.

  • @DanishAnsari-hw7so
    @DanishAnsari-hw7so 2 years ago +1

    Such an awesome explanation, short, crisp and to the point. Great!

  • @issamfakhari3152
    @issamfakhari3152 4 years ago +1

    Great explication!!!!

  • @obiradaniel
    @obiradaniel 2 years ago +1

    Thank you very much, very elaborate and concise. This is important for everyone in the technical data cycle: data engineer, analyst, administrator and data scientist.

  • @meemanikandan
    @meemanikandan 3 years ago +2

    Good one to understand Data Pipeline. Thanks!

  • @francis191
    @francis191 2 years ago

    Clear simple and easy to understand - great presentation

  • @kiransatyan
    @kiransatyan 2 years ago

    So much valuable content in such short duration video... with so much clarity. Awesome !! Thank you !!

  • @sulaimankhan8033
    @sulaimankhan8033 2 years ago +1

    Watched it again & again for clarity - Good !!

    • @ITkFunde
      @ITkFunde 2 years ago

      Thanks ♥️♥️

  • @bhuvaneshsingh3953
    @bhuvaneshsingh3953 2 years ago +1

    If you can explain it simply enough, you have understood it well enough - superb explanation.. 👍

    • @ITkFunde
      @ITkFunde 2 years ago

      Thanks Bhuvanesh means a lot 🙏😊

  • @commonman1271
    @commonman1271 2 years ago +1

    Excellent explanation

  • @essboogy
    @essboogy 3 years ago +2

    This is excellent. Really interesting and easy to follow. I am just starting training with IBM to be a Data Engineer. Leaving healthcare for good!

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks a lot ☺️☺️🙏

  • @amitparmar8076
    @amitparmar8076 1 year ago

    What a superb explanation with simplistic examples and scenario.

  • @bhuvaneshkumar9763
    @bhuvaneshkumar9763 1 year ago +1

    Precise explanation of Data pipe line...👏👏👏

  • @NarendraSharmaa
    @NarendraSharmaa 2 years ago +1

    What a Explanation out of world , please share more such videos on topics like Kafka , Streaming

  • @shirshodatta2886
    @shirshodatta2886 2 years ago +1

    Beautifully explained. Channel subscribed.

  • @nitinahuja2472
    @nitinahuja2472 2 years ago +2

    Worth of time watching IT k Funde videos.

  • @emmanuelafuwape9118
    @emmanuelafuwape9118 2 years ago

    this was such a well detailed explanation of Datapipeline and more. I am so elighted!! Structured so well Thank you!

  • @vijaym4598
    @vijaym4598 3 years ago +1

    excellent bro...very useful information...ur explanation is very understandable....thanks for video.

  • @nikolaysokolov9027
    @nikolaysokolov9027 2 years ago +1

    It's excellent explanation. Thanks!

  • @udaythakur1405
    @udaythakur1405 3 years ago

    Easy & Crisp To Understand. Really appreciate.

  • @alaad1009
    @alaad1009 2 years ago +1

    Excellent explanation 👌 one of the best I've seen on the subject!

  • @TainuiaKid1973
    @TainuiaKid1973 2 years ago

    this is a very good explanation. one of the best technical videos I've ever watched on YT. thank you!

  • @anjanikumarchoubey7969
    @anjanikumarchoubey7969 3 years ago +1

    Very effective lecture in introducing the data pipeline and promote to adopt in improving the Business /egovernance services and advisories

  • @jananisri6214
    @jananisri6214 2 years ago +1

    One of the best tutorials in youtube so far which gives an overview of data engineering process and that too within 10 minutes. Really appreciate your effort and time you put into making this video. Thank you so much. Please keep doing more such tutorials.

  • @venkataramanamurthypasumar4542
    @venkataramanamurthypasumar4542 2 years ago +1

    Excellent presentation on the topic Data Pipeline with live examples.👌👍

  • @APUSHstudent777
    @APUSHstudent777 2 years ago

    I am prepping for an interview and preparing how to talk about this topic. You explain this very simple and easy to follow. Thank you.

  • @luisjimenez917
    @luisjimenez917 3 years ago +1

    Excellent!! Thanks...!! Congratulations!!!

    • @ITkFunde
      @ITkFunde 3 years ago

      Thanks Luis ☺️☺️

  • @formulaRoot
    @formulaRoot 2 years ago +1

    Beautiful! Thanks for this!

  • @sharmilanadgir5030
    @sharmilanadgir5030 2 years ago

    Thank you for this simple and clear explanation of data pipeline. Now I have a clear picture of how data flows from consumer to producer

  • @victorbgdream8328
    @victorbgdream8328 3 years ago +1

    Thanks, very nice and simple concept.

  • @sushilamahato
    @sushilamahato 2 years ago +1

    It cleared my doubts. Thanks a lot for teaching so well!! 👍

    • @ITkFunde
      @ITkFunde 2 years ago

      Thanks Sushila ☺🙏

  • @jasper5016
    @jasper5016 2 years ago

    Thanks a lot. This tutorial taught so many things within 10 mins.

  • @dhansraj7345
    @dhansraj7345 3 years ago

    Very nice architecture in a simple hand drawn picture and presentation also. Awesome job

  • @ryanhutchins2634
    @ryanhutchins2634 2 years ago

    Great introductory video. Thanks for sharing your knowledge.

  • @bmcseal01
    @bmcseal01 2 years ago +1

    Best explanation of data pipeline ever!