Part 1- End to End Azure Data Engineering Project | Project Overview

Поделиться
HTML-код
  • Опубликовано: 13 дек 2024

Комментарии • 148

  • @mr.ktalkstech
    @mr.ktalkstech  7 дней назад +3

    Thank you for watching! If you found Part 1 valuable and want to dive deeper, the full tutorial is available on Udemy.
    ▶ Get the Full Course on Udemy -> www.udemy.com/course/end-to-end-azure-data-engineering-real-time-project/?referralCode=626B44A4C9AA848ACB53
    Thank you for supporting my work, and I’m excited to help you continue your learning journey!

    • @TheRawFootages
      @TheRawFootages 6 дней назад +2

      why did you hide your content sir? I thought you are the only teacher who help poor students like us providing best content in free.

    • @mr.ktalkstech
      @mr.ktalkstech  6 дней назад +2

      I sincerely apologize for this situation. Unfortunately, due to Udemy's policies, I had to remove the content. Thank you so much for your understanding and continued support.

  • @seedhiBaatNoBakwas.
    @seedhiBaatNoBakwas. 5 месяцев назад +10

    Great playlist for someone who has zero knowledge on ETL/AZURE. Good to clear fundamentals of azure resources

  • @pavankulkarni352
    @pavankulkarni352 Месяц назад +1

    This is the cleanest explanation I have ever come across on azure.

  • @madhavtn7947
    @madhavtn7947 Год назад +7

    I just started doing projects on data engineering and to be honest, this series needs to be on top results. Very useful content and easily understandable to newbees. Eagerly waiting for new projects using new tools and cloud services

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад

      Thank you so much :)

    • @BOSS-AI-20
      @BOSS-AI-20 Год назад

      @@mr.ktalkstech Hello Sir, can I have your linkedIn Id

    • @blackspring4605
      @blackspring4605 6 месяцев назад

      Do you if they are all open source ?

  • @RukshanEdirisinghe-v9q
    @RukshanEdirisinghe-v9q 4 месяца назад +7

    this helped me to find my job. Thank you

  • @prabhatgupta6415
    @prabhatgupta6415 Год назад +6

    wow what a explanantion ..Huge respect.
    keep doing u will have good followers soon

  • @jansenoliveira2823
    @jansenoliveira2823 Год назад +3

    Amazing content. Congrats and thanks!

  • @prabhatgupta6415
    @prabhatgupta6415 Год назад +3

    Plz plzz bring more.. U teach very well

  • @ZombieHelion2561
    @ZombieHelion2561 9 месяцев назад

    Hi Mr.K your lessons give the view of the roles of Data Engineering. I really appreciate your videos and would like to thank you sir. May God bless you and your family.

  • @jabalraji7883
    @jabalraji7883 Год назад

    Am a starter in DE....your illustration is awesome and I have subscribed to page for more updates...

  • @dhruba454
    @dhruba454 5 месяцев назад

    Amazing content, Thank you for sharing this video series..

  • @shyamsunderMerugu
    @shyamsunderMerugu Год назад +1

    Excellent....Superb tutorial. Fantastic explanation in a nut shell...

  • @sandeepkumar0612
    @sandeepkumar0612 Месяц назад

    this video is savior for new aspirants

  • @Win_whatsimportantnow
    @Win_whatsimportantnow 5 месяцев назад

    This video series is a game changer for me

  • @bhavindedhia3976
    @bhavindedhia3976 Год назад +1

    You are really amazing seriously waiting for more such projects

  • @mansouralshamri1387
    @mansouralshamri1387 6 месяцев назад +7

    Why do we use Databricks? Azure Synapse Analytics does ETL.

    • @kenamia9136
      @kenamia9136 Месяц назад +2

      Perhaps He wants to expose you to as many tools as possible

  • @tao-adl
    @tao-adl 7 месяцев назад

    Awesome, some key concepts finally clicked in my brain. Great breakdown!

  • @charankatta
    @charankatta Год назад +4

    hi, great tutorial and indeed good learning for starters as me. Can you also please make end to end azure data engineering real time project with continuous data stream & readily available big data (so that we can readily download from your link). It would be of great help for us.

  • @vamshiikrishna
    @vamshiikrishna Год назад +1

    Please carry with more viedos your knowledge sharing is helping us a lot🙏

  • @helovesdata8483
    @helovesdata8483 Год назад +1

    we are using the Medallion architecture at my job now.

  • @pradeeppeace4541
    @pradeeppeace4541 5 месяцев назад

    Thank you for explaining concept simple with presentation.

  • @ajinkyadhoke4713
    @ajinkyadhoke4713 Год назад +1

    Excellent Explanation...🔥🔥🔥🔥

  • @kiruthikal5910
    @kiruthikal5910 5 месяцев назад +1

    Excellent content brother

  • @justvenkyy...3423
    @justvenkyy...3423 10 месяцев назад

    such a good explanation. great work. please post on complex challenges that faced by data engineers and its solutions.

  • @Manohar-q7k
    @Manohar-q7k Месяц назад +4

    In real world, if we take similar setup, may I know what would be the reason for using Databricks instead of Data Factory for the transformation of the data between the layers?

    • @AmanSingh-ig1en
      @AmanSingh-ig1en Месяц назад

      Although we can use dataflow in adf for transformation but it is easy to use pyspark with dataframe and all for transformation and pyspark is fast also. And one more thing we most use adf for orchestration

  • @passions9730
    @passions9730 Год назад

    very good session...thanks for brining this project. subscribed to channel by seeing the content..

  • @seeemant
    @seeemant Год назад

    Amazing, pls add AKS too

  • @azureportol
    @azureportol 11 месяцев назад +6

    I am not able to find data set about this project

    • @sayantanpodder2478
      @sayantanpodder2478 9 месяцев назад +1

      Please refer other comments before commenting

  • @Dipsvloggermany2021
    @Dipsvloggermany2021 Год назад +3

    Can you make an end to end project using Microsoft Fabric ? And please make more end to end to project like this

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад +1

      Hi, sure, I am already looking into Fabrics, you can expect the video in the near future, thanks for understanding :)

  • @chubsmash7602
    @chubsmash7602 6 месяцев назад

    Thank you for these videos, really appreciate the time and efforts.

  • @hashmatsulthana
    @hashmatsulthana Год назад

    Thank you so much for this content .can you also please bring up video for ADF to snowflake?

  • @vps071
    @vps071 Год назад +1

    great informative video! quick question..why is Synapse analytics needed? Can't PowerBi directly get feed from the gold layer in datalake?

    • @mr.ktalkstech
      @mr.ktalkstech  11 месяцев назад +1

      Thank you so much :) We can connect directly from Data Lake as well- but its always recommended to use a structured database as a serving layer for reporting which will be scalable and handling the security will be simpler :)

  • @Dataenginner
    @Dataenginner Год назад +5

    The concept you just showed in 11 mins is more worth then others playlist 😂, good to be ur subscriber man ❤ please keep making videos and help student

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад

      Thank you so much for the biggest compliment :)

    • @jayanttiwari3762
      @jayanttiwari3762 Год назад

      @@mr.ktalkstech even i feel the same. Bhai your concepts are very clear, awesum videos.

  • @AltafAnsari-tf9nl
    @AltafAnsari-tf9nl 11 месяцев назад

    Awesome explanation

  • @rasikakurhade1011
    @rasikakurhade1011 10 месяцев назад +2

    Hi Mr.K,
    I have also worked on the same migration project wherein we migrated data from on prem sql server to azure data lake gen2. We have already transformed data into SQL server as per the business requirement and then copied it to data lake gen2 using activities in ADF.
    In this video you explained about lake house architecture which I was not aware earlier when I worked on this project.
    So I have a small doubt:
    As we transformed data already before migrating it to azure as per the client requirements in SQL server then after loading it to the azure data Lake, in which layer of lake house architecture it would have been copied by us among bronze, silver and gold? And is it possible to copy data directly to gold layer? It was my first project so I couldn't pay attention to more details, could you please help me understand about it.
    Thanks in advance!

    • @mr.ktalkstech
      @mr.ktalkstech  10 месяцев назад +1

      Thanks for reaching out :) If the data is already transformed and it doesn't require any further transformation at all- then we can load directly to the Gold layer.

    • @rasikakurhade1011
      @rasikakurhade1011 9 месяцев назад

      @@mr.ktalkstech : Thanks for clearing the doubt.

  • @sidsrivastava6987
    @sidsrivastava6987 10 месяцев назад

    Damn i did an exact project like this in my internship at Amazon

  • @pandeyvivak8223
    @pandeyvivak8223 Год назад +1

    can you please bring more videos like this. Also DP203 certification guide videos.

  • @sowmyakotapally6677
    @sowmyakotapally6677 17 дней назад

    hi,
    Can u make Video to cover Azure and Spark relates interview questions and answers wrt to real time scenarios focusing on optimization done in specific for the use case and not the general methadologies.
    These are the questions I was asked recently.
    1) How do u recover a corrupt parquet data file
    2) U have millions of records in bronze layer and after transformations u have 50 million records in gold layer.
    U find that there are corrupt files in only one partition at the gold layer.
    How will u recover the file of that particular partition without rerunning the entire pipeline because we have millions of rows in both bronze layers
    3) What are the actual optimization done in project by you to achieve a) Execution time optimization b) Join level optimization
    Interviewer did not want generic answers which we know or would have read theortically.
    He wanted in specific How i implemented in the project
    Please do video with such tricky questions

  • @dp9794
    @dp9794 5 месяцев назад +1

    How to integrate data from sources like salesforce, AWS, Azure data lakes, Genesys, SAP

  • @sonusolanki9927
    @sonusolanki9927 5 месяцев назад

    Please share dataset to complete this project, really amazing videos

  • @raghuprasad3920
    @raghuprasad3920 Год назад

    Hi Sir, Thank you for the video can you also do a 'End to End (Snowflake + Azure) Data Engineering Project' ?

  • @Badr_ouz
    @Badr_ouz 6 месяцев назад

    Good explaination

  • @atulbisht9019
    @atulbisht9019 2 месяца назад +2

    Sir this usecase doesn't make sense. They would want to eliminate/cut the on prem data warehouse to azure environment then why wiill we be connecting to it. For one bulk loat it is understandable but for daily refreshes the source should be an OLTP system?
    Still thanks for making this playlist...it is really helpful to understand important azure services.

  • @ShanumUmaira-l3f
    @ShanumUmaira-l3f 2 месяца назад

    How do we load data from gold layer to synopsis...using ADF? or data bricks?

  • @rammik1494
    @rammik1494 8 месяцев назад

    Thank you so much for explaining the architecture. Wonderful content 😊
    I have a question though- what is the use is azure synapse analytics as we already have gold layer with clean data. Why can’t we connect bi tool directly to gold layer?
    Can you please let me know sir?

  • @nuzhatnsu
    @nuzhatnsu Год назад +1

    thanks for providing an amazing video... please provide the link to the dataset so we can practice.. thanks in advance

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад

      It's an open source Adventure works database- follow the below link to import the database to the SSMS (I used the light weight version)
      learn.microsoft.com/en-us/sql/samples/adventureworks-install-configure?view=sql-server-ver16&tabs=ssms

    • @Mehtre108
      @Mehtre108 Год назад

      ​@@mr.ktalkstechbro what is project name

  • @sergendula3256
    @sergendula3256 5 дней назад

    hello sir i have a problem with the transformation in part 6(data transformation) i keep getting the error
    AnalysisException: Found duplicate column(s) in the data to save: ship__to__address__id, sub__total, credit__card__approval__code, ship__method, ship__date, purchase__order__number, account__number, modified__date, order__date, revision__number, tax__amt, customer__id, due__date, sales__order__number, online__order__flag, bill__to__address__id, total__due, sales__order__id
    and the more i try to redefine the logic it still gives thesame errors

  • @ricardogomes4077
    @ricardogomes4077 Год назад

    plzz bring more using semi-structured and unstructured data

  • @UjjwalDhiman-lm5pj
    @UjjwalDhiman-lm5pj 6 месяцев назад

    Project is amazing, can I get the database with tables you used in this project

  • @venzotv1976
    @venzotv1976 Год назад

    Why do we need Synapse if PowerBI can read from any Gen2 storage at Gold level?

  • @likhim
    @likhim 11 месяцев назад

    Hi Sir can u pls advise after free tier over how much cost it will come to use azure for learning this project

  • @ranjansrivastava9256
    @ranjansrivastava9256 Год назад +1

    Dear really great video. I have couple of questions on this architecture a. Which type of challenges do we face if we connect Power BI to Databricks directly to prepare dashboards. b. We can do the transformations in Synapse as well , and how do we connect Gold Layer to Synapse to prepare the data before connect to the Power BI dashboards. c. What challenges do we face if we connect On-Premises SQL Server data to Power BI directly to prepare the dashboards. Kindly help me on that.

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад +1

      Previously databricks doesn't have a serverless DB (I guess they recently added it)- having serverless DB to Power BI integration will be better as we don't need to wait for the cluster to turn on as it will be readily available to query the tables.

    • @ranjansrivastava9256
      @ranjansrivastava9256 Год назад +1

      @@mr.ktalkstech one more query was there like:- suppose client does not like to go on cloud :- c. What challenges do we face if we connect On-Premises SQL Server data to Power BI directly to prepare the dashboards.

    • @alaricmbooh3628
      @alaricmbooh3628 11 месяцев назад +1

      Some challenges could be related to scalability

  • @UnrealK9999
    @UnrealK9999 Год назад

    thanks for this!!

  • @udaynj
    @udaynj Год назад

    Why do you need DataBricks AND Synapse? Synapse does data transformation/loading also. Seems duplicative to me. Can you pls explain? Thanks

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад +4

      Yes, you are right- synapse does both- in most cases Databricks is preferred for doing the data transformation, which works really well for the big data workloads and with the streaming data.
      But the main Idea of using databricks for this projects is to cover different resources as possible in the architecture, so that it would help people to understand how each resources works together. Hope that makes sense :)

  • @DipuApple
    @DipuApple 10 месяцев назад

    is the project OS independent ? like any1 using mac linux ubuntu try it out ? or azure is only for Microsoft ?

  • @nallakumarp2886
    @nallakumarp2886 6 месяцев назад

    Its very useful video . Can you please let me know if you have any hadoop data migration from hdfs to Azure sql server project . if Yes kindly share the link

  • @DrayCool-df8kj
    @DrayCool-df8kj Месяц назад

    Thank you. Do you have a community ? I wanna join please.

  • @gautamgovinda5140
    @gautamgovinda5140 Год назад

    👍

  • @abhishekkalia6990
    @abhishekkalia6990 5 дней назад

    bro why rest of the videos are hidden now?

  • @abhaybhatnate7428
    @abhaybhatnate7428 11 месяцев назад

    Sir can you please upload the data set plz...... unable to do the project

  • @Chennairthymes
    @Chennairthymes 2 месяца назад

    Can you please share the project title for this project

  • @saikumarjakki3802
    @saikumarjakki3802 10 месяцев назад

    HI where i can get the on prem data can u share that link it will be help full

  • @Mehtre108
    @Mehtre108 Год назад

    Did pyspark use in databrics sir

  • @zahidalam7831
    @zahidalam7831 8 месяцев назад

    Hi Mr k,
    Kindly help me out how to put this project in our resume. Whats the best way to present this project into resume so that we
    can explain the thing whatever we used in this.

    • @zahidalam7831
      @zahidalam7831 8 месяцев назад

      Kindly tell me

    • @zahidalam7831
      @zahidalam7831 7 месяцев назад

      Plz suggest me

    • @pavankumard5276
      @pavankumard5276 2 месяца назад

      I have not watched the entire video but you can put something like migrated on premise sql db to azure

  • @PavanKalyan-ec9mw
    @PavanKalyan-ec9mw Год назад

    i think we can transfer this data using data migration service right, if it's just for one time.

  • @AngamVijaykumar
    @AngamVijaykumar Месяц назад

    Please say the use case for the project

  • @shravyakulal5756
    @shravyakulal5756 15 дней назад

    Is this project available in udemy?

  • @abhishekkalia6990
    @abhishekkalia6990 5 дней назад

    hey bro i have already supported you. I being charged and tried to copy the link but at the time of access it has gone. i want this project access

  • @mansinayak3360
    @mansinayak3360 6 месяцев назад

    Do we need any subscription to build this project at any stage?

  • @moizmirza9179
    @moizmirza9179 7 дней назад

    why did you disabled other parts brother, I was following the tutorial :(

  • @ashabhumza3394
    @ashabhumza3394 Год назад

    Can I also do this project along side this video? I mean without paying anything for using Azure.

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад +1

      Hey. thanks for reaching out. You can create a free azure account which will give you free credit of 200 dollars for 1 Year period, and you can use it to do the project if you would like to :)
      azure.microsoft.com/en-us/free/

    • @rajkumarbandi6195
      @rajkumarbandi6195 Год назад

      @@mr.ktalkstech its 30 days I think

    • @AbhishekParmar-gy3fz
      @AbhishekParmar-gy3fz 11 месяцев назад

      @@mr.ktalkstech Hi is 200 dollars enough to complete the whole project?

  • @nitikjain993
    @nitikjain993 Год назад

    Could you please make this same project in using AWS services?

  • @adityadhawle6735
    @adityadhawle6735 8 месяцев назад

    thanks bro

  • @Mehtre108
    @Mehtre108 Год назад

    Hello Sir,
    What should i mention project name on resume
    Description
    Roles n responsibilities

    • @Mehtre108
      @Mehtre108 Год назад

      I am new in this field so pls help sir

  • @atharvbajare7398
    @atharvbajare7398 4 месяца назад

    please provide me dataset you have used during this project

  • @anishsaha1777
    @anishsaha1777 5 месяцев назад

    Can the entire project be done by using the free subscription of Azure?

  • @prabhatgupta6415
    @prabhatgupta6415 Год назад

    SIR CAN YOU BRING SOMETHING ON HEALTH CARE PROECT

  • @beniffland7310
    @beniffland7310 Год назад

    Can I ask is Microsoft Fabric basically using these same services?

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад

      Fabric contains Data Factory + Synapse + Data Lake (It does not have other services used in this Project)

  • @Chennairthymes
    @Chennairthymes Месяц назад

    Can you please say me the project name

  • @satyajeetdesai6076
    @satyajeetdesai6076 2 месяца назад

    is this project with free resources?

  • @kanthikumar122
    @kanthikumar122 Год назад

    Can you please suggest good institute to learn azure data engineer course

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад

      I am not sure about that, Sorry :)

    • @Ady_Sr
      @Ady_Sr Год назад

      Buy ur own subscription. Learn 1 module at a time from open sources like youtube n documents.

  • @TheAdventureArchive1
    @TheAdventureArchive1 10 месяцев назад

    sir datasets

  • @portiseremacunix
    @portiseremacunix 3 дня назад

    unsub...

  • @chamarthysowjanya
    @chamarthysowjanya Год назад

    Hai Kishore u r explaining simply Superab how can I contact u

    • @mr.ktalkstech
      @mr.ktalkstech  Год назад

      Thank you :) email: mrktalkstech@gmail.com

  • @carterh7470
    @carterh7470 8 месяцев назад +1

    🤌🤌🤌🤌 this is perfect

  • @karthireddy5838
    @karthireddy5838 8 месяцев назад

    Amazing content, thanks for this video!!

  • @sureshk8882
    @sureshk8882 6 месяцев назад

    very nice explained