Mr. K Talks Tech
  • Videos: 79
  • Views: 1,238,271
Part- 4 (Data Ingestion using Azure Databricks)- End to End Streaming Azure Data Engineering Project
Welcome back to the next part of our exciting end-to-end real-time streaming Data Engineering project! In this video, we'll walk through the data ingestion process, using Azure Databricks to send events to the Event Hub.
Download the code from here: buymeacoffee.com/mrktalkstech/e/333857
Support me for my work :) Please Like, Share and Subscribe to my YT channel :)
#DataEngineering #AzureDatabricks #AzureFunctions #EventHub #MicrosoftFabric #RealTimeData #StreamingData #PowerBI #KustoDB #DataActivator #CloudData #WeatherData #EndToEndProject #DataIngestion #DataPipeline #AzureEventHub #RealTimeAlerts
Join this channel to get access to perks:
ruclips.net/channel/UCzdOan4AmF65PmLLks8Lmwwjoin
-...
Views: 1,222

Videos

Part- 3 (Data Ingestion using Azure Databricks)- End to End Streaming Azure Data Engineering Project
1.6K views · 21 days ago
Welcome back to the next part of our exciting end-to-end real-time streaming Data Engineering project! In this video, we'll walk through the data ingestion process, using Azure Databricks to send events to the Event Hub. #DataEngineering #AzureDatabricks #AzureFunctions #EventHub #MicrosoftFabric #RealTimeData #StreamingData #PowerBI #KustoDB #DataActivator #CloudData #WeatherData #EndToEndP...
Part 2- End to End Realtime Streaming Azure Data Engineering Project (Environment Setup)
2.7K views · 1 month ago
Welcome back to the next part of our exciting end-to-end real-time streaming Data Engineering project! In this video, we’ll walk through the entire environment setup process, covering all the essential resources we’ll use to build a real-time weather reporting system using Azure services and Microsoft Fabric. TimeStamp: 0:00:00 - Setting up the Weather API 0:03:58 - Creating Resource Group 0:07...
End to End Realtime Streaming Azure Data Engineering Project (Part -1) | Complete Guide with Demo
5K views · 1 month ago
Welcome to an exciting session where we'll build a real-time streaming Data Engineering project using Azure services! This is one of the best ways to get hands-on experience with real-time streaming while understanding architectural decision-making for cost and performance optimization. In this project, we’ll: - Set up a real-time weather report system using Weather API - Implement data ingesti...
6 Proven Steps to Land a Data Engineer Job with ZERO Experience (Even as a Fresher!)
4.9K views · 2 months ago
#dataengineering #azuredataengineering #aws #gcp #jobs In this video, I explain 6 important steps that will help you get hired as a Data Engineer. This is probably the question I am asked most often, which is why I decided to make a video on this topic. I hope it is helpful. Please let me know in the comment section if you have any questions. - - - Book a Private One on One Meet...
Must-Know PySpark Interview Question for Data Engineers - Live Demo & Tips!
3.4K views · 2 months ago
#ApacheSpark #DataEngineering #AzureDataEngineer #SparkSQL #DataTransformation #DataFrame #InterviewQuestion #BigData #AzureDatabricks #PySpark #DataAnalysis #DataScience #SQLQuery #Optimization #Efficiency #Tutorial In this video, we'll dive into a popular PySpark interview question often asked by financial and banking companies-calculating the running total for grouped data. We'll explore the...
What is Apache Spark? Learn Apache Spark in 15 Minutes
7K views · 3 months ago
#apachespark #databricks #sparkteam #dataengineering #pyspark #architecture In this video, I cover the most important topic for a Data Engineer: Apache Spark. In particular, I walk through the complete end-to-end architecture of Spark, covering all the individual components below: 1. Driver Program 2. Worker Node 3. Cluster Manager 4. Spark Context or Spark Session 5. ...
Data Engineer's Biggest Nightmare: Solving Merge Conflicts Step-by-Step Guide with Demo
2.1K views · 3 months ago
#AzureDevOps #Git #DataEngineering #InterviewQuestions #DevOpsTutorial #AzureDatabricks #GitRevert #BranchRecovery #CodingBestPractices #TechInterviewTips #MergeConflicts 🔍 What We'll Cover: In this video, we dive into a crucial topic for every Data Engineer: Merge Conflicts. Merge conflicts are one of the most common issues encountered when working collaboratively with Git repositories. Unders...
Untold secrets of Data Engineering Project !!! Reality and Facts in 15 minutes
9K views · 4 months ago
Follow me on LinkedIn: www.linkedin.com/in/mrk-talkstech/ Book a one-on-one meeting: www.buymeacoffee.com/mrktalkstech/e/166354 Welcome to my channel! In this video, we’re diving deep into the creation of a real-time Data Engineering project from scratch. This comprehensive guide covers everything you need to know, from the initial discussions to the final implementation and maintenance. What We’...
Don't fail to answer these questions in your next Interview- Data Engineering Interview Question
3.3K views · 5 months ago
#AzureDevOps #Git #DataEngineering #InterviewQuestions #DevOpsTutorial #AzureDatabricks #GitRevert #BranchRecovery #CodingBestPractices #TechInterviewTips 🔍 What We'll Cover: In this video, we tackle two essential Azure DevOps and Git-related interview questions for data engineers. The first question addresses how to revert changes mistakenly committed to the main branch. Using a practical exam...
Tips and Tricks- Delta Lake Table in Apache Spark - Azure Data Engineering Interview Question
4.6K views · 5 months ago
#PySpark #DeltaTable #AzureDatabricks #BigData #DataEngineering #DataLake #InterviewQuestions #TechTutorial #DataAnalytics #DataTransformation #SchemaEvolution #ACIDProperties #DataGovernance #TimeTravel #DataPipelines #Azure #Databricks 🔍 What We'll Cover: Delta Table Overview: Understand why Delta Tables have become a popular choice for big data analytics in cloud-based platforms. Code Walkth...
Can you solve this simple Spark SQL Interview Question? | Azure Data Engineering Tutorials
2K views · 5 months ago
#SQL #SPARKSQL #Databricks #SQLInterview #SQLTutorial #DataScience #DataEngineering #Programming #Coding #TechInterviews In this video, we explore a SPARK SQL interview question that may seem simple but can be quite tricky. This question has stumped many candidates recently, which is why we're diving deep into it. If you find this video helpful, please like, share, and subscribe for more tutori...
15 Minutes- Libraries in Databricks Explained -Tips & Tricks | Azure Databricks Tutorials
3.1K views · 5 months ago
Azure Synapse Analytics- Interview Questions | Serverless SQL Pool VS Dedicated SQL Pool with Demo
4.8K views · 6 months ago
Most popular Data Warehousing Interview Question- Azure Data Engineering Interview Question Tutorial
3.3K views · 7 months ago
90% of Data Engineers doesn't know this - PySpark Azure Data Engineering Interview Question Tutorial
5K views · 7 months ago
Tips and Tricks- Azure Data Engineering Interview Questions | Managed Identity vs Service Principal
12K views · 7 months ago
Microsoft Fabric - End to End Azure Data Engineering Project - Bing news Data Analytics
26K views · 9 months ago
A 6-Month Complete Guide to Become a Data Engineer in 2024 😎 - Azure Data Engineer Roadmap 2024
26K views · 11 months ago
Unlocking Secrets in Azure Databricks with Azure Key Vault! 🗝️✨ | Azure Databricks Tutorials
8K views · 1 year ago
Top 5 Notebook Features You Can't Miss in Azure Databricks 🚀| Databricks and Azure DevOps Tutorials
3.8K views · 1 year ago
15 Minutes- Spark Clusters in Databricks Explained -Tips & Tricks | Azure Databricks Tutorials
21K views · 1 year ago
Part 1- What is CI/CD? - Continuous Integration and Continuous Deployment in Azure Databricks Demo
31K views · 1 year ago
Create your first Data Ingestion Pipeline in Microsoft Fabric | Microsoft Fabric Tutorials
9K views · 1 year ago
Sending ADF Pipeline alerts to Microsoft Teams Demo | Azure Data Factory Tutorials for Beginners
7K views · 1 year ago
How to create Azure Databricks? Explained Easy | Azure Databricks Tutorials for Beginners
4.1K views · 1 year ago
What is Tumbling Window Trigger? Implementing Incremental Load in Azure Data Factory | ADF Tutorials
10K views · 1 year ago
How to create and enable Microsoft Fabric using Azure Portal? | Microsoft Fabric Tutorials
11K views · 1 year ago
What is Microsoft Fabric? | Learn Microsoft Fabric in 15 minutes | Step by Step Guide
8K views · 1 year ago
What's the Future of this Channel?
275 views · 1 year ago

Comments

  • @shravyakulal5756
    @shravyakulal5756 9 hours ago

    Is this project available on Udemy?

  • @abhishekkalia6990
    @abhishekkalia6990 20 hours ago

    This project is indeed informative and well explained. Thank you.👍🏻

  • @muhammadariffahrudin4876
    @muhammadariffahrudin4876 21 hours ago

    Hi, thanks for your video tutorial, it's very insightful. I want to ask: why don't we use a Fabric notebook instead of a Databricks notebook?

  • @vikramsingh5757
    @vikramsingh5757 1 day ago

    Imagine a Senior Data Engineer as the Senior Manager of a large department. They oversee the entire operation and have full control over all resources (the Azure Tenant). Within this department, the Senior Manager creates different teams (Resource Groups) to organize different services like Data Lake, Synapse Analytics, and Spark. To manage one of these teams, the Senior Manager appoints an Assistant Senior Manager as the team leader (Owner role) of a specific group. The Assistant Senior Manager has full control over the team and assigns roles to other members. For example, they may grant a Manager (Contributor role) the responsibility of managing the services, while a Team Leader or Team Member (Reader or Contributor roles) may be given specific access, like overseeing data ingestion or analysis. The Assistant Senior Manager ensures everyone has the appropriate access level to complete their tasks, while the Senior Manager maintains overall oversight of the department’s operations.
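The role hierarchy in this analogy can be written down as a small lookup table. The sketch below is a toy model, not the Azure API: the action names (`read`, `write`, `assign_roles`) and the `can` helper are made up for illustration, and the real built-in roles cover far more permissions. It only captures the distinction the comment draws, namely that an Owner can manage access, a Contributor can manage resources but not assign roles, and a Reader can only view.

```python
# Toy model of the three Azure built-in roles named in the comment.
# Action names are hypothetical labels, not real Azure permission strings.
ROLE_ACTIONS = {
    "Owner": {"read", "write", "assign_roles"},
    "Contributor": {"read", "write"},
    "Reader": {"read"},
}


def can(role: str, action: str) -> bool:
    """Return True if the given role is allowed to perform the action."""
    return action in ROLE_ACTIONS.get(role, set())


if __name__ == "__main__":
    for role in ROLE_ACTIONS:
        print(role, "can assign roles:", can(role, "assign_roles"))
```

Unknown roles get an empty permission set, so `can` denies by default rather than raising.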

  • @TheMapleSight
    @TheMapleSight 1 day ago

    What is the difference between doing it in PySpark and SQL? I think it's much easier in SQL:
    %sql
    DROP TABLE IF EXISTS test;
    CREATE TABLE test (id INT, total INT);
    INSERT INTO test (id, total) VALUES (1, 10), (1, 20), (1, 30), (1, 40), (2, 20), (2, 40), (2, 60), (2, 80);
    SELECT *, SUM(total) OVER (PARTITION BY id ORDER BY total) AS emp_run FROM test;
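The window query in the comment above can be tried without a Spark cluster. The sketch below is an assumption-laden stand-in: it runs the same `SUM(...) OVER (PARTITION BY ... ORDER BY ...)` pattern through Python's built-in `sqlite3` module (which needs SQLite 3.25 or newer for window functions) rather than Spark SQL, and the `running_totals` helper name is made up for illustration. The table and sample rows are the ones from the comment.

```python
import sqlite3

# Sample rows from the comment: (id, total)
ROWS = [(1, 10), (1, 20), (1, 30), (1, 40),
        (2, 20), (2, 40), (2, 60), (2, 80)]


def running_totals(rows):
    """Compute a per-id running total with a SQL window function."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE test (id INTEGER, total INTEGER)")
        conn.executemany("INSERT INTO test (id, total) VALUES (?, ?)", rows)
        query = (
            "SELECT id, total, "
            "SUM(total) OVER (PARTITION BY id ORDER BY total) AS emp_run "
            "FROM test ORDER BY id, total"
        )
        return conn.execute(query).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for row in running_totals(ROWS):
        print(row)
```

One caveat applies to both engines: with only `ORDER BY total` in the window, rows that tie on `total` within the same `id` share a running total, so a unique ordering column is safer on real data.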

  • @sowmyakotapally6677
    @sowmyakotapally6677 1 day ago

    Hi, can you make a video covering Azure and Spark related interview questions and answers for real-time scenarios, focusing on the optimizations done for a specific use case rather than general methodologies? These are the questions I was asked recently:
    1) How do you recover a corrupt Parquet data file?
    2) You have millions of records in the bronze layer and, after transformations, 50 million records in the gold layer. You find corrupt files in only one partition of the gold layer. How will you recover the files of that particular partition without rerunning the entire pipeline, given that both layers hold millions of rows?
    3) What optimizations did you actually make in your project to achieve a) execution time optimization and b) join-level optimization?
    The interviewer did not want generic answers that we already know or have read about in theory. He wanted to know specifically how I implemented them in the project. Please do a video with such tricky questions.

  • @antti-juhanamaki7420
    @antti-juhanamaki7420 2 days ago

    Great video and project, but the Synapse part seems a bit irrelevant, or at least not worth the money. Isn't Synapse overkill for loading data from a lakehouse into a database? Why use it instead of, e.g., having a similar database in Azure and orchestrating with the ADF + Databricks combo that was used in the project anyway?

  • @BalbirSodhi
    @BalbirSodhi 3 days ago

    One of the best training courses. Thanks for covering all the steps. Very useful for learning Azure Databricks.

  • @ASHISHSHARMA-eg6oi
    @ASHISHSHARMA-eg6oi 4 days ago

    How can I follow your project when you have already set up everything?

  • @masudkamrul5949
    @masudkamrul5949 4 days ago

    Eagerly waiting for your next videos of this project

  • @jeevang14u
    @jeevang14u 5 days ago

    How can you execute for previous dates that have already passed? We can just trigger for the old files, and we can do that with a schedule trigger as well. It's not like we can run the tumbling window trigger for previous days.

  • @nikhil390
    @nikhil390 5 days ago

    This was much-needed training. Thanks for sharing.

  • @frederikdelforche4833
    @frederikdelforche4833 6 days ago

    nice project & clear explanation --> please continue to create this type of content

  • @selvakumarr.k.8660
    @selvakumarr.k.8660 6 days ago

    useful presentation

  • @nikhilvenkat586
    @nikhilvenkat586 6 days ago

    The end-to-end project is great, with a good explanation and very useful content. But when we try to follow it ourselves, the prerequisite is the AdventureWorks database with the sample SalesLT tables on-prem, and the authentication process is difficult for new practitioners like me. I request a separate video on that so we can practise.

  • @dineshdeshpande6197
    @dineshdeshpande6197 6 days ago

    Can we use a UAMI (user-assigned managed identity) for connectivity between resources? What are its advantages, disadvantages, and use cases?

  • @ramswaroop1520
    @ramswaroop1520 6 days ago

    Hi sir, why did you remove the remaining parts of the playlist 😢 ... We didn't expect this from you.

  • @FUCKdarshan
    @FUCKdarshan 7 days ago

    dnt call it end-to-end project if you are skipping all of the initial steps you freakin' loser😡🤬

  • @ravulapallivenkatagurnadha9605
    @ravulapallivenkatagurnadha9605 7 days ago

    Nice video

  • @user-nu7nt1bk9o
    @user-nu7nt1bk9o 8 days ago

    Thank you for taking the time to create these wonderful videos. Can you tell us what it would cost to complete all of these training videos on a pay-as-you-go subscription? Are there any resources that can be paused?

  • @amairaanam6007
    @amairaanam6007 8 days ago

    superb explanation loved it❤

  • @mkdTech369
    @mkdTech369 8 days ago

    Thanks for it. Please upload more videos

  • @radhaa225
    @radhaa225 8 days ago

    Thank you

  • @MohammedAkbar-n3e
    @MohammedAkbar-n3e 8 days ago

    As a fresher or intern DE, how do we write such code? Do you have any source where we can get such code? (I heard many DEs copy-paste from AI or Google and just need to know the logic and replace the file paths.) Seeing such huge code, it looks like DE is also coding, which is difficult for non-IT people. In the on-prem to cloud project you have changes for dates and column names. What if the requirements are different, e.g. replacing nulls in all tables, removing blank spaces in all tables, removing duplicates in all tables, etc. (add on as per your experience)? What is the code for that? In the Bing API project you did transformations from object to string and string to dictionary, then merging for incremental load. What if other transformations are required by the company? How do we get such huge code? This project has lots of coding. Knowing basic Python and SQL is OK, but these last 2 project series have huge coding. How do we get such scripts? Please help.

  • @One_minute_vedio
    @One_minute_vedio 9 days ago

    Big fan sir

  • @sharaniyaswaminathan8760
    @sharaniyaswaminathan8760 9 days ago

    Very helpful!! 👍🏼

  • @sravankumar1767
    @sravankumar1767 9 days ago

    I have downloaded but I am unable to open the file

  • @sravankumar1767
    @sravankumar1767 9 days ago

    Superb explanation 👌 👏 👍 very helpful 👏 👍 👌

  • @shaheensyed5187
    @shaheensyed5187 9 days ago

    Thanks for your great explanation. Will keep watching your channel.

  • @mr.ktalkstech
    @mr.ktalkstech 9 days ago

    Download the code from here -> buymeacoffee.com/mrktalkstech/e/333857 Support me for my work :) Please Like, Share and Subscribe to my YT channel :)

  • @SigmaSid98
    @SigmaSid98 9 days ago

    Excellent 🙏🏻🙏🏻🙏🏻 ... Dedicated paid course is what we need from you SIR. You are great 🙏🏻🙏🏻🙏🏻

    • @mr.ktalkstech
      @mr.ktalkstech 9 days ago

      Thank you so much :) Sure, have plans for doing that :)

  • @satish1012
    @satish1012 9 days ago

    Are the views in ASA permanent, or are they just virtual?

  • @stkchan9762
    @stkchan9762 9 days ago

    integration runtime Error Code: 1002 Error: A problem occurred while receiving the configuration file from the server. Suggestion: You may retry the express setup on Azure portal, or switch over to manual setup.

    • @mr.ktalkstech
      @mr.ktalkstech 9 days ago

      At which stage are you getting this error?

  • @saileshKandula
    @saileshKandula 9 days ago

    Please upload a dataset that we can download and practise with.

  • @satish1012
    @satish1012 9 days ago

    Great presentation! Just one suggestion for those working on real-time projects: if you're writing data into Parquet files and storing them in Azure Data Lake (which is essentially object storage at its core), here's something to consider. For example, if you have 1 million records in a Parquet file and a new record is added in the source, the system will rewrite the entire 1 million records along with the new row. To prevent this, it's crucial to organize the data in Azure in a way that avoids large-scale updates. Instead of storing all sales data in a single folder, consider organizing it by year, then by month. If the dataset is extremely large, you can break it down further by day. This way, when updates occur in the source, only a small part of the data is affected, rather than the entire dataset.
    /SalesData
      /Year=2023
        /Month=01
          /Day=01
            sales_data_2023_01_01.parquet
          /Day=02
            sales_data_2023_01_02.parquet
        /Month=02
          /Day=01
            sales_data_2023_02_01.parquet
      /Year=2024
        /Month=01
          /Day=01
            sales_data_2024_01_01.parquet
          /Day=02
            sales_data_2024_01_02.parquet
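The folder layout described in the comment above can be derived mechanically from a record's date. The following is a minimal sketch in plain Python, assuming the `Year=/Month=/Day=` naming and the `sales_data_YYYY_MM_DD.parquet` file name from the comment; the `partition_path` helper and its parameters are hypothetical and not part of any Azure or Spark API.

```python
from datetime import date


def partition_path(root: str, day: date, prefix: str = "sales_data") -> str:
    """Build the Year=/Month=/Day= path for the file holding one day's data."""
    folder = f"{root}/Year={day:%Y}/Month={day:%m}/Day={day:%d}"
    filename = f"{prefix}_{day:%Y_%m_%d}.parquet"
    return f"{folder}/{filename}"


if __name__ == "__main__":
    # A change to a 2 January 2023 record touches only this one file.
    print(partition_path("/SalesData", date(2023, 1, 2)))
```

In Spark itself the same layout is usually produced by writing with partition columns instead of building paths by hand; the helper only makes the naming rule explicit.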

  • @stkchan9762
    @stkchan9762 9 days ago

    at 3:30 Where can I get my own password?

    • @mr.ktalkstech
      @mr.ktalkstech 9 days ago

      Hi, we are creating a new user with a password, so choose any complex password (you can also use the same one I gave in the video).

  • @saileshKandula
    @saileshKandula 9 days ago

    no dataset has been provided for practicing.

  • @Cannel-q4j
    @Cannel-q4j 10 days ago

    when is the next release

  • @anandk5677
    @anandk5677 11 days ago

    You are chandoo right??

  • @MohammedAzhar-g2q
    @MohammedAzhar-g2q 12 days ago

    Waiting for the videos feels like waiting for confirmation of Dhoni playing the next IPL for Chennai.

  • @bichannel3419
    @bichannel3419 13 days ago

    Thank you very much for making this useful content! Shows your sincere effort.

  • @selvakumarr.k.8660
    @selvakumarr.k.8660 14 days ago

    good presentation

  • @MohammedAzhar-g2q
    @MohammedAzhar-g2q 14 days ago

    Can anyone help with the SQL stored procedure query? I'm getting the error below:
    Started executing query at Line 1
    Changed database context to 'gold_db'.
    (0 record affected)
    An object or column name is missing or empty. For SELECT INTO statements, verify each column has a name. For other statements, look for empty alias names. Aliases defined as "" or [] are not allowed. Change the alias to a valid name.
    Total execution time: 00:00:01.645

  • @MohammedAzhar-g2q
    @MohammedAzhar-g2q 14 days ago

    Can anyone share or write the SQL stored procedure query here?

  • @MohammedAzhar-g2q
    @MohammedAzhar-g2q 14 days ago

    Can you provide the stored procedure script? None of the videos in your project series have the scripts in the description, and I'm getting an error with the stored procedure script.

  • @PatelTushya
    @PatelTushya 14 days ago

    Respect++

  • @sirajuddinmohamedsaleem937
    @sirajuddinmohamedsaleem937 15 days ago

    @mr.ktalkstech when can we expect the Part 2 video?

  • @sarveshdeshpande1772
    @sarveshdeshpande1772 15 days ago

    Brilliant, man... This is helping me a lot. "Straight talk, no nonsense" should be the name of this series.