How to Run a Spark Cluster with Multiple Workers Locally Using Docker

  • Published: 18 Dec 2024

Comments • 55

  • @АртёмМеркулов-ю3к 15 days ago +1

    Thank you so much for this video! Very helpful!

  • @SzTz100 3 months ago +3

    I haven't tried this yet, but if it works, you are a prince amongst men.

    • @thedataguygeorge 2 months ago

      Hahahaha let me know how it goes my man!

    • @SzTz100 1 month ago

@thedataguygeorge I just tried it and it worked for me, great job.

  • @josephdaquila2479 4 months ago +2

    This also seems like a good introduction to Docker. I am definitely getting a feel for the advantages of the tool.

  • @subhashkumar209 2 months ago +1

    Well, I had been searching for a course where we can do Spark development using an IDE, run complete end-to-end testing, and deploy to Azure Databricks. For the past 3 years I didn't find anything, but today I watched this video and I can say it's at least a starting point. If you can create more videos on local Azure Databricks development and deployment, I assure you, you will be the king.

  • @rasmusandreasson1548 9 months ago +1

    The king! Thank you for the good content!

  • @not_saboor 9 months ago +1

    Thanks for this!

  • @Levy957 9 months ago +1

    I love your videos

  • @LavieAdam-qq4uf 12 days ago +1

    Can I submit multiple custom jobs to the cluster at the same time?
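
A Spark standalone master will schedule multiple applications concurrently as long as cores and memory are available. A hedged sketch of two simultaneous submissions; the container name, master URL, and script paths are assumptions, not taken from the video:

```shell
# Hypothetical: submit two applications to the same standalone master.
# Capping spark.cores.max per app keeps one job from starving the other
# under the default FIFO resource allocation.
docker exec da-spark-master spark-submit \
  --master spark://spark-master:7077 \
  --conf spark.cores.max=2 \
  /opt/spark/apps/job_a.py &

docker exec da-spark-master spark-submit \
  --master spark://spark-master:7077 \
  --conf spark.cores.max=2 \
  /opt/spark/apps/job_b.py &
wait
```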

  • @mayowaoludoyi5425 8 months ago +1

    Thank you for this walkthrough video. How can I establish a connection to a relational database like Oracle from "dockerised" Spark like this? I understand there is a different setup that requires JDBC. Where does it fit in your setup?

    • @thedataguygeorge 8 months ago +2

      Hey, you would add it similarly to how I connect to Snowflake in other scripts, where you use the Python ODBC drivers to establish connections to relational DBs like Oracle
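
The reply mentions ODBC drivers, but the usual Spark route to a relational database is the built-in JDBC data source with the vendor's driver jar on the classpath. A minimal sketch; the jar path, host, credentials, and table name are all placeholders, not from the video:

```python
# Hedged sketch: reading an Oracle table through Spark's JDBC source.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("oracle-read")
    # The Oracle JDBC driver jar must exist at this path inside the container.
    .config("spark.jars", "/opt/spark/jars/ojdbc8.jar")
    .getOrCreate()
)

df = (
    spark.read.format("jdbc")
    .option("url", "jdbc:oracle:thin:@//oracle-host:1521/ORCLPDB1")
    .option("dbtable", "MY_SCHEMA.MY_TABLE")
    .option("user", "my_user")
    .option("password", "my_password")
    .option("driver", "oracle.jdbc.OracleDriver")
    .load()
)
df.show()
```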

  • @dongtandung9671 1 month ago

    Do you have this on a repo so that we can take a look at the whole thing?

  • @nansambassassensalo3065 2 months ago

    Also wondering how you addressed the JAVA_HOME path setup. My error message is that it's not set.
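
The "JAVA_HOME is not set" error usually means the image installs Java but never exports its path. A hedged Dockerfile fragment; the exact path is an assumption that matches Debian/Ubuntu base images with OpenJDK 17 on amd64, so adjust for your image:

```dockerfile
# Hypothetical fragment: install a JDK and export JAVA_HOME for Spark.
RUN apt-get update && apt-get install -y openjdk-17-jdk-headless
ENV JAVA_HOME=/usr/lib/jvm/java-17-openjdk-amd64
ENV PATH="${JAVA_HOME}/bin:${PATH}"
```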

  • @early-riser18 7 months ago

    Thank you for the rundown - very helpful. Could you add a link to the code written please? Some code in the Dockerfile is hidden by the right-side screen fold and has to be guessed. Thanks :)

  • @not_saboor 7 months ago +1

    Can you explain the part on Jinja templating you mentioned at 3:40?

    • @thedataguygeorge 7 months ago

      Sure! What specifically about it are you interested in learning more about?
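
As a starting point while that question is open, here is a minimal illustration of Jinja templating in this context (not from the video; it assumes the jinja2 package is installed): a template string is rendered with concrete values to produce a spark-submit command.

```python
# Minimal Jinja templating illustration: render a spark-submit
# command from a template string with placeholder variables.
from jinja2 import Template

template = Template(
    "spark-submit --master {{ master_url }} {{ app_path }}"
)
command = template.render(
    master_url="spark://spark-master:7077",
    app_path="/opt/spark/apps/job.py",
)
print(command)
# → spark-submit --master spark://spark-master:7077 /opt/spark/apps/job.py
```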

  • @josephdaquila2479 4 months ago

    So this tutorial would also help me set this up to where I'm running computations on a server?

  • @imanitrecruiterineurope4142 7 months ago

    Hi!
    It seems that the applications aren't taking any resources and are stuck in a loop on my end. What could be the cause?

  • @Jalabulajunx 7 months ago +1

    You don't have a requirements directory, so how will req/req.txt work?

    • @thedataguygeorge 7 months ago

      With Spark, you'll typically initiate a spark session and provide a list of requirements you need for that particular session
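
A common alternative to per-session requirements (an assumption about this setup, not necessarily what the video does) is to bake the requirements file into the image at build time, so the driver and every worker share the same Python environment:

```dockerfile
# Hypothetical fragment: the requirements path mirrors the req/req.txt
# mentioned in the question above; adjust to your repo layout.
COPY req/req.txt /tmp/requirements.txt
RUN pip install --no-cache-dir -r /tmp/requirements.txt
```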

  • @josephdaquila2479 4 months ago

    So the Spark workers could be more physical computers or multiple VMs?
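
In the video's setup the workers are containers on one host, but a standalone worker can run anywhere it can reach the master. A hedged sketch of starting a worker on another machine or VM; the image name and master hostname are assumptions:

```shell
# Hypothetical: run one more worker on a different host/VM, pointing
# it at the standalone master's address.
docker run -d --name extra-worker da-spark-image \
  /opt/spark/bin/spark-class org.apache.spark.deploy.worker.Worker \
  spark://spark-master-host:7077
```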

  • @stars-and-clouds 10 days ago

    Missed the part where you need to add the Spark conf file
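
For reference, a minimal conf/spark-defaults.conf of the kind a setup like this typically ships; every value below is a placeholder assumption, not the video's actual file:

```
spark.master                     spark://spark-master:7077
spark.eventLog.enabled           true
spark.eventLog.dir               /opt/spark/spark-events
spark.history.fs.logDirectory    /opt/spark/spark-events
```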

  • @royteicher 3 months ago

    Why are all the image names 'da-spark-image'? I get a pull access denied error. This tutorial is amazing and exactly what I was looking for, but I can't make it happen.

    • @Sudo801 2 months ago

      I also have this issue

    • @Sudo801 2 months ago +1

      So I finally got it working, even while getting the pull access denied prompt. My issue ended up being that the line "RUN curl downloads.apache.... " in the Dockerfile had an error I needed to fix for it to work.
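
A frequent cause of that curl failure is that downloads.apache.org only hosts current releases, so a pinned version's URL goes stale; archive.apache.org retains old releases. A hedged Dockerfile sketch of the fix described above; the Spark version and paths are assumptions:

```dockerfile
# Hypothetical fix: pin a Spark version and pull from the Apache archive,
# which keeps old releases available.
ENV SPARK_VERSION=3.5.1
RUN curl -fsSL "https://archive.apache.org/dist/spark/spark-${SPARK_VERSION}/spark-${SPARK_VERSION}-bin-hadoop3.tgz" \
      -o spark.tgz \
 && tar -xzf spark.tgz -C /opt \
 && mv /opt/spark-${SPARK_VERSION}-bin-hadoop3 /opt/spark \
 && rm spark.tgz
```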

    • @rahulgoala1991 1 month ago

      I am facing the same issue... what changes are required to make it work? Please help.

    • @АртёмМеркулов-ю3к 15 days ago

@Sudo801 Thank you for this comment!

  • @ritwikverma2463 2 months ago

    I am not able to create the .env.spark file on my MacBook M1, please share a solution

  • @whramijg 1 month ago

    So you came up with this all by yourself?

  • @csmithDevCove 9 months ago +1

    What about connecting spark-nlp to this?

    • @thedataguygeorge 9 months ago

      You would just want to add it to be installed within the Docker image!
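
Following the reply above, "installed within the image" could look like the fragment below; this is a hedged sketch, and pinning a version compatible with your Spark release is left to the reader:

```dockerfile
# Hypothetical fragment: install the Spark NLP Python package at build
# time; the matching jars can then be pulled at session start via the
# spark.jars.packages config.
RUN pip install --no-cache-dir spark-nlp
```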

  • @artemqqq7153 3 months ago

    Is it possible to build this without the Makefile? It is challenging to install it on Windows...
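
A Makefile is just a shortcut for plain commands, so it can be bypassed entirely. A hedged sketch of the likely equivalents; the image name, service name, and worker count are assumptions based on the video, so adjust to your compose file:

```shell
# Hypothetical make-free workflow on Windows (PowerShell or cmd):
docker build -t da-spark-image .
docker compose up -d --scale spark-worker=2
docker compose down
```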

  • @ccc_ccc789 9 months ago

    Thanks

  • @josephdaquila2479 4 months ago

    Please also explain daemons vs. tasks

  • @rafaellourenco4599 7 months ago +2

    Bro, you skipped all the bug stuff

    • @thedataguygeorge 7 months ago +1

      Sorry, I was solving them off camera, but I will make sure to show more of the troubleshooting process next time!

    • @rafaellourenco4599 7 months ago +2

@thedataguygeorge can you share a repo with this project?

  • @Jalabulajunx 7 months ago

    I am always getting "entrypoint.sh not found", has anyone figured it out?
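
On Windows checkouts this error usually comes from CRLF line endings breaking the script's shebang, or from a missing executable bit. A hedged Dockerfile sketch of the common fix; the script path is an assumption:

```dockerfile
# Hypothetical fix: strip carriage returns (a CRLF shebang produces the
# misleading "not found" error) and mark the entrypoint executable.
COPY entrypoint.sh /opt/spark/entrypoint.sh
RUN sed -i 's/\r$//' /opt/spark/entrypoint.sh \
 && chmod +x /opt/spark/entrypoint.sh
ENTRYPOINT ["/opt/spark/entrypoint.sh"]
```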