First, congratulations on sharing your knowledge. A question: I already have a Hadoop cluster created and would like an IDE to make it easier to write scripts in PySpark and even Scala. Doesn't Google have something free that comes already installed? What do you recommend? Thank you. Fernandes, Brasil
Thank you, Fernandes! You can use Jupyter Notebooks on Dataproc for exactly this purpose. You can also integrate with VSCode: cloud.google.com/code/docs/vscode.
Good tutorial. Could you please create a video on how to use spark-submit to execute jobs on all available clusters in a big data scenario on GCP?
How do I connect PySpark to BigQuery?
How do I access the Spark UI from Dataproc?
I'm on Windows. How can I run the scripts?
Thanks for the video, very easy to follow.
You're welcome! Happy it helped.
I could not find the bash script to create the cluster on Dataproc.
Hi Milad, the download link is in the description; it contains all the scripts.
I tried to use an EDF format file