❌ Apache Spark & Jupyter on Google Cloud Dataproc Cluster ❌ Spark + Jupyter + Dataproc

Поделиться
HTML-код
  • Опубликовано: 1 ноя 2024

Комментарии • 15

  • @DecisionForest
    @DecisionForest  4 года назад +1

    Hi there! If you want to stay up to date with the latest machine learning and deep learning tutorials subscribe here:
    ruclips.net/user/decisionforest

  • @shaz-z506
    @shaz-z506 4 года назад

    Good tutorial, could you please create a video on how to use spark-submit to execute jobs on all available cluster in a big data scenario on GCP.

  • @fernandes7949
    @fernandes7949 3 года назад

    first congratulations on sharing knowledge.
    A question in the case I already have a hadoop cluster created and would like an IDE to facilitate the creation of script in pyspark and even scala.
    Wouldn't google have anything already installed free? What do you recommend me. Thank you Fernandes Brasil

    • @DecisionForest
      @DecisionForest  3 года назад

      Thank you Fernandes! Well you can use Jupyter Notebooks on Dataproc for exactly this purpose. You can also integrate with VSCode: cloud.google.com/code/docs/vscode.

  • @patricciaavila
    @patricciaavila Год назад

    How do connect the PySpark to BigQuery?

  • @manojselvakumar4262
    @manojselvakumar4262 2 года назад

    How to access the Spark UI from Dataproc?

  • @calendr13
    @calendr13 3 года назад

    I have windows, how can I do for the scripts?

  • @imayzoo
    @imayzoo 4 года назад

    Thanks for the video, very easy

  • @miladto
    @miladto 3 года назад

    I could not find the bash script to create cluster on Dataproc

    • @DecisionForest
      @DecisionForest  3 года назад

      Hi Milad, the link to the download is in the description, it contains all the scripts.

  • @puggyk4220
    @puggyk4220 3 года назад

    I tried to use edf format file