Step by Step Tutorial Setup Jupyter Notebook to run R & Python with Apache Spark on MacOS

Поделиться
HTML-код
  • Опубликовано: 18 сен 2024
  • This video tutorial was created as a guide to setup Jupyter notebook to run R and Python with Apache Spark on MacOS. If this tutorial is helpful, please like it and subscribe. Thanks. Commands used in the video are following (please expand):
    If you do not already have Python 3.6 or better, you can download and install form:
    www.python.org...
    Check if pip is already installed, if not, install pip from:
    pip.pypa.io/en...
    Install virtualenv
    pip install virtualenv
    Create virtual environment
    python -m virtualenv myspark
    Activate virtual environment (Mac or Linux):
    ./myspark/bin/activate
    Activate virtual environment (Windows):
    . myspark/bin./activate
    Install notebook within virtual environment:
    pip install notebook
    Make sure JDK 17 has been setup on your computer, if not you need to download and setup JDK 17
    To download JDK 17:
    www.oracle.com...
    Setup environment variable:
    JAVA_HOME and add JAVE_HOME/bin to the path
    Download and Setup Spark from:
    spark.apache.o...
    Download Spark compress tar file (tgz), expand and rename the expanded folder to spark
    Setup environment variable SPARK_HOME to the path of spark folder
    Add SPARK_HOME/bin to the path
    Install R and Rstudio:
    posit.co/downl...
    Then Launch R in command line window, run following commands under R command prompt:
    install.packages("usethis",repos = "cran.us.r-proje...")
    install.packages("pkgdown",repos = "cran.us.r-proje...")
    print("Install devtools begin")
    install.packages("devtools",repos = "cran.us.r-proje...")
    print("Install IRkernel begin")
    devtools::install_github("IRkernel/IRkernel", force = TRUE)
    system.file('kernelspec', package = 'IRkernel')
    print("kernalspec begin")
    IRkernel::installspec()

Комментарии •