Spark Standalone Architecture

Поделиться
HTML-код
  • Опубликовано: 17 окт 2024

Комментарии • 25

  • @rohitbhagwat3031
    @rohitbhagwat3031 3 года назад +9

    Excellent!!!! Hats off to you teaching. Don't stop teaching. Each and every point got cleared. Keep it up.

  • @ittzzmalind9101
    @ittzzmalind9101 3 месяца назад +1

    Bro I have a question.
    In this video data was distributed to two nodes even before executors are created ( before program logic starts ryt)
    But in my program if I read data from another system (like s3) during program execution only data will be loaded to cluster.
    So program execution ( creation of executor) should start first ryt before distributing data?

  • @Devarati
    @Devarati 2 года назад

    Fantastic teaching with amazing clarity, point by point explanation. Thank you.

  • @PavanKumar-vi7hd
    @PavanKumar-vi7hd Год назад

    Your way of teaching is excellent Gowtham

  • @channuangadi8506
    @channuangadi8506 Год назад

    This is the best explanation i ever saw
    Very indepth explanation

  • @gsekhar4367
    @gsekhar4367 2 года назад

    It's crystal clear i like ur way of teaching

  • @gsekhar4367
    @gsekhar4367 2 года назад +1

    Brother here master and worker are deamons or physical servers?

  • @sowjanyarbhat4715
    @sowjanyarbhat4715 2 года назад

    Is it fine to use Spark Standalone for a POC project , or should I use with Hadoop itself? The requirement is basically to migrate Django Cron jobs what we run on Celery(with RabbitMQ) to Spark .

  • @manikandankbk7633
    @manikandankbk7633 3 года назад +2

    Bro, when I run in standalone mode with 1 master and 2 workers (in laptop, I have made host as master and first worker... (and) 2nd worker in VM)... Only when I put the same input file in both the worker node in gives me correct output. For eg. I need to put the same 1gb input file in both workers.
    If I partition them into 512mb (I did it manually, but the location of file in both worker is same) , first half of the file in one worker and other half in second worker, it gave incorrect results. Why is it so?

    • @atulsingh9202
      @atulsingh9202 Год назад

      When manually partitioning data in a Spark standalone cluster, ensure that each partition resides on a separate worker node and has unique file paths to avoid data duplication and incorrect results

    • @exit-bag
      @exit-bag 28 дней назад

      Spark typically requires a distributed file system like hdfs I think, otherwise you need to make each file available to all the executors manually

  • @sonip472
    @sonip472 2 года назад

    Excellent video, very well explained.

  • @dhananjayreddy9998
    @dhananjayreddy9998 2 года назад

    In Standalone mode, if there is no requirement of splitting the file into multi nodes what is the purpose of having worker nodes(multi nodes).

    • @atulsingh9202
      @atulsingh9202 Год назад

      In Spark's standalone cluster mode, worker nodes provide resource isolation, fault tolerance, scalability, parallelism, and efficient resource management, even if data splitting is not required.

  • @bommanasravan8279
    @bommanasravan8279 4 месяца назад

    Excellent

  • @BigDataWithSky
    @BigDataWithSky 9 дней назад

    Pls post current spark architecture again

  • @parthlodaria5656
    @parthlodaria5656 3 года назад

    Great Explanation!!

  • @parammani4717
    @parammani4717 3 года назад

    Hi, ji can you upload spark architecture with yarn

  • @rajashekarm6162
    @rajashekarm6162 2 года назад

    Fantastic brother, but in real-time most entities uses Spark with Yarn Deployment mode, these many months you still didn't upload that video.!!!

    • @dataengineeringvideos
      @dataengineeringvideos  2 года назад

      Hi bro
      Thanks
      Please find the video for yarn deployment in spark
      ruclips.net/video/3c62-F6bu5k/видео.html

  • @bharathmamidisetti3896
    @bharathmamidisetti3896 3 месяца назад

    Thalaiva❤❤❤❤

  • @RohitSaini52945
    @RohitSaini52945 2 года назад

    Thank you !

  • @SANGEETHAD-mb2hj
    @SANGEETHAD-mb2hj 2 года назад

    In general
    How will u read data line by line in python

    • @gsekhar4367
      @gsekhar4367 2 года назад

      Really its crystal clear explantion i like ur videos way of explantion tq

  • @raviy10
    @raviy10 2 года назад

    Thank you !!!