Apache Spark Memory Management | Unified Memory Management

Поделиться
HTML-код
  • Опубликовано: 17 окт 2024

Комментарии • 37

  • @0yustas0
    @0yustas0 3 года назад +6

    spark.memory.fraction expresses the size of M as a fraction of the (JVM heap space - 300MiB) (default 0.6). The rest of the space (40%) is reserved for user data structures, internal metadata in Spark, and safeguarding against OOM errors in the case of sparse and unusually large records.

  • @bramar1278
    @bramar1278 4 года назад +2

    I really appreciate your time and efforts in making quality videos. Please explain us how these different memory allocations cause problems or exceptions. How to solve these exceptions or problems. A screen shot of the possible issues and code/configuration changes to solve the issue will be really helpful and we would be really greatful if you could provide these details as well. Once again I appreciate your work and efforts

  • @anupambiswas2588
    @anupambiswas2588 4 года назад +7

    Very well explained. But a follow up video on the practical implementation will be appreciated. Anyways great effort!!

  • @sanskarsuman589
    @sanskarsuman589 Год назад

    you said that reservedmemory is a part of executor memory but in diagram you are showing 1gb of executor memory plus 300 m of reserved memory???

  • @gobieee1
    @gobieee1 3 года назад

    good explanation, btw, Is there any way to increase the executor memory dynamically?

  • @ahyanroboking9237
    @ahyanroboking9237 2 года назад

    how do we find out if any executor is overallocated memory with --executor-memory but actually the job needs very less memory than provided executor memory parameter . Does this cause spark executer to reserve this memory and not being useful for other executors ?

  • @shubhamshingi4657
    @shubhamshingi4657 3 года назад +1

    very good explanation and up to the point, thanks for this

  • @nibeshranjanprusty9818
    @nibeshranjanprusty9818 3 года назад

    Hi Sir,the doubts fog clearing from mind after watching your spark videos,kindly make one session on real-time project from requirement to deployment it will very helpful ,Thank you.

  • @Manisood001
    @Manisood001 3 года назад

    Hello sir,
    I have some questions if you could answer in free time
    when i read spark.read.csv(and provide inferSchema=True) Does it take all rows to guess the datatype of a column
    what is sampleRatio option in spark.read.csv ? is it related to infershema
    can i tell spark to use all rows while infering the schema for a column

  • @shyamsundar8665
    @shyamsundar8665 3 года назад

    What is the reserved used for or responsible for ?

  • @gopinathchandrasekaran5235
    @gopinathchandrasekaran5235 3 года назад +1

    Great Video !!

  • @saurav0777
    @saurav0777 4 года назад +1

    Can u pls show how to monitor this memory usage and distribution via spark URL in your next upload?

  • @rajendraprasad8656
    @rajendraprasad8656 3 года назад

    Hi, Your videos are giving a good real time knowledge on spark and i thank you for that.. Could you please make a video on how to submit spark code(Pyspark) using shell script. also how to submit a spark job using shell script if both can be done differently. Thanks in advance

  • @SuperDinu4u
    @SuperDinu4u 4 года назад +1

    If execution memory can evict blocks of data from storage memory, what happens to those evicted blocks if they are to be consumed again Will they be computed again and stored again

    • @pratiksingh9480
      @pratiksingh9480 3 года назад

      @Data Savvy Same question I too have . When execution memory evicts storage memory , does it spill to disk ?

  • @saurabhgulati2505
    @saurabhgulati2505 3 года назад +1

    Where does resource manager such as yarn overhead lies in executor memory?

  • @i_ambhosale
    @i_ambhosale 2 года назад

    If data size is 16Gb and memory is 20GB what will use cache or persist?

  • @sujaykbful
    @sujaykbful 3 года назад +1

    Can you explain erasure coding vs replication

  • @gsekhar4367
    @gsekhar4367 2 года назад

    Can u pls do video on data skewness and on schema registry

  • @sureshu5671
    @sureshu5671 3 года назад

    Good information. thanks

  • @engineerbaaniya4846
    @engineerbaaniya4846 Год назад

    Well explained

  • @shubhamgupta9375
    @shubhamgupta9375 4 года назад

    How to verify this if storage memory is not evicting execution memory

  • @projjalchakraborty1806
    @projjalchakraborty1806 4 года назад

    How to resolve OOM using above discussed concept???

  • @fuatylmaz9132
    @fuatylmaz9132 4 года назад +2

    I respect your effort but I expected more detailed video not such a basic video

  • @bramar1278
    @bramar1278 4 года назад

    Hi Harjeet, I am experienced professional who need some help in understanding current market conditions for future planning purposes (a kind of short call for mentoring). Please let me know if you can provide 10-15 min time.

  • @Manisood001
    @Manisood001 4 года назад +1

    amazing

  • @tanushreenagar3116
    @tanushreenagar3116 2 года назад

    nice

  • @pradipchinuke1539
    @pradipchinuke1539 4 года назад +1

    १ video on How to validate data..?

    • @DataSavvy
      @DataSavvy  4 года назад +1

      Thanks... Added this to my list

    • @zillala4553
      @zillala4553 3 года назад

      @@DataSavvy thank you 👍👍

  • @rajdeepsinghborana2409
    @rajdeepsinghborana2409 3 года назад

    Sir
    Is Hadoop and Spark are dying 😞

  • @BishalKarki-pe8hs
    @BishalKarki-pe8hs 4 месяца назад

    vak mugi