What is RDD? Why it is used in SPARK? Spark in Tamil #4

Поделиться
HTML-код
  • Опубликовано: 10 окт 2016
  • Spark tutorial.
    Spark class room
    training
    www.tamilboomi.com
    For More :
    Whatsapp : +91 9619663272
    Facebook : tamilboomiofficial
    aruthecoach
    email : arumugamsip@gmail.com
    Twitter : @aruthecoach
    Spark In tamil..! Tamilboomi..!
    What is RDD??
    It is an immutable distributed collection of objects.
    Csv, txt json...
    We cannot change the data in RDD..!
    Resilient Distributed Datasets
    Each dataset in RDD is divided into logical partitions,
    which may be computed on different nodes of the cluster.
    Properties:
    Immutable
    Partitioned
    Fault tolerant -- liniage..!
    Lazily evaluated
    Can be persisted.
    ~-~~-~~~-~~-~
    Please watch: "How Consultancies & Training Institutes cheats US? | Are you a FRESHER SEARCHING FOR A JOB?"
    • How Consultancies & Tr...
    ~-~~-~~~-~~-~

Комментарии • 17

  • @prakashmd8553
    @prakashmd8553 7 лет назад +3

    Thanks ji . Hard to find these kind of tutorials in our own language . Keep going

  • @ranjithkumarkarthic8696
    @ranjithkumarkarthic8696 Год назад

    Simply superb

  • @mohamedhamith
    @mohamedhamith 7 лет назад +1

    thank you bro... you are doing excellent work... no one can stop ur learning apart from God

  • @Madasami_Muthumalai
    @Madasami_Muthumalai 6 лет назад +1

    Nice explanation. Feeling happy to see the wonderful explanation in tamil

  • @pragadeeshdharsha4703
    @pragadeeshdharsha4703 7 лет назад +1

    ur explanation is good.keep adding more videos and tutorials on spark.

  • @srikrishnarr6553
    @srikrishnarr6553 4 года назад

    Good and done in lay man language ....Ji expect many such videos from you ...I am subscribing sir

  • @mohamedsameer7300
    @mohamedsameer7300 7 лет назад +1

    nice good bro move ahead....!!!!!!!!!!

  • @mohamedhamith
    @mohamedhamith 7 лет назад +1

    please provide java course.... ur explanation is too good

  • @krish3486
    @krish3486 Год назад

    Sir clear this doubt, is spark rdd convert the data like CSV , text , json files to rdd files in programming.

  • @rajendiraprasath9261
    @rajendiraprasath9261 2 года назад

    next video eppo varum

  • @timeandvalue
    @timeandvalue 4 года назад

    Babu..
    Small correction.
    .
    CSV means comma separated values

  • @yahoonan21
    @yahoonan21 7 лет назад +2

    Hi Aru ji, How frequently data is stored in disc?. Though it may be a streaming data, lots n lots of data are transacting for a period of time. so How can we provide reliability in capturing a data from servers(like twitter,FB, Fouresquare),

    • @Tamilboomi
      @Tamilboomi  7 лет назад +1

      Hi Kiruba,
      1) We can specify when the data needs to be stored in to the disc Or if our memory (RAM) size is less than the amount of data to be processed it automatically stores the data in to disc.So Its our call.
      2) For reliability on streaming data we can store it in any underlying storage like HDFS.