Batch Processing Example using Amazon Data Pipeline | S3 to DynamoDB using Amazon EMR | Tech Primers

Поделиться
HTML-код
  • Опубликовано: 28 янв 2025

Комментарии • 26

  • @TechPrimers
    @TechPrimers  4 года назад +3

    ⏱ Chapter Timestamps
    ===================
    0:00 - Introduction
    00:28 - Architecture Flow
    2:31 - Pre-requisite: Amazon S3 Buckets Creation
    3:44 - Pre-requisite: Amazon DynamoDB Table Creation
    5:15 - Pre-requisite: Amazon S3 file format Creation
    8:20 - Pre-requisite: Amazon Data Pipeline Creation
    12:40 - Data Format Configuration in Data Pipeline
    14:14 - Activate Data Pipeline
    17:52 - Data Loaded into DynamoDB
    19:03 - Serverless Data Pipeline Trigger using EventBridge and Lambda
    20:06 - Summary

  • @rajeshanneboina4729
    @rajeshanneboina4729 2 года назад +2

    Interviewer was asked me about Batch processing but I explained him Cron Job using Spring scheduler, but now I come to know what's batch processing from in detail, thanks a lot and it means a lot for me...

  • @ilyasabbasi42
    @ilyasabbasi42 4 года назад +10

    I love your dedication brother....late night doing great job for the dev community....👍👍👍 @Tech Primer

  • @keshavamugulursrinivasiyen5502
    @keshavamugulursrinivasiyen5502 2 года назад +1

    Nice demo, useful.

  • @rajish15
    @rajish15 4 года назад +1

    Wow! Great bro... I love ur dedication.. great job 🔥🙏

  • @mayanktripathi4u
    @mayanktripathi4u 3 года назад +1

    Thanks for the video, this really helps.
    Could you please suggest what are the options we could use to concatenate two file based on common column from S3. For this should I use Lambda or some other services may be Data Pipeline / Glue etc?

  • @arvindimage
    @arvindimage 4 года назад +2

    As always, Amazing and Awesome.
    Very informative & insightful. Thanks, @techprimers

  • @vishalmatam
    @vishalmatam 4 года назад +2

    Great video and congrats on being the AWS Community Builder! AWS DataPipeline looks it's a competitor of Airflow, the architect UI is similar to Airflow DAGs, interesting!. Also, curious to know what you use for presentations Keynote or PowerPoint?

    • @TechPrimers
      @TechPrimers  4 года назад +2

      Thank you Vishal. I use Google Slides.

  • @StyleTrick
    @StyleTrick 4 года назад +2

    Great video Ajay once again :). Is the data ingestion continuous or done in intervals?

    • @TechPrimers
      @TechPrimers  4 года назад +1

      Since this is batch it's done in adhoc fashion. Whatever data is present in the file is injected all at once

  • @vishalsonker8768
    @vishalsonker8768 Год назад

    Could you please give some clarity on EC2 instance role creation at the time of data pipeline creation?

  • @manojrajendran2260
    @manojrajendran2260 3 года назад +1

    Great video.. Thumbs up..

  • @kFarheenMoin
    @kFarheenMoin 4 года назад

    Nice video, is is possible to run multiple instances of a single lambda in parallel to perform serverless processing?

  • @literallynobody4840
    @literallynobody4840 3 года назад

    Nice one.One question.Next time when another file is uploaded in S3 and we activate pipeline then will it try to upload the first file as well..if yes how to avert that?

  • @rakeshroshan829
    @rakeshroshan829 4 года назад +1

    Great job...👍👍👍

  • @aakashironic
    @aakashironic 2 года назад

    Is it possible to do update item? Or it will only do put item in dynamo db?

  • @anilchaudhry804
    @anilchaudhry804 3 года назад

    can you build a scalable pipeline working on real time data and use services like kinesis

  • @sbsharvani
    @sbsharvani 3 года назад +1

    Brother, how do you know these many topics? Really great 👍👍

    • @TechPrimers
      @TechPrimers  3 года назад +2

      By learning anything which is new and trying to practice that in my free time

  • @yoganandareddypolimera4109
    @yoganandareddypolimera4109 3 года назад

    please upload airflow cluster setup and data pipeline

  • @SoumilShah
    @SoumilShah 2 года назад

    There are much cheaper way such as athena CTAS or even Glue Nice video concept is good

  • @manojrajendran2260
    @manojrajendran2260 3 года назад +1

    Guys if you're stuck save the file as .txt and change data type to small "s", "n", "l".

  • @praveenj3112
    @praveenj3112 4 года назад

    There is developer can do here