GCP Dataflow Batch data processing

  • Published: Dec 11, 2024

Comments • 21

  • @sandeepguptha6440
    @sandeepguptha6440 2 years ago +4

    Hello Anjan,
    Please provide the dump data in the description of all your videos so that it is easy to follow your procedure. Thanks.

  • @prabhatsri81
    @prabhatsri81 1 year ago

    Hi Anjan, you mentioned referring to the architectural diagram at the top of the slide, but at the top you have given steps. Which architectural diagram are you referring to?

  • @ashwinjoshi3331
    @ashwinjoshi3331 1 year ago +1

    Thanks a lot for the video. It's really helpful. Just one question: I need to take data from an on-premises Oracle database and load it into BigQuery. Is there any article or document I can refer to for doing this with Apache Beam in Python? I tried searching but could not find much detail. Any input would be appreciated.

    • @anjangcpdataengineering5209
      @anjangcpdataengineering5209 1 year ago

      Please wait for the next video; you will see one such use case there. I will publish it by this weekend.

  • @thecloudbaba8668
    @thecloudbaba8668 8 months ago

    Good content on Dataflow. Could you also please share the bulk.csv file? It's not uploaded to the repo.

  • @aravindgovindaraj
    @aravindgovindaraj 2 years ago

    Thanks for your session. May I know how to schedule this job, say daily at 1 AM?

  • @madhavab1533
    @madhavab1533 2 years ago +2

    Thank you

  • @jeeruveeresh8942
    @jeeruveeresh8942 1 year ago

    Hi Anjan, could you please create a pipeline that extracts data from different sources, such as on-prem and cloud, and loads it into GCP?

  • @learngooglecloud
    @learngooglecloud 1 year ago

    Hello Anjan, Thanks for this video. Can you please provide the csv files?

  • @sagar_patro
    @sagar_patro 1 year ago +1

    Hi Anjan, I feel that a simple example would be easier for beginners to follow.

  • @Maturshab2210
    @Maturshab2210 5 months ago

    How do we schedule these, sir?

  • @avinashboyina2402
    @avinashboyina2402 1 year ago +1

    Hi sir, your way of teaching is extraordinary. Can you please share the CSV file as well?

  • @sunkarihari-x6w
    @sunkarihari-x6w 4 months ago

    Hi, can you share the sample Excel data file for this video?

  • @kesavaram5401
    @kesavaram5401 2 years ago

    Thanks for the very useful information. I tried the same example in my GCP project but I am getting:
    Traceback (most recent call last):
      File "/usr/lib/python3.9/runpy.py", line 197, in _run_module_as_main
        return _run_code(code, main_globals, None,
      File "/usr/lib/python3.9/runpy.py", line 87, in _run_code
        exec(code, run_globals)
      File "/home/learningtrading17/bulkdeal_aggr.py", line 4, in <module>
        import apache_beam as beam
    ModuleNotFoundError: No module named 'apache_beam'
    -bash: --temp_location: command not found
    Could you please help me with this error?

    • @anjangcpdataengineering5209
      @anjangcpdataengineering5209 2 years ago

      Are you trying this in Cloud Shell or in a Jupyter notebook?

    • @kesavaram5401
      @kesavaram5401 2 years ago

      @anjangcpdataengineering5209 I am trying it in Cloud Shell.

    • @anjangcpdataengineering5209
      @anjangcpdataengineering5209 2 years ago

      @kesavaram5401 Install Apache Beam for GCP in a Python virtual environment and try again; hopefully it will work.
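
      The ModuleNotFoundError above means the interpreter that ran the script could not find Apache Beam. A minimal sketch, not taken from the video, of how one might check this from inside the environment before launching the pipeline. The helper names `beam_is_installed` and `in_virtualenv` are hypothetical; only the Python standard library is used.

```python
import importlib.util
import sys


def beam_is_installed(module_name: str = "apache_beam") -> bool:
    """Return True if this interpreter can locate the given top-level module."""
    return importlib.util.find_spec(module_name) is not None


def in_virtualenv() -> bool:
    """Return True if the interpreter is running inside a venv-style environment."""
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    # Show which Python is actually running; it may differ from the one pip used.
    print(f"Interpreter: {sys.executable}")
    print(f"Inside a virtual environment: {in_virtualenv()}")
    if beam_is_installed():
        print("apache_beam is importable from this interpreter.")
    else:
        # Same install command quoted later in this thread.
        print("apache_beam not found; try: pip install 'apache-beam[gcp]'")
```

      If the check reports that the module is missing, the package was probably installed for a different interpreter than the one running the script, or the virtual environment was not activated in the current shell.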

    • @kesavaram5401
      @kesavaram5401 2 years ago

      @anjangcpdataengineering5209 I ran pip install 'apache-beam[gcp]' in the virtual environment:
      Collecting google-auth=1.18.0
      Using cached google_auth-1.34.0-py2.py3-none-any.whl (152 kB)
      Using cached google_auth-1.33.1-py2.py3-none-any.whl (152 kB)
      It does not move ahead after reaching the steps above.

  • @1itech
    @1itech 1 year ago

    Bro, if you want more subscribers, please add the GitHub path.