How GPUDirect RDMA massively speeds up machine learning at AWS

  • Published: Dec 2, 2024
  • GPUDirect RDMA was invented to massively speed up the ML frameworks that so many customers use to train their models on our large fleets of NVIDIA GPUs.
    Amr Ragab from EC2 Engineering took a few minutes at SC'22 to explain how it works and how easy it is to take advantage of.
    If you have ideas for technical topics you'd like us to cover in a future show, find us on Twitter (@TechHpc) and DM us your idea.
