Profiling CUDA Applications with nvprof, nsight systems and nsight compute.

Поделиться
HTML-код
  • Опубликовано: 30 сен 2024
  • Demo on howto use nvprof, NVIDIA Nsight Systems and Nsight Compute to profile and analyse CUDA code. Development and compiling (nvcc compiler) are used on Google Colab.
    This video is part of lecturer of "Hardware Accelerated Computing" by Vincent Claes

Комментарии • 1

  • @fpgabe
    @fpgabe  6 месяцев назад

    When you get the error: /bin/bash: line1: nsys: command not found you have to run the following code in a Jupyter cell:
    !wget developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/nsight-systems-2023.2.3_2023.2.3.1001-1_amd64.deb
    !apt update
    !apt install ./nsight-systems-2023.2.3_2023.2.3.1001-1_amd64.deb
    !apt --fix-broken install