Four Ways to Check if Ollama is Using Your GPU or CPU

  • Published: Jan 17, 2025

Comments • 6

  • @zandanshah 1 month ago +1

    Thanks, keep the good work going.

  • @igoriane93 22 hours ago

    In my specific case (7900XT), it tries to locate ROCm0. Since I am using Windows, my computer freezes.

  • @vmeow9895 2 months ago

    Why does the response_token/s gradually decrease after a while?
    I use an RX 6600 XT.

    • @TigerTriangleTech 2 months ago

      Hello and thanks for watching. I'm not sure, but in a chat situation, I would think it could be because it stores previous prompts in memory in order to "remember" the earlier conversation. How much it keeps could depend on the model's context window. I have not tested this, but that's my best guess. Someone else may have a better answer.

    • @vmeow9895 2 months ago +1

      @TigerTriangleTech Thanks for your reply to my question. Hope you can solve it later 😁
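
One way to test the guess above about response_token/s is to log the generation speed of each turn next to the prompt size. This is a minimal sketch, assuming the timing fields that Ollama's API returns with a finished response (`prompt_eval_count`, `eval_count`, and `eval_duration` in nanoseconds). The helper name `tokens_per_second` and the sample numbers are made up for illustration and are not measurements.

```python
def tokens_per_second(response: dict) -> float:
    """Generation speed for one response; eval_duration is in nanoseconds."""
    duration_ns = response["eval_duration"]
    if duration_ns <= 0:
        raise ValueError("eval_duration must be positive")
    return response["eval_count"] / (duration_ns / 1e9)


# Hypothetical sample values for illustration only, not real measurements.
turns = [
    {"prompt_eval_count": 50, "eval_count": 100, "eval_duration": 2_000_000_000},
    {"prompt_eval_count": 900, "eval_count": 100, "eval_duration": 4_000_000_000},
]

for i, turn in enumerate(turns, start=1):
    speed = tokens_per_second(turn)
    print(f"turn {i}: prompt tokens={turn['prompt_eval_count']}, {speed:.1f} tokens/s")
```

If the speed drops while `prompt_eval_count` keeps growing from turn to turn, that would be consistent with the context-history explanation. If it drops with a constant prompt size, something else is probably going on, for example part of the model being moved off the GPU.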