Ollama-Run large language models Locally-Run Llama 2, Code Llama, and other models

Поделиться
HTML-код
  • Опубликовано: 25 дек 2024

Комментарии • 68

  • @vishalnagda7
    @vishalnagda7 9 месяцев назад +9

    I'm feeling lucky that I got this video in my suggestions.

  • @divyaramesh3105
    @divyaramesh3105 9 месяцев назад +2

    Thank you Krish sir. In Building RAG from scratch ,sunny sir showed about Ollama. Both of you were giving foundational knowledge and updates in GenAI. It was very useful sir.

  • @mehdi9771
    @mehdi9771 9 месяцев назад

    We need a long versions videos like previously and thanks for your efforts ❤

  • @neerajshrivastava5600
    @neerajshrivastava5600 5 месяцев назад

    Krish, Fantastic Video and great explanation!!! Keep it up

  • @AjaySharma-jv6qn
    @AjaySharma-jv6qn 9 месяцев назад +1

    Content is helpful, thanks for your effort.🎉

  • @rajendarkatravath2207
    @rajendarkatravath2207 9 месяцев назад

    Thanks krish! for sharing this knowledge . what an amazing model it is .....!

  • @SomethingSpiritual
    @SomethingSpiritual 8 месяцев назад +4

    why ollama not taking full gpu? its taking full cpu only, pls guide

  • @manjeshtiwari7434
    @manjeshtiwari7434 9 месяцев назад +3

    Thank You so much for a such a great video , I have a query , I am getting very slow response does the speed of response depends on system config , I have chekced out system use and while running it isn't using much resource , can you tell how can we increase response speed

  • @kenchang3456
    @kenchang3456 9 месяцев назад

    Hey Krish, thanks for doing this video in Windows.

  • @deeks_edits
    @deeks_edits 9 месяцев назад

    You are the best!🤓

  • @Lastvideoz
    @Lastvideoz 4 месяца назад +1

    Very good explanation, I have question can I train this model for specific taks mean features extraction or others?

  • @ankkol2011
    @ankkol2011 9 месяцев назад

    Thankyou so much for these videos

  • @NISHANTKumar-ct3pb
    @NISHANTKumar-ct3pb 9 месяцев назад +1

    Thanks , it's great video. Wanted to ask when we say local what is the configuration of local is it a cpu or GPU based system? Are models compressed / quantized or same as original ? Is there a model size limitation vs local system config?

  • @computerauditor
    @computerauditor 7 месяцев назад

    Really insightful krish!!

  • @amoghjain702
    @amoghjain702 2 месяца назад

    Thanks Krish for the great video. I get Error: llama runner process no longer running: -1 when I try to run the model. Did you encounter this issue?

  • @roshanchandel7929
    @roshanchandel7929 8 месяцев назад

    The heroes we need!!

  • @omarnahdi3380
    @omarnahdi3380 9 месяцев назад +1

    Hey sir😄, please make a video on BioMistral( a LLM trained on Medical and Scientific Data). It would perfectly fit your AI Nutriationist. Thanks for your daily dose of GenAI

  • @jacobashwinmathew3763
    @jacobashwinmathew3763 9 месяцев назад

    Can you make a complete video of production ready open source LLM basically LLMOps

  • @lionelshaghlil1754
    @lionelshaghlil1754 9 месяцев назад +1

    Thanks Krish, the briliant, innovative and master of the AI 😊, I have a question please related to the hosting, so assume I'd like to implement my solution on a server, will I need to have both, OLAMA and my app in two seperate dockers? they would communicate together? or they could be implemented in one single docker?

    • @krishnaik06
      @krishnaik06  9 месяцев назад +1

      It can be implemented in one docker

    • @ayushmishra5861
      @ayushmishra5861 7 месяцев назад

      Have you got clarity on the same, can you please share.

  • @nasiksami2351
    @nasiksami2351 9 месяцев назад

    Great tutorial! Can you please make a video on finetuning model on custom csv dataset and integration with Ollama.
    For instance, consider I have class imbalance problem in my dataset. Can I finetune a model, then ask it in Ollama, to generate more samples of minority class using the finetuned model?

  • @apurvakulkarni9699
    @apurvakulkarni9699 2 месяца назад

    very nice video

  • @Shrieenidhi
    @Shrieenidhi 3 месяца назад

    If the model is installing locally means, will it take space of the RAM?

  • @marcoaerlic2576
    @marcoaerlic2576 6 месяцев назад

    Thanks for the video.

  • @BelhsanMohamed
    @BelhsanMohamed 8 месяцев назад

    as always thanks for the information

  • @shashank046
    @shashank046 6 месяцев назад

    Hi, how do I use gpu on open web ui? My model response is really slow and is not using gpu even though is used the command for using gpu for installing as mentioned on the open web ui GitHub page ..

  • @usingsk
    @usingsk 7 месяцев назад

    Thanks for Sharing knowledge. Can we fine tune with company domain content in downloaded model and the data is not shared. I mean it comply with IPR if we use locally

  • @velugucharan8096
    @velugucharan8096 9 месяцев назад

    Sir please complete the fine tuning llms playlist as much as possible sir

  • @krishnaprasadsheshadri6206
    @krishnaprasadsheshadri6206 9 месяцев назад

    Can we get a video about reading tables using unstructured and such frameworks

  • @tharunps8048
    @tharunps8048 9 месяцев назад

    Since it is running locally, using this model with organization's data doesn't expose it right ?

  • @kashishvarshney2225
    @kashishvarshney2225 9 месяцев назад

    hello sir, what is the minimum system configuration for ollama

  • @KumR
    @KumR 9 месяцев назад

    Do we need to download the entire 7gb llama2 locally to use with ollama

  • @NeelamDevi-z6e
    @NeelamDevi-z6e 9 месяцев назад

    Great content Krish...Need these coding files kindly share those

  • @sawankumar2088
    @sawankumar2088 4 месяца назад

    Can we just download and use or do we require any meta-ai api key as well?

  • @YashDeveloper-rq2yc
    @YashDeveloper-rq2yc 9 месяцев назад

    Bro using these techniques can I convert it as superb ai assistant? And what capabilities can use?

  • @nagasudha6928
    @nagasudha6928 9 месяцев назад

    Hi Krish This is Sudha from ISRO Hyderabad, I would like to know the documents to be provided for ollama and get the answers from it

  • @ashishdayal172
    @ashishdayal172 7 месяцев назад

    hii krish, i am facing error creating modelfile .Please help

  • @VishalTank-vk5ju
    @VishalTank-vk5ju 6 месяцев назад

    Hello, krish, I am facing an issue with the Ollama service. I have an RTX 4090 GPU with 80GB of RAM and 24GB of VRAM. When I run the Llama 3 70B model and ask it a question, it initially loads on the GPU, but after 5-10 seconds, it shifts entirely to the CPU. This causes the response time to be slow. Please provide me with a solution for this. Thank you in advance.
    Note:- GPU load is 6-12 % and CPU load is 70% .

  • @kavitajakkali5030
    @kavitajakkali5030 3 месяца назад

    I Installed ollama in my local system but getting responses is taking very long time what can i do for that one ?

  • @Notknows
    @Notknows 9 месяцев назад

    Nice video sir

  • @hassanahmad1483
    @hassanahmad1483 8 месяцев назад

    How to deploy these custom gpts...?

  • @rajarshidey424
    @rajarshidey424 7 месяцев назад

    How can we get the code?

  • @sanjaynt7434
    @sanjaynt7434 9 месяцев назад

    Can this read a document and answer my questions on that document can it.

  • @pssab8
    @pssab8 9 месяцев назад

    Excellent videos. I set up mistral model locally on ubuntu20.04 and found that it is taking more than a minute for every response .Running in cpu mode only.Can you suggest me to improve the performance.

    • @amazingedits9298
      @amazingedits9298 9 месяцев назад

      This models are running on your computer hardware.So it requires a good hardware like gpu or something for creating quicker responses

  • @starkgaming1425
    @starkgaming1425 9 месяцев назад

    Please release a step by step guide on how to fine tune Gemini API in Python.....I tried by refering to documents but encountered a lot of errors with OAuth Setup please...........!!!

  • @manasjohri2495
    @manasjohri2495 9 месяцев назад

    Can you please tell me how we can run this ollama on GPU right now it is working on CPU?

  • @susnatakanjilal703
    @susnatakanjilal703 9 месяцев назад

    Sir I need to create a custom text data set from common crawl.for Bengali language....and train llama2 using that...can you plz demonstrate similar project!?

  • @ranemghalion581
    @ranemghalion581 6 месяцев назад

    thankyou

  • @AjayYadav-xi9sj
    @AjayYadav-xi9sj 9 месяцев назад

    Make a video on Python framework of ollama. Make a end to end project and also host it somewhere where real people can use it

  • @mohammedalfarsi4361
    @mohammedalfarsi4361 9 месяцев назад

    are these model support arabic language ?

  • @copilotcoder
    @copilotcoder 9 месяцев назад

    Sir please create a codebase understanding model using ollama and test it on a opensource codebase

  • @VishalKumar-gv6gy
    @VishalKumar-gv6gy 8 месяцев назад

    Does it require GPU ?

  • @YashDeveloper-rq2yc
    @YashDeveloper-rq2yc 9 месяцев назад

    After installing it will work in offline?

  • @MotoEdge40-j3v
    @MotoEdge40-j3v 9 месяцев назад +1

    Every time we see a kid we ask him to say a poem and when you have so many llm models but you only want a poem on machine learning

  • @DeadJDona
    @DeadJDona 9 месяцев назад

    please finish that Chrome update 😢

  • @rishiraj2548
    @rishiraj2548 9 месяцев назад

    🙏💯👍

  • @Nagireddy-lw7rl
    @Nagireddy-lw7rl 6 месяцев назад

    Hi Krish sir I have need ollama chatbot python code provide me. I check with your Github.

  • @naveenkumarmaurya3182
    @naveenkumarmaurya3182 9 месяцев назад

    hi krsih i m getting this error
    Ollama run codella! 🐰💨
    (Note: I'm just an AI, I don't have personal preferences or the ability to run code, but I can certainly help you
    with any questions or tasks you may have!)

  • @parthwagh3607
    @parthwagh3607 5 месяцев назад

    Thank you so much krish. I am having problem running models downloaded from hugging face having safetensor file. I have these files in oobabooga/text-generation-webui. I have to use this for ollama. I followed everything, even created modelfile with path to safetensor directory, but it is not running >> ollama create model_name -f modelfile. Please help me.

  • @jatinchawla1680
    @jatinchawla1680 8 месяцев назад

    llm=ollama(base_url='localhost:11434',model="llama 2")
    TypeError: 'module' object is not callable
    Can someone pls help w this?