Getting Started on Ollama

Поделиться
HTML-код
  • Опубликовано: 21 янв 2025

Комментарии • 144

  • @milorad9301
    @milorad9301 10 месяцев назад +17

    Thank you, Matt! Please create more videos like this; they're really clear and simple.

  • @zerotheory941
    @zerotheory941 10 месяцев назад +22

    If you can make a video about crew AI explaining it as simply as you did here, you'd be my hero.

    • @edwardrhodes4403
      @edwardrhodes4403 9 месяцев назад +2

      And also Autogen and other agents like Devika and how to integrate them

  • @MonsieurGinger
    @MonsieurGinger 7 месяцев назад +3

    I just did all of this yesterday and still watched your video from start to finish. Very clear and concise. I look forward to your other videos.

  • @sdaiwepm
    @sdaiwepm 7 месяцев назад +3

    Thank you for such a helpful explanation. I wish more tech explainers and presenters were this clear and structured.

  • @D1s0rdr
    @D1s0rdr 7 месяцев назад +1

    Im so happy there's someone in the mix who actually has a career in AI/Dev. Seriously, really enjoying your content. Dont listen to any of these jerks.

  • @hiltonwong5419
    @hiltonwong5419 7 месяцев назад

    I have watched a few of your videos. I love the way you explain things simply and clearly. Keep it up. Thank you for your work to all of us.

  • @continuouslearner
    @continuouslearner 7 месяцев назад +3

    Would have been good to cover what ollama is and what problems does it solve, for about 30sec-1min, before going into hardwarre requirements etc.

  • @volt5
    @volt5 4 месяца назад

    Thanks, this was crystal clear. I just started trying Llama and your video helps me to orient myself.,

  • @exxonrcg
    @exxonrcg 6 месяцев назад +2

    you are a good educator. Thanks for this

  • @German_dude175
    @German_dude175 4 месяца назад

    This is the quality content im looking for! Great video!

  • @bens4446
    @bens4446 9 месяцев назад +1

    Thanks! Just downloaded Ollama and was feeling a bit lost. Would really appreciate some guidance on integrating speech recognition and text to speech into the chatbot. But just about anything you say will probably be useful. Please keep 'em coming!

  • @talktoeric
    @talktoeric 6 месяцев назад

    I really like this channel! The presentation is great and understandable. It is easy to follow along. Thanks.

  • @enmingwang6332
    @enmingwang6332 7 месяцев назад

    What a great tutorial, clear, concise and informative!!!

  • @Cube_Box
    @Cube_Box 5 месяцев назад

    Absolutely love this channel

  • @wirreswuergen
    @wirreswuergen 8 месяцев назад +1

    Thank you, Matt! Your videos are awesome and already helped me a lot :)

  • @Filipe9171
    @Filipe9171 7 месяцев назад

    This is gold. Thank you, Mr Williams!

  • @RobCowie
    @RobCowie 8 месяцев назад +1

    Does it phone "home" at all, or is the model I use locally, assuming the machine is connected to the Internet, shared publicly at all, and is it secure?

    • @technovangelist
      @technovangelist  8 месяцев назад

      It doesn’t reach out anywhere unless you write a program to have it do something like that.

  • @incrastic6437
    @incrastic6437 9 месяцев назад

    Excellent introduction. Thanks for the help

  • @sci1200
    @sci1200 4 месяца назад

    Thank you, Matt! BTW, I saw you're drinking at the end of video😄

  • @dirtydevotee
    @dirtydevotee 2 месяца назад

    An excellent video. Thank you!

  • @harumambaru
    @harumambaru 6 месяцев назад

    Thanks for explanation! Love the outro :)

  • @ValentinPletzer
    @ValentinPletzer 10 месяцев назад +1

    Thanks. I really learned a lot by watching your videos. I recently ran into an issue when writing a new template model for few shot learning. Most of the times it responds like expected but sometimes it responds to my prompt and then also inserts it's own command by adding [INST] some other prompt … and also answers it. I probably made some mistake but I cannot figure it out. That's why I would love to see you make a video on templates (if this isn't too much to ask).

    • @jahbini
      @jahbini 10 месяцев назад

      I second that request!

  • @cankobebryant
    @cankobebryant 5 месяцев назад

    Thanks, great video!

  • @liammcmullen4497
    @liammcmullen4497 10 месяцев назад

    Great Overview Matt, your a star!

  • @nicosilva4750
    @nicosilva4750 10 месяцев назад

    Do the models return Markdown ...like lists? extended Markdown ...like tables and LaTeX? I have written my own desktop client that I put on all our machines to use OpenAI and their API (cheaper than $20 * 5/month). So I would like to have a network server for my home to run a local model. Can I set it up there and have everyone use it, or would there be performance issues? ...what about simultaneous usage?

  • @SergiySev
    @SergiySev 10 месяцев назад

    Great video, thank you for Ollama introduction!
    is there a way to add my own data to the model or shrink model to a particular topic? for example TailwindCSS, there is a sorce code, docs, library of the project, is there any way to train model to be able generate layouts and components based on a provided data?

  • @fastmamajama
    @fastmamajama 5 месяцев назад

    wow. thanks. i am trying to use ai to capture ufos. ive been using opencv to detect saucers, tictacs and orbs. i wonder if i could use ollama to detect ufos on videos that opencv records or make the process more effective

  • @NoHack_Know_How
    @NoHack_Know_How 6 месяцев назад +1

    Question - I have a mining board "ASUS B250 MINING EXPERT LGA1151 DDR4" do you think I can use that to host OllAMA 2?

    • @technovangelist
      @technovangelist  6 месяцев назад +1

      Depends on the gpus.

    • @technovangelist
      @technovangelist  6 месяцев назад

      But what is the gpu that you have

    • @NoHack_Know_How
      @NoHack_Know_How 6 месяцев назад

      @@technovangelist I have 8 videos cards at the moment not sure how many cudas yet ?

    • @technovangelist
      @technovangelist  6 месяцев назад

      What gpu do you have? If you have 8 old gpus that aren’t supported it won’t help. That’s why I keep asking what gpu you have.

    • @NoHack_Know_How
      @NoHack_Know_How 6 месяцев назад

      @@technovangelist Hey sorry for the delay, I was actually getting that information; they are all Radeon 580, and from what I read they don't support it.

  • @sebingtoon
    @sebingtoon 10 месяцев назад

    Hi Matt, do you know what determines the length of a model's answer? How does the model 'know' when to stop? Is it hard coded into the model or is it controlled by Ollama? Thanks

  • @juanjesusligero391
    @juanjesusligero391 10 месяцев назад

    Thank you so much for your tutorials! :D
    I would like to suggest an idea for a future video that I would be really interested in watching: a more detailed exploration of the various models (such as the instruct/base/etc. ones you've mentioned).
    Again, thank you very much! You rock! ^^/

  • @mister-ace
    @mister-ace 5 месяцев назад

    Thank you for the video! I'm very interested in how to conduct sentiment analysis of data? For example, I have 100 texts and I would like to know their sentiment. (at the same time, bert, etc. did not work for me, because they are only for English, and for other languages ​​the alternatives are very bad).. What could you recommend (as if I were 5 years old, lol).

  • @alexsnow2993
    @alexsnow2993 9 месяцев назад +1

    Hello! My video card is an RX580. Is there a way to make it work?

    • @alexsnow2993
      @alexsnow2993 9 месяцев назад

      Using the rx580, it will be slow? Or not work at all?

    • @technovangelist
      @technovangelist  9 месяцев назад

      I don’t see it on the compatibility list. github.com/ollama/ollama/blob/main/docs/gpu.md

    • @technovangelist
      @technovangelist  9 месяцев назад

      Just won’t work at all. I think ollama requires the newer amd drivers and amd didn’t make it backwards compatible with older cards.

    • @alexsnow2993
      @alexsnow2993 9 месяцев назад

      Thanks for the info!
      I can't get another V-card at the moment, and using the CPU it is a no go.
      Is there any version or any other AI out there, that can be configured locally?

    • @technovangelist
      @technovangelist  9 месяцев назад

      Everything I know of is going to need a decent recent gpu.

  • @naan9692
    @naan9692 14 дней назад

    Thanks for pulling back the curtain.

  • @emil8367
    @emil8367 10 месяцев назад

    thanks for sharing, prune is something what I missed but very useful due to the fact of downloading large files and loosing them after each restart, was very annoying. I see ollama didn't documented it well or maybe I overlooked it

  • @blackwinegum
    @blackwinegum 8 месяцев назад

    I just dont get any sort of CLI when i install Ollama , the app just shows "view logs" and "Quit Ollama"

    • @technovangelist
      @technovangelist  8 месяцев назад

      so when you run ollama at the command line you don't see anything?

    • @blackwinegum
      @blackwinegum 8 месяцев назад

      @@technovangelist I think i've figured it out, i think my firewall was blocking something, thanks for replying.

  • @sanzzrulezz
    @sanzzrulezz 6 месяцев назад

    Hi Matt, After installation, Ollama is not opening in Mac? Any tips? Thank you.

    • @technovangelist
      @technovangelist  6 месяцев назад

      So when you run ollama in the terminal nothing happens?

    • @sanzzrulezz
      @sanzzrulezz 6 месяцев назад

      ​@@technovangelist Hi Matt,
      I'm new to this terminal and coding, but I love to learn how to execute this. Currently, I have installed Ollama and downloaded your Mac zip file from GitHub, but I'm not sure how to run this and get it working as software to rename images. Can you guide me on this? It would be really helpful.
      Thank you.

    • @sanzzrulezz
      @sanzzrulezz 6 месяцев назад

      @@technovangelist I have pulled llava:13b in Terminal. However, I don't understand how to run the macOS x86 file from GitHub, which is renamed as airenamer, in Terminal.

  • @nholmes86
    @nholmes86 8 месяцев назад

    I successful run Ollama with llama 3 on Mac OS M1 8G, it runs better when you close other apps .

  • @samsquamsh78
    @samsquamsh78 10 месяцев назад +1

    I like your videos, always spot on and pedagogical! Why did you leave ollama?

    • @technovangelist
      @technovangelist  10 месяцев назад

      If we find ourselves in the same room I’ll talk about it there.

  • @vicnent75
    @vicnent75 7 месяцев назад

    thank you for you job Matt.

  • @YotamGuttman
    @YotamGuttman 7 месяцев назад

    fascinating. thank you for these videos!

  • @JoaoKruschewsky
    @JoaoKruschewsky 9 месяцев назад

    Hello from Brazil. I really liked your content ! thanks

  • @thepassionatecoder5404
    @thepassionatecoder5404 9 месяцев назад

    Do I need to know match, statistics, etc... apart from programming?

  • @ftlbaby
    @ftlbaby 8 месяцев назад

    Thanks for this! I just setup Ollama with wizard-vicuna-uncensored:30b-q8_0. Do you know what's different in the fp16 models?

  • @ec_gadgets
    @ec_gadgets 9 месяцев назад +1

    You explained it perfectly, thank you

  • @bens4446
    @bens4446 9 месяцев назад

    FYI- My llama2 install is working reasonably fast without a GPU, just a ryzan 5600G CPU, which has some rudimentary graphics capacity built into it.

    • @Thymed
      @Thymed 7 месяцев назад

      More than rudimentary but still I get your point. much less than modern GPUs or recent APUs

  • @sepulchral.
    @sepulchral. 6 месяцев назад

    Does anyone know where the models save to on Windows?

    • @technovangelist
      @technovangelist  6 месяцев назад

      the windows docs has the answer to this and other questions: github.com/ollama/ollama/blob/main/docs/windows.md

    • @sepulchral.
      @sepulchral. 6 месяцев назад

      @@technovangelist explorer %HOMEPATH%\.ollama - thanks.

  • @Leon-AlexisSauer
    @Leon-AlexisSauer 4 месяца назад

    yoo so far as i understand , ollama is not an application right? or is there a way to get it like that i am new to this

    • @technovangelist
      @technovangelist  4 месяца назад +1

      Not sure I understand. Ollama is an application to run ai models

  • @richardurwin
    @richardurwin 8 месяцев назад

    Thank you for the video

  • @abhijeetkumar8044
    @abhijeetkumar8044 9 месяцев назад

    Please create videos on how to fine tune these models 🙏

  • @hotbird3
    @hotbird3 8 месяцев назад

    You're a very smart person 👍👊

  • @mrrohitjadhav470
    @mrrohitjadhav470 10 месяцев назад

    it would have been great to know how to install other models not mentioned in Ollama library with specific type of Low Vram and GGUF

    • @technovangelist
      @technovangelist  10 месяцев назад

      check out ruclips.net/video/0ou51l-MLCo/видео.html

    • @mrrohitjadhav470
      @mrrohitjadhav470 9 месяцев назад

      @@technovangelist Thanks a lot❤

  • @Delchursing
    @Delchursing 9 месяцев назад

    Great video. The costs are a bit unclear to me. Would a local ollama/llm be free to use?

    • @technovangelist
      @technovangelist  9 месяцев назад +1

      What costs? You have to own a computer with a gpu. That’s it

  • @makesnosense6304
    @makesnosense6304 10 месяцев назад

    9:40 To have the same result you just need the same input, seed (and other parameters), no? Reason it's different every time is because the seed is random for every request, right? The seed used (and other parameters) make the result different because it takes a different path in the weight model.

    • @technovangelist
      @technovangelist  10 месяцев назад

      Using the same seed and temp doesn’t always guarantee the same result

    • @makesnosense6304
      @makesnosense6304 10 месяцев назад

      @@technovangelist Ah, because temp is a percentage randomness scale of sort.

    • @makesnosense6304
      @makesnosense6304 10 месяцев назад

      @@technovangelist What if temp is 0? Or 1?

    • @technovangelist
      @technovangelist  10 месяцев назад

      It’s not guaranteed

  • @PoGGiE06
    @PoGGiE06 9 месяцев назад

    Thanks Matt, why does everyone use Mistral rather than Mixtral?

  • @userou-ig1ze
    @userou-ig1ze 10 месяцев назад

    I wish there was an easier way to fill in template text, and parsing pdfs. I've seen the 'function calling' video's, but somehow it's still eluding me how do get this done as easily as possible (e.g. sending a pdf over api in a curl request from another machine, and rename it sensibly/according to content)

    • @technovangelist
      @technovangelist  10 месяцев назад +1

      The biggest problem there is the pdf. You can’t easily get to the contents of the pdf. The text. It’s often jumbled up. PDF is the worst format you can use if you want the text and to do something with it. That’s also one of the benefits of pdf. It obfuscates the source text so folks can’t do anything with the text.

    • @userou-ig1ze
      @userou-ig1ze 9 месяцев назад

      ​@@technovangelistthanks for the reply! I used pdf2text but it was not exactly perfectly successful. I wonder how ollama frontends (e.g. webgui or webui) solve this for their RAG? Gave me hope that there is a good way of doing it 🎉

  • @mshonle
    @mshonle 10 месяцев назад

    Here’s a video request: can you do one on LMSys’s SGLang? Particularly using constrained decoding?

  • @piotrnakonieczka1111
    @piotrnakonieczka1111 2 месяца назад

    Hi, I like your channel 😊 could you toch the topic of how to use ollama with two(or more) GPU’s. Option one to use them to speed up the compute and option two to use the vram of both GPU’s to be able to load larger model. Thx in advance 😊

  • @qewolf
    @qewolf 8 месяцев назад

    Verry cool, thank you 🙏

  • @DcKayb
    @DcKayb 10 месяцев назад

    Excellent summary, thanks alot.

  • @flexchamp
    @flexchamp 7 месяцев назад

    10 out of 10!

  • @axeljohannes3464
    @axeljohannes3464 9 месяцев назад

    Wait do I need to download anything or not? You say "Now the model should be downloaded, so you can run it with ollama run mistral" Why would it be downloaded? I just installed the Ollama software. Does it download all the models automatically? This seems very unclear

    • @technovangelist
      @technovangelist  9 месяцев назад +2

      I think you must have skipped around a bit. I very clearly said to install and then run ollama pull to download the model. Then while downloading talked about what’s going on. Then the model is downloaded and you can run it. When you downloaded the model you only downloaded that model. Why download anything? Because you want to run it.

    • @axeljohannes3464
      @axeljohannes3464 8 месяцев назад

      Thanks! I got it to work@@technovangelist

    • @axeljohannes3464
      @axeljohannes3464 8 месяцев назад

      I think what confused me is the term pull, and what that actually meant. So when you got to the point of speaking about downloaded, I was like "Hey, no one said anything about downloading anything"

  • @briann1233
    @briann1233 9 месяцев назад

    Can Ollama be used in prod on Linux server?

    • @technovangelist
      @technovangelist  9 месяцев назад +1

      absolutely. lots of folks are doing just that.

    • @briann1233
      @briann1233 9 месяцев назад

      @@technovangelistWow, that is amazing. I would appreciate it if you could provide documents that would guide me in deploying a model for production use and "Function Calling." Ollama is an excellent tool for a startup to keep costs down and avoid OpenAI usage costs.

    • @briann1233
      @briann1233 9 месяцев назад

      It also gives us flexibility to keep our data in-house.

    • @briann1233
      @briann1233 9 месяцев назад

      ​@@technovangelist This is awesome!. Do you have any reference deploy successfully into prod? We are trying to avoid OpenAI and are looking for open-source AI models with "Function calling."

    • @technovangelist
      @technovangelist  9 месяцев назад

      all the docs in the github repo. but it’s a pretty simple app without many dependencies. I don't know of any guidance though.

  • @Mike-vj8do
    @Mike-vj8do 8 месяцев назад

    AMAZING!

  • @tecnopadre
    @tecnopadre 10 месяцев назад

    Sometimes your level is so high some others too simple. Cheers.

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 10 месяцев назад

    Great content

  • @viniciussilvano4177
    @viniciussilvano4177 7 месяцев назад

    Please, do more compatibility with GPUs. I have rx580. My processor is crying hehe

    • @technovangelist
      @technovangelist  7 месяцев назад +1

      That’s a request for AMD to add support to those older lower end cards I think.

    • @viniciussilvano4177
      @viniciussilvano4177 7 месяцев назад

      @@technovangelist Is there any way I can use a library that allows me to do this. Or is it actually something that depends on AMD?
      I'm really impressed with what using Ollama as an API has added to my projects. I would like to find some way to speed up processing without having to spend money, at least for now.

    • @technovangelist
      @technovangelist  7 месяцев назад +1

      But amd support requires a certain level of the drivers which amd only has working for newer cards. I think the only option is to buy a more recent card. The 580 is 5 years old.

  • @JeppeGybergyoutube
    @JeppeGybergyoutube 8 месяцев назад

    Nice video

  • @AwesomeCanadianHomes
    @AwesomeCanadianHomes 9 месяцев назад +1

    I have a feeling Duncan Trussell is a subscriber : )

  • @anshulsingh8326
    @anshulsingh8326 7 месяцев назад +1

    Subed ❤️
    If only you taught maths too

  • @thiagoassisfernandes
    @thiagoassisfernandes 10 месяцев назад +1

    arch and nix are system-d distros

  • @stebansb
    @stebansb 8 месяцев назад

    great content, a telegram group would be great!

    • @technovangelist
      @technovangelist  8 месяцев назад

      Telegram??? I think I used it once at an Idan Raichel concert but never since. What’s special about a telegram group?

    • @stebansb
      @stebansb 8 месяцев назад

      @@technovangelist the other option being Discord, I feel is simpler, cleaner user interface, yet very powerful; popular with business and a slightly more mature cohort. The other option is Discord, slower more complex, popular among gamers. Either way, will be cool to have something to build a community beyond RUclips.

  • @3clectic0
    @3clectic0 2 месяца назад

    Very helpfunctie.

  • @MrI8igmac
    @MrI8igmac 2 месяца назад

    I could make the console work if it displayed code with a syntax color highlieght

    • @technovangelist
      @technovangelist  2 месяца назад

      if highlighting is holding you back, you have other issues...

    • @MrI8igmac
      @MrI8igmac 2 месяца назад

      @@technovangelist \033[91m definitely

  • @Andres-m2u
    @Andres-m2u 10 месяцев назад

    Ollama runs very well on a M3 Max

  • @K600K300
    @K600K300 10 месяцев назад

    thank you

  • @sergey_a
    @sergey_a 10 месяцев назад +1

    thanks for the informative video.
    some examples should be displayed in a video, rather than spoken, for example, to show how to use environment variables

    • @technovangelist
      @technovangelist  10 месяцев назад

      There are a number of videos pointed out throughout the video that provide all the examples

  • @dmbrv
    @dmbrv 10 месяцев назад

    thanks

  • @robert_kotula
    @robert_kotula 9 месяцев назад

    Booted up Ollama with the llama2 model and my M1 MBP just froze 💀

    • @technovangelist
      @technovangelist  9 месяцев назад +1

      That is bizarre…as in you would be the first person that has happened to. Running macOS I assume. Installed using the installer? What else was running? So you installed and then opened a terminal and run ollama run llama2 and then nothing? Probably easiest to solve on the discord.

    • @robert_kotula
      @robert_kotula 9 месяцев назад

      @@technovangelist I’ll join the discord channel and try to troubleshoot. I’ve had a couple of tabs open in Safari and one tab in Firefox Developer addition, nothing else. Will need to dig into the performance stats on the laptop.

  • @mcawesome4150
    @mcawesome4150 9 месяцев назад

    you should have more views and subscribers

    • @technovangelist
      @technovangelist  9 месяцев назад +1

      Thanks. Both are accelerating quickly. But feel free to share. I like to say I am working on my first million subscribers. Only 985,000 short.

  • @florentflote
    @florentflote 10 месяцев назад

  • @jyashi1
    @jyashi1 10 месяцев назад

    First

  • @lucasmuso
    @lucasmuso 6 месяцев назад

    Bro did not just use FAQ as a whole assed word.

    • @technovangelist
      @technovangelist  6 месяцев назад

      Ahh you must be new to this technology world. Welcome.