Design Your Own Ollama Model Now!

  • Published: Apr 14, 2024
  • After watching this you can be an Ollama Model Making Pro.
    Be sure to sign up to my monthly newsletter at technovangelist.com/newsletter
    And if you're interested in supporting me, sign up for my Patreon at patreon.com/technovangelist
  • Science

Comments • 32

  • @wilson_joe
    @wilson_joe 24 days ago +1

    I really appreciate your videos. You have a simple, understandable, friendly approach to teaching, which keeps me coming back for more.

  • @user-wr4yl7tx3w
    @user-wr4yl7tx3w 2 months ago +6

    Maybe a beginner's guide would also be helpful. The content is a bit advanced, though very useful.

  • @tharun2003
    @tharun2003 15 days ago

    You saved my day. Thank you Matt.

  • @tecnopadre
    @tecnopadre 2 months ago +2

    I love your endings

  • @xspydazx
    @xspydazx 20 days ago

    Very good!!! Well understood.
    (Quick advice)
    Temperature is related to training as well: things that were not trained deeply will need a higher temperature, and things that are deeply embedded will be fine with the lowest temperature. How do people train their models, and what are their acceptable loss levels? Some aim for 0.5 and under, while others don't care: they let the model complete an epoch on a large dataset and assume the data took, as long as the final output was preferable. In fact, all the data that did not go in at a loss below 0.5 did not take and is not retrievable. Perhaps it is there ephemerally, as in pretraining, where it is just used for next-word prediction. But we are doing tasks, which means whole-sequence prediction/recall, so when we train for a task we expect the whole dataset to fit in range. So a low temperature (1) should give acceptable losses.
    Some say that this affects the softmax of the probabilities chosen by the top-k sampler as well as the top-p percentage cutoff, but that is when many samples are chosen. It also reflects the values that were trained at that rate of loss, so it will be collecting samples from the level under the temperature rate of 1 (a lot). This will need constraining with top-p (selecting the highest probabilities), but the softmax will also spread them, allowing for more randomness, when the model has been overtrained.
    So an overtrained model can be loosened by raising the temperature, and a wild model tamed!
    lol...
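
For readers following the temperature, top-k, and top-p discussion above, here is a minimal sketch of how these three settings are commonly combined when picking the next token. It is a generic illustration, not Ollama's actual implementation: the function name `next_token_distribution`, the toy logits, and the order in which the filters are applied are all assumptions. It does not model the commenter's claims about training loss; it only shows that a higher temperature flattens the distribution and that top-k and top-p cut off the tail.

```python
import math


def next_token_distribution(logits, temperature=1.0, top_k=0, top_p=1.0):
    """Return {token: probability} after temperature scaling, top-k, and top-p.

    Hypothetical helper for illustration; real inference engines differ in detail.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")

    # Temperature divides the logits before softmax: values above 1 flatten
    # the distribution, values below 1 sharpen it.
    scaled = {tok: value / temperature for tok, value in logits.items()}

    # Numerically stable softmax.
    peak = max(scaled.values())
    weights = {tok: math.exp(value - peak) for tok, value in scaled.items()}
    total = sum(weights.values())
    ranked = sorted(
        ((tok, weight / total) for tok, weight in weights.items()),
        key=lambda pair: pair[1],
        reverse=True,
    )

    # top-k: keep only the k most likely tokens (0 means "no limit" here).
    if top_k > 0:
        ranked = ranked[:top_k]

    # top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for tok, prob in ranked:
        kept.append((tok, prob))
        cumulative += prob
        if cumulative >= top_p:
            break

    # Renormalize so the surviving probabilities sum to 1.
    norm = sum(prob for _, prob in kept)
    return {tok: prob / norm for tok, prob in kept}


# Toy logits, made up for the example.
logits = {"cat": 2.0, "dog": 1.0, "fish": 0.0}
cold = next_token_distribution(logits, temperature=0.5)
hot = next_token_distribution(logits, temperature=2.0)
print("temperature 0.5:", cold)
print("temperature 2.0:", hot)
```

With the low temperature the top token dominates; with the high temperature the probabilities move closer together, which is the "loosening" effect described in the comment.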

    • @xspydazx
      @xspydazx 20 days ago

      I would really like to see a video on publishing a model!

  • @AliAlias
    @AliAlias 2 months ago

    Thanks ❤
    Very helpful 😊

  •  2 months ago

    Excellent thank you!

  • @userou-ig1ze
    @userou-ig1ze 2 months ago

    Please keep doing what you're doing. At this point I would guess job offers pour in from all over the world. Thanks for your continuous videos! I went through this with the meditron model, whose prompt format I suspect is still not fully correct but which I couldn't fix. Maybe with this video I will be more successful.
    PS: Let us know in case you sell merch :)

    • @technovangelist
      @technovangelist  2 months ago

      No merch, but there is a Patreon at patreon.com/technovangelist and a newsletter at technovangelist.com/newsletter

  • @wardehaj
    @wardehaj 2 months ago

    Great video, very useful! I have a request for you: please make a video about making an Ollama model of the dbrx and/or grok 1.5 vision models.

  • @francescobassignana4211
    @francescobassignana4211 1 month ago +1

    Hi! Thanks for the video. I have a question about using the Ollama model with LangChain: When I run the .invoke method with a simple prompt, does the Ollama library automatically insert the prompt into the pre-configured template in the model file, or do I need to manually include it in the LangChain prompt template?

  • @twinnie38
    @twinnie38 1 month ago

    So helpful, so interesting, thanks 👍 After generating my model, I noticed that I have to specify the number of layers to use (--n-gpu-layers) in Ollama, even though my GPU has enough memory. If I use fewer layers, what does this mean in practice?

  • @SonGoku-pc7jl
    @SonGoku-pc7jl 2 months ago

    thanks!!!

  • @rude_people_die_young
    @rude_people_die_young 2 months ago

    Another great one - identifying several pitfalls

  • @eyeseethru
    @eyeseethru 2 months ago +1

    Thanks for all the helpful videos on Ollama! I've since located the answers, but these are a few questions I was always left with whenever I saw mention of making a model file. Asking so it may help other new Ollama users: What kind of file is it? What program should be used to create it? Is it saved in a specific file format or location?

    • @technovangelist
      @technovangelist  2 months ago

      I created it in vscode. It’s just a text file like everything else in a code editor. And put it anywhere you like. Once you run ollama create, blobs and manifests are generated in a specific place.
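
As a rough illustration of the reply above (the model file is plain text, can live anywhere, and is then passed to `ollama create`), the following sketch writes a small Modelfile and prints the command you would run afterwards in a shell. The base model `llama2`, the system prompt, the temperature value, the model name `my-model`, and the helper `build_modelfile` are all placeholder assumptions, not values from the video. The script does not call Ollama itself.

```python
import tempfile
from pathlib import Path


def build_modelfile(base_model, system_prompt, temperature):
    """Return Modelfile text using the FROM, PARAMETER, and SYSTEM instructions.

    Hypothetical helper for illustration only.
    """
    lines = [
        f"FROM {base_model}",
        f"PARAMETER temperature {temperature}",
        f'SYSTEM """{system_prompt}"""',
    ]
    return "\n".join(lines) + "\n"


# Placeholder values, chosen only for the example.
modelfile_text = build_modelfile("llama2", "You are a concise assistant.", 0.7)

# The file is ordinary text and its location is up to you; a temporary
# directory stands in for "anywhere you like".
modelfile_path = Path(tempfile.mkdtemp()) / "Modelfile"
modelfile_path.write_text(modelfile_text, encoding="utf-8")

# Command to run afterwards in a shell; "my-model" is a made-up name.
create_command = f"ollama create my-model -f {modelfile_path}"

print(modelfile_text)
print(create_command)
```

Any text editor produces the same result; the script only makes the point that nothing beyond a plain text file is needed before running `ollama create`.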

  • @atrocitus777
    @atrocitus777 2 months ago

    I see that you can use your own Docker registry with Ollama as a way of hosting model files. I would love to see a video on this for users running Ollama on closed networks.

    • @technovangelist
      @technovangelist  2 months ago

      It’s not actually the same as the docker registry. It was written by the same person that created the docker registry though.

    • @technovangelist
      @technovangelist  2 months ago +1

      It had to be modified because layers in a docker image are tiny whereas models are huge.

  • @KhanaKhala1
    @KhanaKhala1 2 months ago

    Extremely useful, but what if there is no template in the readme?

    • @technovangelist
      @technovangelist  2 months ago

      Then look in that file I showed. If it’s not there, look at how the model was trained or fine-tuned.

  • @explorer945
    @explorer945 2 months ago

    Thank you for the short and sweet video. How do you get such good audio quality on your videos? Step 0: have a great voice. What is step 1 (gear, setup in OBS/plugins)?

    • @technovangelist
      @technovangelist  2 months ago +3

      I think I need a video on it. I don’t use OBS though.

    • @explorer945
      @explorer945 2 months ago

      @technovangelist Yes, video please. You could add affiliate links to the gear as well. Really loved the bass in the audio.

    • @technovangelist
      @technovangelist  2 months ago +1

      ruclips.net/video/LQe3DFjMYrE/видео.htmlsi=R4u3h6yPtbUaHeDh

  • @UTubeGuyJK
    @UTubeGuyJK 2 months ago

    My coworker and I set up a Windows machine to run Ollama. It works great but occasionally seems to crash. Could it be the keep_alive setting? If I want others to be able to hit it via the API, should I set keep_alive to “forever”? (I don’t remember the flag for that off the top of my head.) Thanks for your work on Ollama!

    • @technovangelist
      @technovangelist  2 months ago

      In most cases you shouldn’t need to worry about keep alive.

  • @Cloud_Dude
    @Cloud_Dude 2 months ago

    There is a folder based on the date of this video. Do you have a gist containing the template content per model?

    • @technovangelist
      @technovangelist  2 months ago

      No. It was just a few lines that you can grab from the same sources I did, so I didn’t bother with it.

  • @florentflote
    @florentflote 2 months ago