Unsloth: How to Train LLM 5x Faster and with Less Memory Usage?

  • Published: 16 Sep 2024

Comments • 19

  • @faaf42
    @faaf42 1 day ago

    Clear and to the point. Thanks!

  • @danielhanchen
    @danielhanchen 6 months ago +1

    Thanks for sharing Unsloth and fabulous work on the video! Keep up the great work!

  • @HemangJoshi
    @HemangJoshi 1 month ago +2

    How do I train on unstructured data such as code from a GitHub repo? I don't have a dataset in instruction-and-answer format, only raw text. Does the data have to be in question-answer format?

  • @benda2250
    @benda2250 6 months ago +2

    And do you have a tutorial on building the training dataset easily?

  • @Nurtech87
    @Nurtech87 4 months ago

    Will it now always answer in numbered points, like 1) 2) 3), etc.? Or only for the business plan question and whatever else was in your fine-tuning dataset?
    Your reply is greatly appreciated 😊

  • @ReOp14
    @ReOp14 1 month ago

    I'm getting the error:
    raise KeyError(f"Cache only has {len(self)} layers, attempted to access layer with index {layer_idx}")
    KeyError: 'Cache only has 0 layers, attempted to access layer with index 0'

  • @obamagaming7909
    @obamagaming7909 4 months ago

    How do you make and enter a custom dataset? Is there a template?

  • @HuemanInstrumentality
    @HuemanInstrumentality 4 months ago

    How do I get conda up and running on WSL?

  • @TruGame-s4j
    @TruGame-s4j 2 months ago

    It says it's missing the Triton package, but neither pip nor conda can find it. Any solution?

  • @n.praveenraja3469
    @n.praveenraja3469 4 months ago

    ValueError: Pointer argument (at 2) cannot be accessed from Triton (cpu tensor?)

  • @gram-b4b
    @gram-b4b 6 months ago +1

    Function calling with a small LLM

  • @solidkundi
    @solidkundi 6 months ago +2

    I have some obvious questions: how long does it take on a Windows machine with a 3090? How long on an M1? And what kind of results do you get?

    • @danielhanchen
      @danielhanchen 6 months ago

      Unsloth definitely will be super fast on a 3090 - 2-5x faster and 70% less VRAM usage. Sadly we don't support Macs as of yet :( MLX is probably what you're looking for!

  • @prathameshchaudhari4937
    @prathameshchaudhari4937 6 months ago

    What was the cost of this fine-tuning? What can we expect for our use case?

  • @podunkman2709
    @podunkman2709 6 months ago

    Maybe I'm missing something, but what is the point of training a model if you already have a set of question-answer pairs?
    You could just build a trivial DB that would serve even better 😅

    • @danielhanchen
      @danielhanchen 6 months ago

      So given a set of questions and answers, you can use your data to finetune a base LLM like Gemma, Llama or Mistral so that it can answer new, unseen questions. A trivial DB is good, but for questions outside of that DB, a finetuned LLM is still able to answer them.
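
      As a rough illustration of the reply above, question-answer pairs are usually flattened into plain training strings before fine-tuning. The sketch below is an assumption for illustration only, not code from the video or from Unsloth: the template wording, the names `PROMPT_TEMPLATE`, `format_example` and `build_training_texts`, the sample pairs, and the `</s>` end-of-sequence marker are all hypothetical (in practice the tokenizer's own `eos_token` would normally be used).

```python
# Hypothetical prompt template (an assumption, not the video's or Unsloth's).
PROMPT_TEMPLATE = "### Question:\n{question}\n\n### Answer:\n{answer}"


def format_example(question: str, answer: str, eos_token: str = "</s>") -> str:
    """Turn one question-answer pair into a single training string.

    The end-of-sequence marker is appended so the model learns where an
    answer stops. "</s>" is a placeholder default; real tokenizers differ.
    """
    text = PROMPT_TEMPLATE.format(question=question.strip(), answer=answer.strip())
    return text + eos_token


def build_training_texts(pairs, eos_token: str = "</s>"):
    """Format every (question, answer) pair in the iterable."""
    return [format_example(q, a, eos_token) for q, a in pairs]


# Made-up sample data, purely illustrative.
pairs = [
    ("What should a business plan start with?", "An executive summary."),
    ("  Who reads a business plan?  ", "Investors and lenders."),
]
texts = build_training_texts(pairs)
print(texts[0])
```

      The resulting list of strings could then be stored in a text column of a dataset and handed to a supervised fine-tuning trainer. Unlike a lookup DB, the model is trained on the pattern of these strings rather than on exact key matches.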

  • @manasikhillare5646
    @manasikhillare5646 5 months ago

    How do we install Unsloth?

  • @richerite
    @richerite 4 months ago

    How do I install Python? I need to do fine-tuning.