Fine-Tuning Mistral 7B
HTML-код
- Опубликовано: 8 ноя 2023
- This session is led by Chris and Greg!
You'll learn what you need to know about Mistral 7B, and how to get it fine-tuned for your application!
Agenda with additional resources: docs.google.com/document/d/1s... - Наука
This made my day. Perfect and clear explanation.
THank you so much for this tutorial.
Google Colab: colab.research.google.com/drive/1JtrVh--bcPR-CR8QNOyXd3Z5eZt0WgOw?usp=sharing
Slides: www.canva.com/design/DAFzn7Uynrc/IMrrg6GSL_2NWpAnWXfobQ/edit?DAFzn7Uynrc&
thanks for this tutorial bro ...
🤘
New to this stuff. Is it possible for me to use my own gpu to train? If yes, how? Thanks!
With a combination of Quantization strategies (4bit from bitsandbytes, AWQ, and more) plus LoRA (or other adapter methods) it's more than possible to fine-tune large language models on a consumer GPU!
If it's your own GPU on prem, you'll just have to deal with some hardware config that is more streamlined when leveraging compute from cloud providers!
Thanks, this is super , in your generate_reponse(promt), for generate_ids what is the value for pad_token_id? pad_token_id=tokenizer or pad_token_id=tokenizer.eos_token? I actually tried both of them, none of them works, anything I missed here? is there any other parameter after pad_token_id?
pad_token_id=tokenizer.eos_token_id is what you'd want!
Can you do a video on finetuning a multimodal LLM (Video-LlaMA, LLaVA, or CLIP) with a custom multimodal dataset containing images and texts for relation extraction or a specific task? Can you do it using open-source multimodal LLM and multimodal datasets like video-llama or else so anyone can further their experiments with the help of your tutorial. Can you also talk about how we can boost the performance of the fine-tuned modal using prompt tuning in the same video?
We'll add this suggestion to our backlog of potential future events for sure! Keep the ideas coming!
@@AI-Makerspace Thanks
If i have hardware constraints, can i use a small model such as tiny-llama?
Also, how can i perform RAG on a csv dataset?
You could!
For the RAG question - you could use a CSVRetriever!