Summarizing legal documents with Hugging Face and Amazon SageMaker

  • Published: 20 Oct 2024
  • Real-life generative AI! In this video, I show you how to fine-tune a Google FLAN-T5 model to summarize legal text.
    We first deploy the model straight from the Hugging Face Hub to Amazon SageMaker, and we evaluate it on legal data. Then, using GPU instances managed by SageMaker, we fine-tune the model with a Hugging Face script and we deploy it again.
    October 2023: follow-up video on using QLoRA to optimize cost-performance: • Parameter-efficient fi...
    ⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️
    Code: gitlab.com/jul...
    Model: huggingface.co...
    Dataset: huggingface.co...
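
    For reference, deploying a FLAN-T5 checkpoint straight from the Hub with the SageMaker Python SDK looks roughly like the sketch below. This is not the exact notebook code; the model ID, container versions, and instance type are assumptions.

```python
import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()

# Pull the model straight from the Hugging Face Hub (model ID is an example).
hub_config = {
    "HF_MODEL_ID": "google/flan-t5-base",
    "HF_TASK": "summarization",
}

model = HuggingFaceModel(
    env=hub_config,
    role=role,
    transformers_version="4.26",  # assumed versions; pick a supported combination
    pytorch_version="1.13",
    py_version="py39",
)

# Deploy to a real-time endpoint on a GPU instance.
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g4dn.xlarge",
)

print(predictor.predict({"inputs": "Long legal text to summarize..."}))
```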

Comments • 28

  • @caiyu538
    @caiyu538 1 year ago +1

    Keep on learning from your great lectures.

  • @stephenielane783
    @stephenielane783 8 months ago

    Thank you for the video Julien. My summarisation task involves (1) taking verbal recordings, (2) keeping certain domain-specific English phrases, and (3) fixing any grammatical errors. I don't have many inputs but I have a lot of "output text". Do you think we can still train flan-t5?

  • @anuragbhatia1980
    @anuragbhatia1980 1 year ago +1

    Amazing tutorial. One minor issue: Video uses "title" column while the Gitlab notebook uses "summary" column.

    • @juliensimonfr
      @juliensimonfr  1 year ago +1

      Thank you, and good catch: 'title' it should be. I fixed the notebook.
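
      For illustration, the corrected preprocessing could look like the sketch below, using "title" as the target column. The tokenizer choice, column names, and length limits are assumptions, not the exact notebook code.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")

def preprocess(batch):
    # Tokenize the document text as the model input...
    model_inputs = tokenizer(batch["text"], max_length=512, truncation=True)
    # ...and the "title" column (not "summary") as the target sequence.
    labels = tokenizer(batch["title"], max_length=64, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

# Typically applied with: dataset.map(preprocess, batched=True)
```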

  • @giantdutchviking
    @giantdutchviking 11 months ago

    May I ask why your domain-specific training data has an imbalance in row counts between the texts, summaries, and titles? I assume every row contains the text and its corresponding summary and title. Does training just ignore the +/- 15k rows that don't have a corresponding summary?
    Thanks for making this video; I wanted to see the magic before learning the theory.

  • @caiyu538
    @caiyu538 1 year ago

    Great lectures. I used this model to summarize medical reports.

    • @juliensimonfr
      @juliensimonfr  1 year ago

      Great! If you can, please share the model on the Hugging Face hub :)
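
      For anyone who wants to do that, pushing a fine-tuned checkpoint to the Hub takes a few lines with transformers; the local path and repo name below are placeholders.

```python
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

# Placeholder paths and repo name; run `huggingface-cli login` first.
model = AutoModelForSeq2SeqLM.from_pretrained("./my-finetuned-flan-t5")
tokenizer = AutoTokenizer.from_pretrained("./my-finetuned-flan-t5")

model.push_to_hub("my-username/flan-t5-medical-summarization")
tokenizer.push_to_hub("my-username/flan-t5-medical-summarization")
```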

  • @aru6575
    @aru6575 8 months ago

    Julien, can you do content on training with the summary label instead of the title? I'm concerned about the training capacity of Google Colab; I'm a free user.

    • @juliensimonfr
      @juliensimonfr  8 months ago

      Hi, you can restrict the number of training samples if needed, or use a smaller T5 model.
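
      For example, with the datasets library you can subsample the training set before fine-tuning; the dataset ID and sample count below are placeholders.

```python
from datasets import load_dataset

# Placeholder dataset ID; substitute the one used in the video.
dataset = load_dataset("my-org/legal-summarization", split="train")

# Keep only 1,000 shuffled examples to fit a smaller compute budget.
small_train = dataset.shuffle(seed=42).select(range(1000))

# Alternatively, fine-tune a smaller checkpoint such as google/flan-t5-small.
```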

  • @MAx-gi1pn
    @MAx-gi1pn 6 months ago

    What would it mean if the code in the training part runs for many hours but nothing happens? And when I stop it manually, it says: INFO:sagemaker.image_uris:image_uri is not presented, retrieving image_uri based on instance_type, framework etc.

    • @juliensimonfr
      @juliensimonfr  6 months ago

      Something's wrong with the training container. You may need to update the Python version or the transformers version to a more recent release. See github.com/aws/deep-learning-containers/blob/master/available_images.md
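
      Concretely, the versions are pinned on the Hugging Face estimator; the entry point, hyperparameters, and version combination below are assumptions, so check them against the available_images list linked above.

```python
import sagemaker
from sagemaker.huggingface import HuggingFace

role = sagemaker.get_execution_role()

huggingface_estimator = HuggingFace(
    entry_point="train.py",        # your Hugging Face training script
    source_dir="scripts",
    instance_type="ml.g4dn.xlarge",
    instance_count=1,
    role=role,
    transformers_version="4.26",   # pick a combination listed in available_images.md
    pytorch_version="1.13",
    py_version="py39",
    hyperparameters={"epochs": 1, "model_name": "google/flan-t5-base"},
)

# Placeholder S3 URI for the tokenized training data.
huggingface_estimator.fit({"train": "s3://my-bucket/train"})
```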

    • @MAx-gi1pn
      @MAx-gi1pn 6 months ago

      @@juliensimonfr Thank you for responding to me, I did not think I would get a response. I finally fixed the issue, but now the code has been running for 2 hours and I still see no output. Do you think I should just leave it running? I actually don't know if it's working, because it's not reporting any errors, but it has been running too long and I see no results yet.

    • @juliensimonfr
      @juliensimonfr  6 months ago

      @@MAx-gi1pn Check the CloudWatch monitoring information for the training job (logs and graphs).

  • @danielguns2019
    @danielguns2019 8 months ago

    Great video! One question: the token limit is 512 on my model. Can I increase it safely?

    • @juliensimonfr
      @juliensimonfr  8 months ago

      No, the sequence length is a built-in property of the model. You need to consider models that support a longer sequence length. Another popular option is to split long documents into chunks, summarize each chunk, and then summarize the summaries :)
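
      As an illustration of that chunking approach, here is a sketch (not code from the video; the chunk size and model are assumptions):

```python
from transformers import pipeline

summarizer = pipeline("summarization", model="google/flan-t5-base")

def summarize_long_document(text, chunk_size=2000):
    # Naive character-based chunking, sized to stay under the model's
    # sequence limit; a tokenizer-aware splitter is better in practice.
    chunks = [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    # First pass: summarize each chunk independently.
    partial = [summarizer(c, max_length=128)[0]["summary_text"] for c in chunks]
    # Second pass: summarize the concatenated partial summaries.
    return summarizer(" ".join(partial), max_length=128)[0]["summary_text"]
```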

    • @danielguns2019
      @danielguns2019 8 months ago

      Ok thanks for the response! @@juliensimonfr

  • @iqranaveed2660
    @iqranaveed2660 1 year ago

    Sir, I want to do abstractive summarization on the PubMed dataset, but it can't run on Colab. Please suggest a platform for it.

  • @meirgoldenberg5638
    @meirgoldenberg5638 1 year ago

    Is there no way to be charged only for the compute resources that you actually use, i.e. per second of usage? (That's how it works with AWS Lambda.)

    • @juliensimonfr
      @juliensimonfr  1 year ago

      You can try serverless inference on SageMaker.
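
      A minimal sketch with the SageMaker Python SDK; the memory size, container versions, and model ID are assumptions, and note that serverless endpoints are CPU-only:

```python
import sagemaker
from sagemaker.huggingface import HuggingFaceModel
from sagemaker.serverless import ServerlessInferenceConfig

role = sagemaker.get_execution_role()

model = HuggingFaceModel(
    env={"HF_MODEL_ID": "google/flan-t5-base", "HF_TASK": "summarization"},
    role=role,
    transformers_version="4.26",
    pytorch_version="1.13",
    py_version="py39",
)

# Pay per request instead of keeping an instance running.
predictor = model.deploy(
    serverless_inference_config=ServerlessInferenceConfig(
        memory_size_in_mb=6144,  # maximum allowed memory
        max_concurrency=1,
    )
)
```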

  • @stayinthepursuit8427
    @stayinthepursuit8427 1 year ago

    Do we need a local GPU, or can everything be done through SageMaker? Why then do I see people complaining about GPUs all the time?

    • @juliensimonfr
      @juliensimonfr  1 year ago

      SageMaker is a cloud service, so it runs in the cloud ;)

  • @holydarknes
    @holydarknes 1 year ago

    Let's say I want to summarize any type of incoming document. Would I have to train a bunch of different models for different types of files, then determine the type of file before submitting it to be summarized? Is there a way to have a more general solution?

    • @juliensimonfr
      @juliensimonfr  1 year ago +3

      2 options IMHO:
      1) a large summarization model trained/fine-tuned on tons of different documents
      2) a text classification model (to figure out what the doc is about) + several small domain-specific summarizers
      #1 may feel simpler, but it can be difficult to get great results if you don't have a lot of data and if the domains are extremely different. #2 is also more flexible: you can add new domains without retraining a large model every time. See the sketch below.
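
      A sketch of option #2; the zero-shot classifier is a real checkpoint, but the domain summarizer model IDs are placeholders, not real models:

```python
from transformers import pipeline

# Zero-shot classifier routes each document to a domain-specific summarizer.
classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")
summarizers = {
    "legal":   pipeline("summarization", model="my-org/flan-t5-legal"),
    "medical": pipeline("summarization", model="my-org/flan-t5-medical"),
}

def summarize(document):
    # Classify on a prefix to stay within the classifier's context window.
    result = classifier(document[:1000], candidate_labels=list(summarizers))
    domain = result["labels"][0]  # highest-scoring label comes first
    return summarizers[domain](document)[0]["summary_text"]
```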

  • @truthseeker318
    @truthseeker318 5 months ago

    Do you offer consultations for non-profits?

    • @juliensimonfr
      @juliensimonfr  5 months ago +1

      Hi, I'm afraid I can't find time for that. I would recommend posting a message at discuss.huggingface.co (maybe in the "community calls" forum?) and hopefully someone can help.

    • @truthseeker318
      @truthseeker318 5 months ago

      @@juliensimonfr Great, thanks! Do you have any other videos on training models, specifically to summarize legalese accurately?

  • @dstyle5120
    @dstyle5120 1 year ago

    My kernel crashes when I try to use flan-t5-large, while the small and base versions work fine. Does anybody know why? I can only select the conda_pytorch_p310 kernel, not the p39 one Julien is using, and I'm on the free tier of AWS services. Any help would be much appreciated; I've just gotten back to coding after 10 years and a lot has changed.