Deploy models with Hugging Face Inference Endpoints
- Published: Sep 23, 2024
- In this video, I show you how to deploy Transformer models straight from the Hugging Face hub to managed infrastructure on AWS, in just a few clicks. Starting from a model that I already trained for image classification, I first deploy an endpoint protected by Hugging Face token authentication. Then, I deploy a second endpoint in a private subnet, and I show you how to access it securely from your AWS account thanks to AWS PrivateLink.
⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️
⭐️⭐️⭐️ Want to buy me a coffee? I can always use more :) www.buymeacoff... ⭐️⭐️⭐️
- Model: huggingface.co...
- Inference Endpoints: huggingface.co...
- Inference Endpoints documentation: huggingface.co...
- AWS PrivateLink documentation: docs.aws.amazo...
Code:
import requests, json, os

# ENDPOINT_URL is the URL of your Inference Endpoint
API_URL = ENDPOINT_URL
MY_API_TOKEN = os.getenv("MY_API_TOKEN")
headers = {"Authorization": "Bearer " + MY_API_TOKEN, "Content-Type": "image/jpeg"}

def query(filename):
    # Read the image file and POST its raw bytes to the endpoint
    with open(filename, "rb") as f:
        data = f.read()
    response = requests.post(API_URL, headers=headers, data=data)
    return json.loads(response.content.decode("utf-8"))

output = query("food.jpg")
This is the exact content I was looking for yesterday, you posted it today! Fantastic lol
Really hope I can get everything set up to put my idea into production at scale.
Glad it was helpful!
I need to recheck your previous video. That one covered deploying a training instance; this one deploys an inference instance. Always great to revisit and understand the different terms as a beginner.
Thanks Julien. Besides the ease of using Hugging Face endpoints, I learned how VPC endpoints work!
Cool :)
Thank you for hugging face. It makes deployment much easier.
Fantastic, great learning, thank you very much. So now I can use these endpoints from LangChain or LlamaIndex without worrying about deploying my model.
Exactly, and you're welcome :)
Ughh, I wish I had found this earlier. I set up my own VPS with both a front-end and a back-end server to provide access to a transformer model. Thanks, this should help.
Glad I could help!
Thanks Julien, great video!
Glad you liked it!
great lectures.
You're the best!!!
Thanks, I'll tell my wife
This is pure gold, thank you!
Thanks!
Great job sincerely!
Thanks!
That's amazing, Merci pour le partage
Glad you liked it.
I'd appreciate it if you could share how to deploy .ckpt or safetensors models on a VPS that I already own (Vultr or DigitalOcean).
Do we need AWS for model storage here, or can we use the Hugging Face Inference API endpoints directly? I want to use the jais13b-chat model @Julien Simon
Inference Endpoints lets you deploy any hub model on managed infrastructure running on AWS or Azure. Not sure what you mean by 'model storage' ?
Hey Julien,
Where can we find the training model video for food dataset?
Also, I am trying to deploy a model on Hugging Face Inference Endpoints, but it errors out saying I need a config.json file. I'm not sure how to create it. Any leads would be really helpful.
Thanks!
Hi, I think this is the right video: ruclips.net/video/uFxtl7QuUvo/видео.html
Yes, your model repository needs to have a config.json file, which is generated automatically when you save your trained model. See the docs at huggingface.co/docs/inference-endpoints/index
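If you want to verify this before deploying, here is a minimal sketch; `has_valid_config` is a hypothetical helper (not part of any library), and the keys it checks are the ones a Transformers config file normally records:

```python
import json
import os

def has_valid_config(repo_dir):
    """Check that a local model folder contains the config.json
    that Inference Endpoints requires before deployment."""
    path = os.path.join(repo_dir, "config.json")
    if not os.path.exists(path):
        return False
    with open(path) as f:
        config = json.load(f)
    # A Transformers config normally records the model type
    # and/or the architecture class used to load the model
    return "model_type" in config or "architectures" in config
```

If the check fails, saving the trained model again with `save_pretrained()` will regenerate the file in the output folder.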
I need to know how to communicate with running chat models from Python code. I'm struggling to find this information.
Check out the Inference Endpoints documentation. The format is simple JSON.
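As a sketch of that JSON format: a text-generation endpoint takes an "inputs" string plus optional generation settings under "parameters". The endpoint URL below is a placeholder, and `build_chat_payload` is a hypothetical helper:

```python
import json

# Placeholder: replace with your own Inference Endpoint URL
API_URL = "https://YOUR-ENDPOINT.endpoints.huggingface.cloud"

def build_chat_payload(prompt, max_new_tokens=128):
    # "inputs" carries the prompt; "parameters" carries
    # optional generation settings such as max_new_tokens
    return {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }

body = json.dumps(build_chat_payload("What is deep learning?"))
```

You would then POST `body` to `API_URL` with the same Bearer-token Authorization header used in the image-classification snippet above, but with "Content-Type: application/json".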
I'm paying for it, but it's really hard to change the tokens for models.
Where do I get my api token?
Create an account on the Hugging Face hub and go to settings.