Clemens Siebler
Clemens Siebler
  • Видео 5
  • Просмотров 18 614
Azure OpenAI Service - Rate Limiting, Quotas, and throughput optimization
This video explains how Azure OpenAI Service's rate limiting and quota configuration works and shows suggestions for optimizing the throughput for a given model.
Blog post: clemenssiebler.com/posts/understanding-azure-openai-rate-limits-monitoring/
#azure #openai #gpt4
Просмотров: 3 840

Видео

Azure OpenAI Service - Embeddings Tutorial
Просмотров 9 тыс.Год назад
In this video we'll walk through OpenAI's embeddings and check out a few code examples how you can use them in Azure OpenAI Service. For this, we're using LangChain as an abstraction layer. Code samples: clemenssiebler.com/posts/azure-openai-service-embeddings-tutorial/
Azure OpenAI Service - 5 Tips for writing better prompts in OpenAI & ChatGPT
Просмотров 256Год назад
This video shares 5 tips for how you can write better prompts in Azure OpenAI Studio, OpenAI Playground or in ChatGPT. Blog post with prompt examples: clemenssiebler.com/posts/5-tips-for-writing-better-prompts-in-openai-chatgpt/
Azure OpenAI Service - API Access
Просмотров 5 тыс.Год назад
This video gives a short introduction how you can use the Azure OpenAI Service API by authenticating to it with your Azure Active Directory user. Blog post with code examples: clemenssiebler.com/posts/using-aad-for-azure-openai-service-authentication/
Azure OpenAI Service - Studio Overview
Просмотров 348Год назад
This video gives a short introduction how you can use the Azure OpenAI Studio for accessing GPT-3 models, like text-davinci-003 for writing prompts.

Комментарии

  • @jonathanbarton5243
    @jonathanbarton5243 4 месяца назад

    Most concise explanation - thank you

  • @Stateoftheheart
    @Stateoftheheart 8 месяцев назад

    Thank you, Clemens, very helpful! Keep them coming :)

  • @JohnLee-wv4wq
    @JohnLee-wv4wq 8 месяцев назад

    can we use this to compare a resume to a qualification?

    • @clemenssiebler
      @clemenssiebler 7 месяцев назад

      I would do this directly in the prompt, no need for embeddings. How to do this while taking Responsible AI best practices into account - I leave that up to you.

  • @nclub976
    @nclub976 8 месяцев назад

    Hello. I want to use Chatgbt4 Turbo vision for my application however I am not sure about the charges I am paying the way of calculation is very confusing to me. Does anyone know for sure what is paid on Azure open ai for using the Chatgbt 4 Turbo vision model, is it just spent tokens or something extra,host? Thank you

    • @clemenssiebler
      @clemenssiebler 8 месяцев назад

      Azure OpenAI just charges you for the tokens you consume when you use pay as a you go! 👍🏻

  • @jagadeeskumarlenin5517
    @jagadeeskumarlenin5517 9 месяцев назад

    Is it only supported for round robin only ?

  • @jagadeeskumarlenin5517
    @jagadeeskumarlenin5517 9 месяцев назад

    Thanks for this video. May i know what is the user hit limt for 240k token. (Per second or per minute)

    • @Leavinggermany
      @Leavinggermany 9 месяцев назад

      It’s in TPMs, so Tokens per Minute. There’s now also a dynamic quota feature that allows to go over that limit in case there is capacity. 👍🏻

  • @SenthilkumarM-l1h
    @SenthilkumarM-l1h Год назад

    you still use openai embedding, not azure openai embedding service right?

    • @clemenssiebler
      @clemenssiebler Год назад

      No, all code in this video is using Azure OpenAI for LLMs/Embeddings...LangChain does it automatically, if you initialize the openai.* variables.

  • @VladimirBalko
    @VladimirBalko Год назад

    🎯 Key Takeaways for quick navigation: 00:13 📚 Embeddings in Azure OpenAI Service: Embeddings are tools used to measure the relatedness of two strings, such as words, sentences, or documents. They are versatile and can be used for semantic search, clustering, recommendations, anomaly detection, diversity measurement, and classification. 01:22 📦 Supported Embedding Models: In Azure OpenAI Service, the primary model for embeddings is the "text-embeddings" model, introduced by OpenAI. It's powerful and cost-effective for various use cases. 02:02 🔑 Getting Started with Embeddings: To use embeddings in Azure OpenAI Service, you need to authenticate, set up the API base, and initialize your embeddings model. You should also adjust the "chunk size" to 1 for single embedding calculations. 05:05 🧩 Comparing Text with Embeddings: You can compare the similarity between two pieces of text by calculating the cosine similarity between their embeddings. This allows you to measure relatedness between strings. 09:27 🎥 Building a Recommendation System: Embeddings can be used to create recommendation systems. By comparing the embeddings of items, you can recommend similar items to users based on their preferences. 13:28 🌎 Visualizing Embeddings: Embeddings can be visualized in a multidimensional space, revealing clusters that represent relatedness. This can help gain insights into the relationships between various entities, like cities in different regions. Made with HARPA AI

  • @Shoaibkhan-oj3oe
    @Shoaibkhan-oj3oe Год назад

    loved this video, just so clean and easily understandable. Earned yourself a sub. Waiting for other videos to drop.

  • @adriaandavel9540
    @adriaandavel9540 Год назад

    I have searched and watched many videos, but the more I search the more confused I get. I hope you can point me to some information that will save my sanity. I think my idea is very basic, and a very common use case, but I cannot find examples or implementations of it. I have a text document of about 50 pages, which can be bigger and can be smaller. I want to give users the ability to "chat" with the document. I would prefer that the data does not go into the public realm. I would prefer to build a tool for this using c# without Azure services. Is this possible? It must be, so please save my sanity! :)

    • @clemenssiebler
      @clemenssiebler Год назад

      Sure, this is a common use case, maybe have a look at this post: clemenssiebler.com/posts/chatting-private-data-langchain-azure-openai-service/ I'd recommend using LangChain for this, alternatively, you can also look at Llama-Index, see clemenssiebler.com/posts/using-llamaindex-with-turbo-and-azure-openai-service/ Hope this helps!

    • @adriaandavel9540
      @adriaandavel9540 Год назад

      @@clemenssiebler thanks for the info! Is this possible without langchain and without Azure? langchain is not available for visual studio, and Azure has additional costs so I need to work without those

  • @moellerseo
    @moellerseo Год назад

    Thanks Clemens. This was a great video. Ideas for next video: compare embeddings for top 3 ranking articles on the topic of 'dental implants risks'. Google has a patent that looks for similarity of articles. They use embedding models as well. It would be interesting to see how similar the top articles are.

  • @ScottzPlaylists
    @ScottzPlaylists Год назад

    I would like to see a similar video, but classify files from the filenames in an input folder, and put them into a different folders subfolders where the name of the subfolders is the classification list to sort them into. (ex. AI, AI LLMs, AI Art, AI Tools, Health, FPGA, Crypto, etc) I'll use this to classify videos I've downloaded, into categories that I'll watch later by category. = A similar thing would be to classify screenshots from videos I watch into a similar categories of subfolders like above example, but the text to classify would have to come from OCR of the Images. Love your Channel.

    • @clemenssiebler
      @clemenssiebler Год назад

      Thanks for your feedback, this is a great use case that could be easily implemented by either embedding the filename/path and then training a simple classifier or, even easier, just using gpt-3.5-turbo to classify based on the filename into categories. In this case, you can try something like this prompt: You classify files based on their path/filename into categories. The category can be one of AI, AI LLMs, AI Art, AI Tools, Health, FPGA, Crypto. Path: ... Filename: Category: This might already work pretty well, given that path/filename is descriptive.

    • @ScottzPlaylists
      @ScottzPlaylists Год назад

      @@clemenssiebler Thanks for your reply. Here's my idea for "Offline AI DVR for RUclips" I'm not sure what you mean by "either embedding the filename/path and then training a simple classifier" but to spell out my Idea better: Have an input folder of Files to Classify by the filenames that are the title of the youtube video called Input_Folder Have a Output_Folders directory, where the Subfolder Names under Output_Folders are the Classes to train a neural net, to specify where the files should be copied to. I already have this setup and lots of youtube videos already manually moved into the Subfolders. This could be used to train the classifier. once trained I'd like to run the classifier on each filename in Input_Folder, the classifier would tell me the folder name to copy it to (I would call it Folder_Destination). Python would copy the file to \{Output_Folders}\{Folder_Destination. This would be an awsome little AI to organize files when you already have started the organizing. I use a nice little tool called "WinX RUclips Downloader" to get the files I've put in a public folder named like "12"(the Date I started a new download list) as I browse RUclips, I "Save to Playlist" and put it in "12" to put on my HD Later. WinX even puts them in a folder called "12 in this example. Then when I'm ready to Download and organize I copy the URL of the Playlist, paste it into Winx and it gets all the video files and puts them in "12". Then there's the long process of organizing them into categories on my HD like described above, so I can view Videos by catagory at my leisure. ( I used to be Offline most of the time, there was a method to my madness to get lots of videos for watching later). See all my Playlist names, and you can see I have a lot of interests. I keep the best videos by category on youtube playlists. It would be so nice to have the program I described and it would be the main component in a larger program later that would be like an Offline AI DVR for RUclips. Thanks for reading this long post, hope you whip up the code as a better programmer than me!

  • @test123-vu7gf
    @test123-vu7gf Год назад

    very helpful and concise. Looking forward to more content :-)

    • @clemenssiebler
      @clemenssiebler Год назад

      Awesome, thanks for the feedback! Greatly appreciated!