Thanks for the video! I will start testing this API with a POC I'm working on now, to learn.
They should sell their LPUs instead and compete with Nvidia. They would surely get lots of backing and investment. Otherwise they will probably be copied and fade away quickly.
Great video! Can you make a voice chatbot using Groq in one of your next videos, please? I would also love to see whether you do this on Streamlit, or whether it's too slow and you use something else. Thanks so much for your videos.
Planning on making that. For the voice chatbot, I might just do a CLI, though.
Why can't you use the ConversationalRetrievalChain instead of the ConversationChain? It can handle the memory by default, so there's no need to maintain it externally.
@prompt Engineering
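Not the video's code, but a minimal sketch of what that looks like, assuming the classic LangChain chain API and the langchain_groq integration (the model name and the toy index are placeholders):

```python
from langchain.chains import ConversationalRetrievalChain
from langchain.memory import ConversationBufferMemory
from langchain_community.embeddings import HuggingFaceEmbeddings
from langchain_community.vectorstores import FAISS
from langchain_groq import ChatGroq

# Tiny in-memory index just to make the example self-contained.
vectorstore = FAISS.from_texts(
    ["Groq's LPU is a chip designed for fast LLM inference."],
    embedding=HuggingFaceEmbeddings(),
)

# The chain reads and updates this memory itself, so nothing is tracked externally.
memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True)

chain = ConversationalRetrievalChain.from_llm(
    llm=ChatGroq(model_name="mixtral-8x7b-32768"),  # reads GROQ_API_KEY from the environment
    retriever=vectorstore.as_retriever(),
    memory=memory,
)

print(chain.invoke({"question": "What is the LPU?"})["answer"])
print(chain.invoke({"question": "Who makes it?"})["answer"])  # follow-up is resolved via memory
```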
This is next level. OpenAI has some serious competition.
Please create a step-by-step video guide on using the Groq API with Streamlit.
That's coming soon
Thanks for your content! I'm using Streamlit as well and want to set content for the system role, for example "answer me in short sentences in Italian", so it does this for every prompt. Where can I do this in the code? I used the Streamlit chatbot repo. Thanks in advance.
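While waiting for a reply: here's a minimal sketch of the usual pattern, not the exact repo code. It assumes the official groq Python client and the typical Streamlit session-state message list; the model name is a placeholder. The trick is to prepend the system message on every call:

```python
import streamlit as st
from groq import Groq

client = Groq()  # reads GROQ_API_KEY from the environment

SYSTEM_MESSAGE = {
    "role": "system",
    "content": "Answer me in short sentences in Italian.",
}

if "messages" not in st.session_state:
    st.session_state.messages = []  # user/assistant turns only

if prompt := st.chat_input("Your message"):
    st.session_state.messages.append({"role": "user", "content": prompt})
    response = client.chat.completions.create(
        model="mixtral-8x7b-32768",  # placeholder model name
        # Prepend the system message on every call so it governs each turn
        # without being stored in the visible history.
        messages=[SYSTEM_MESSAGE] + st.session_state.messages,
    )
    reply = response.choices[0].message.content
    st.session_state.messages.append({"role": "assistant", "content": reply})

for m in st.session_state.messages:
    st.chat_message(m["role"]).write(m["content"])
```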
Awesome stuff!!!!
Here's the question: can Groq cards also do inference for art, audio, and voice models, or is it LLM-inference specific? It is, well, super fast... the only worry is literally the latency from you to the endpoint... so if it's, say, a streaming, interruptible feed you are giving the model, then the use cases for TTS and speech applications just went through the damn roof!
I am not sure, but I was listening to Chamath (who is an investor in Groq), and he was talking about the initial use cases of the hardware. It seems like they were focused on vision, so it might have that ability.
I am trying to put together an example of an end-to-end speech conversation; let's see how that goes.
Very helpful.
How do you control the output of the LLM for a single input?
What is the time to receive the first chunk when streaming?
It depends on the number of input tokens. With a one-line instruction it's below 1 second. If you include the context of a RAG system, it can take up to 3 seconds for the first token to arrive (with 30k tokens of context).
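If you want to measure it yourself, here's a rough sketch with the official groq client (model name is a placeholder): stream the response and time the first chunk.

```python
import time
from groq import Groq

client = Groq()  # reads GROQ_API_KEY from the environment

start = time.perf_counter()
stream = client.chat.completions.create(
    model="mixtral-8x7b-32768",  # placeholder model name
    messages=[{"role": "user", "content": "Say hello."}],
    stream=True,  # yields chunks as they are generated
)

for chunk in stream:
    # The first chunk marks the time-to-first-token.
    print(f"First chunk after {time.perf_counter() - start:.2f}s")
    break
```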
What are the rate limits of the free API? Is it necessary to provide a credit card?
It's free at the moment, and there is a rate limit as well. It seems to keep changing; last time I checked, it was around 20 messages per minute.
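If you hit the limit, a simple hedge is to back off and retry. This sketch assumes the groq client raises RateLimitError on HTTP 429, like the OpenAI-style SDKs it mirrors (model name is a placeholder):

```python
import time
from groq import Groq, RateLimitError

client = Groq()  # reads GROQ_API_KEY from the environment

def ask(messages, retries=5):
    """Call the chat endpoint, sleeping and retrying when rate-limited."""
    for attempt in range(retries):
        try:
            resp = client.chat.completions.create(
                model="mixtral-8x7b-32768",  # placeholder model name
                messages=messages,
            )
            return resp.choices[0].message.content
        except RateLimitError:
            time.sleep(2 ** attempt)  # exponential backoff: 1s, 2s, 4s, ...
    raise RuntimeError("still rate-limited after retries")

print(ask([{"role": "user", "content": "Hi"}]))
```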
You're really good, man ^^👍
Hi, does this API have function calling? Regards.
How can the Groq chip run Mixtral 8x7B with just 250 GB of VRAM?
Because of the Groq LPU...
Almost a baby version of a quantum computer, if you can actually perfect a model based on the speed of responses to your questions using the Groq chip...
If the temperature could be adjusted to a negative value, what would the impact on generation be? (Consider it hypothetical if that case doesn't exist.)
It would be the same as setting it to zero :) Basically, if you set it to zero, it will pick the single most probable next token. If you set a higher value, it can sample among the most probable tokens.
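To see the effect described above, a quick sketch with the official groq client (model name is a placeholder), comparing temperature 0 with a higher value:

```python
from groq import Groq

client = Groq()  # reads GROQ_API_KEY from the environment
question = [{"role": "user", "content": "Name a color."}]

for temp in (0.0, 1.0):
    out = client.chat.completions.create(
        model="mixtral-8x7b-32768",  # placeholder model name
        messages=question,
        temperature=temp,  # 0 -> pick the most probable token; higher -> sample
        max_tokens=10,
    )
    print(temp, "->", out.choices[0].message.content)
```

Run it a few times: the temperature-0 answer should stay (near-)identical across runs, while the higher-temperature one varies.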
Can it run other models?
I tried a few things with this and it is incredibly fast.
I agree!
Can we fine-tune this and use it?
You can't fine-tune via their API yet.
Hi, is it free or paid?
Free at the moment
Wow
Fuck all these cloud-only AI services, release the cards!
Yes, otherwise they will fade away quickly. Their window of opportunity is small. Money wants a piece of the Nvidia cake now, not tomorrow.
Groq is not an LLM; it can run an LLM.
YALLM ... it is almost becoming daily news ... Yet Another LLM.
Fast but useless. These OSS models are still way behind GPT-4.
Bro, Groq outsmarts GPT-4 with its 70B model.
It is way faster than GPT-4.
This is an ad.
Did someone say free?
For the time being :)