So much love for this incredible community! Hope you like this video
Thanks for the work, hope you gain 1 million subs🎉
seriously, you guys have no idea how much I've learned from this channel. The value is incalculable. Thank you so much!
You look like an AI-generated model 😅
Thank you for all your work 🙏 you are an inspiration 😊 I hope one day I can be as good as you
@@rameezalam1968 😂
Can we stop just for a moment and appreciate her!!! Learned loads of things from you!!! Hats off 🎓🎓 ❤❤ Love you from Ethiopia
She's an actor. There's tonnes of people behind the scene to write this content.
@maciej2320 wow amazing me too from Ethiopia ❤
How do you know she's creating the content? You learned loads of stuff, but is it making you more money?
@@maciej2320 I made this myself :)
@@jay_wright_thats_right I made it myself :)
do you agree that this legendary channel is better than most paid courses for coding out there?
Having programmed since 1963, it looks like a return to COBOL. Programming today is 90% getting things hooked up. 10% getting things done. 100% being clueless as to what's going on under the covers. Follow the bouncing ball programming.
A few months following their courses and their YT channel. I believe it's as useful (if not more) than my university degree, which took a lot of money and years.
I just completed their certification course in Responsive Web Design and actually learned more than from any other course, in only 40 days (300-350 hours), and everything was free too
what, do you mean there is such a thing as a paid course?? I am already 100% clueless.
@@Jd-zd6bh I said it's free, meaning all the courses on their website are free and come with a certificate
This lecture on vector embedding is undoubtedly one of the best I've encountered! Huge thanks to Ania and FCC! Kudos to you all!
Ania, that was freaking amazing! You simplified all the concepts without going too high-level and dumbing it down altogether. You told us what happens and showed us HOW they happen. I found this very informative and you answered so many questions I have been pondering. I'm not a developer or an AI person, I'm a network engineer. So, thank you...
By the way, I used an embedding model to map your face, and the semantic engine returned the words "gorgeous," "lovely," "beautiful," etc... 🙂
You have a talent to deliver complex information in a very interesting manner! Waiting for more videos!
Wow. Probably the best lecture to meaningfully explain what vector embeddings are, and how cosine similarity works. Thank you very much!
Nice introduction to vector embeddings with a clear explanation. One of the best lectures/videos I've encountered on YouTube related to this topic.
You're dazzling and wise, a true blend of grace and intellect.
Wow, the instructor for this vid was actually amazing. I only clicked on it because it was 30 minutes long, having no real intention to actually learn, just to have it play in the background while I read a textbook for fizz. The instructor was phenomenal; I understood everything she said, and every instruction was clear to follow, although I only really know some JavaScript and C++. I actually learned a few things. Like I said, before the video began I had no real intention of implementing this, but since I actually learned and understood pretty much all of it, I could see myself implementing it on some project and adding it to my resume. Would be cool. Thanks
ok simp
This girl is an ideal perfect educator!
Simply love this presentation! That vec math (King - man + woman = Queen) just blew my mind!
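For anyone curious what that arithmetic looks like in code, here is a toy sketch. The 4-dimensional vectors are invented so the analogy works out; real models learn hundreds of opaque dimensions from data.

```python
# Toy illustration of the king - man + woman = queen arithmetic.
# These 4-dimensional vectors are made up for illustration only.
vectors = {
    "king":  [0.9, 0.9, 0.7, 0.1],
    "man":   [0.1, 0.9, 0.2, 0.1],
    "woman": [0.1, 0.1, 0.2, 0.1],
    "queen": [0.9, 0.1, 0.7, 0.1],
}

def add(a, b):
    return [x + y for x, y in zip(a, b)]

def sub(a, b):
    return [x - y for x, y in zip(a, b)]

def dist(a, b):
    # Euclidean distance between two vectors
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

result = add(sub(vectors["king"], vectors["man"]), vectors["woman"])

# Find the word whose vector is nearest to (king - man + woman)
nearest = min(vectors, key=lambda w: dist(vectors[w], result))
print(nearest)  # queen
```

In a real embedding space the nearest neighbor of the computed point is found the same way, just over a vocabulary of many thousands of learned vectors.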
🎯 Key Takeaways for quick navigation:
00:00 📘 Anikubo's course covers vector embeddings using OpenAI's GPT-4.
01:49 🖥️ Vector embeddings transform various data types into numeric forms for algorithm processing.
06:12 📈 Numbers can represent complex data, and cosine similarity helps compare them.
08:04 🌐 Embeddings find applications in recommendation systems, NLP tasks, and more.
14:04 🛠️ LangChain, an open-source framework, enhances AI interactions, chaining models and data.
23:25 🛠️ The tutorial walks through setting up a Python environment and key scripting steps.
24:22 ⚙️ Essential packages and tools are installed for AI development.
33:17 🤖 The AI assistant, using vector-based search, fetches relevant documents from a database.
Made with HARPA AI
You're a lifesaver
thank you. a TIMELY course for my projects!
This is a masterclass!!! Thanks so much! Really appreciated!
So well done and well put together! Thanks for the value ❤
I did this in my Numerical Analysis course using Maple and MatLab. Then i did some analysis on images when i took Fourier analysis. Shame i never got much chance to use it professionally, as i tend to work with financial data.
Those skills are useful for grad school and research
Fun fact: GPT-3 has a vector embedding size of around 12,288, which is 100 times more than tiny models and 25 times more than normal NLP models.
Dumb question: are they interoperable? Like creating embeddings for a dataset with GPT-3 but then comparing them to a new embedding created by a different model
This channel is goldmine
Incroyable !
Ania is the best.
Great video very detailed 🎉 0:35
Wow mashallah, you are amazing freeCodeCamp, we love you from Ethiopia 🇪🇹
I took the certificate in Responsive Web Design; that was amazing ❤❤❤
I'm just studying this... Thanks
This is a very good video, but I would like to understand why we need DataStax and storing in a DB if the intention is just to use a prompt and get an answer. We can get answers directly from OpenAI with a key and a prompt, without storing anything or doing anything with vector embeddings; those would be internal to OpenAI. I wanted to understand the use case for this approach.
Valid question
I believe this method is better than fine-tuning and significantly superior to using prompts, especially when you have a lot of information; the chat will provide much better answers.
I think this is more for AI to answer questions on your data, hence she downloaded data from Hugging Face. But this could also be your own data, vectorized, stored in a DB, and queried. I may be wrong, but this is what I infer.
Thank you so much for this amazing video! I learnt a lot from it!
Thank you for sharing the information and knowledge.
Any educational course should always first explain what prerequisites are necessary to understand and learn the course material.
Think it's safe to assume that if you're here, you know at least a bit of CS.
Love it !
Thank you! this was really insightful.
@AniaKubow you're great at explaining. The only thing lacking in this otherwise excellent video is Poetry ;)
Great tutorial, thanks! Lol at those answers it was spitting out though...
Thanks for making this video
i like the course which is less than 1 hour
Exactly my thought
This came just in time as I just discovered Flowise which is just a code-less LangChain and wanted to play around with long term memory for my models
Thank you for the tutorial. Do some LLM models and tools like ChatGPT handle all the tasks, from storing data in vector bases to querying relevant data? Doesn't OpenAI provide any database for storing the embedded text, which is why we used Cassandra for this purpose?
Brilliant!
Amazing explanation! How could I use an existing Access database for my data set? It actually contains text reports and keywords for each report.
Awesome Video to get started in AI. Any reason why you used datastax instead of a vector DB like pinecone?
Thank you for so clearly and articulately presenting these lessons for us for free! Your eyes, your smile, and your beauty all together it is incredibly distracting to me though haha! I mean you are one of the most gorgeous women I've ever seen in my life and I appreciate your time. Sorry to come off like a creep but you're stunning.
Thanks a lot, amazing video
say you wahnt the computah to scan this for whads...
i appreciate the video, and your accent.
This tutorial is very interesting, except that without a paid account on OpenAI we cannot really put it into practice. But I can't afford to pay 20 euros per month just to set up a tutorial.
You don't need to sign up for OpenAI Plus in order to create an API Key, they are billed separately. You also get a free 3-month API credit when you first create an account, the amount varies, I think they've decreased it now to about $5 (unfortunately I missed out on my credit, since I created my account last year and wasn't coding anything)
@@donaldoalmazan7338 thx for info
Actually, the $20 for ChatGPT is different from the API. For the API you can, for example, buy $10 of credit and use it for as much as you would like; unless it's used for fine-tuning, it will last long
I'm first woohoo tho I can only write "hello world" 🙂
Damn. Jay took one for the team learning vector embeddings 💀
When calculating cosine similarity, does a value closer to 1 mean more similar and a value closer to -1 mean less similar?
1 indicates an identical vector, or very close semantic meaning, or even identical text. (Note that it's the similarity of the vector's direction only, not scale.) 0 indicates an orthogonal relationship, ie, unrelated semantically. -1 in theory represents complete semantic opposition, but in practice, a perfect -1 is rare in natural language contexts.
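To make that concrete, here is a minimal cosine similarity in plain Python. The 3-dimensional vectors are invented for illustration; real embeddings have hundreds or thousands of dimensions.

```python
import math

def cosine_similarity(a, b):
    # Cosine of the angle between a and b: dot product over the product of magnitudes.
    # Only direction matters, not scale.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Made-up 3-dimensional "embeddings"
king = [0.9, 0.8, 0.1]
queen = [0.88, 0.82, 0.12]
car = [0.1, -0.3, 0.95]

print(cosine_similarity(king, queen))               # close to 1: very similar direction
print(cosine_similarity(king, car))                 # near 0: largely unrelated
print(cosine_similarity(king, [-x for x in king]))  # -1: exactly opposite direction
```

Scaling a vector (e.g. doubling every component) leaves its cosine similarity to anything unchanged, which is why the answer above notes that only direction is compared.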
When we ask a question using the vector database, like "what are the biggest questions in science", does it consume tokens from the OpenAI API as well?
Wow. .Lovely
I'm new to this. At 13:18 what terminal is she using to input the code?
That is the simple Terminal. Later she uses Visual Studio Code
It is the Linux terminal... curl is a Linux command... but if you are on Windows, you may use WSL to get a Linux terminal on Windows
Can we do this for image search? Can we see embeddings of images? Can langchain do that? Thanks
a Git repo with that would be awesome
Do we have to have a GPT-4 premium subscription to follow this course?
How do ML models create embeddings for new, or novel, words? For example, what if I fed it "Hexamethylenetetramine" (an organic compound)? My brain is frying thinking about this...
9:50 "...we also use it for information retrieval...": How does it deal with misspellings, either in the query or in the training data?
With a lower score than expected
The model has seen misspellings before, and it knows they are related to the correct spelling more than other words.
Amazing. How could it know? @@technolus5742
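It "knows" because modern models never see whole words at all: a subword tokenizer (BPE, WordPiece, etc.) splits any string, even a novel word, into pieces that were in its training vocabulary, and the model embeds those pieces. A toy greedy longest-match sketch, with a vocabulary invented purely for illustration (real tokenizers learn theirs from data):

```python
# Invented mini-vocabulary of subword pieces, for illustration only
VOCAB = {"hexa", "methyl", "ene", "tetra", "mine", "amine", "hex", "a"}

def tokenize(word):
    """Greedy longest-match subword split, falling back to single characters."""
    word = word.lower()
    tokens, i = [], 0
    while i < len(word):
        # Take the longest vocabulary entry matching at position i
        for j in range(len(word), i, -1):
            if word[i:j] in VOCAB:
                tokens.append(word[i:j])
                i = j
                break
        else:
            tokens.append(word[i])  # unknown character: emit it on its own
            i += 1
    return tokens

print(tokenize("Hexamethylenetetramine"))
# ['hexa', 'methyl', 'ene', 'tetra', 'mine']
```

So a never-before-seen chemical name still gets a representation, built from the embeddings of familiar fragments; a typo works the same way, which is also why misspellings land near, but not exactly on, the correctly spelled word.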
Is there a repo with the code?
Hey, I would love it if you could somehow make a video on Bun and ScyllaDB; I've been trying to learn them but there's no source 😥
10:51 THIS is a 'golden nugget' right here: "The core advantage of vector embeddings..." Such a great summation of exactly what an ai model really is. Thanks for such a fantastic video. I love it !
What is the prerequisites of this course?
passion to learn and explore
Link to a gist file with the code would be helpful please.
Is it possible to connect this to a custom GPT for the openai store?
Is there a tutorial on doing these using LLaMA?
Is Python a good language to learn DSA? Because on the internet there are a lot of people saying you should learn Java/C++.
Why use OpenAI instead of some open-source model?
The download secure bundle option is not showing, and where do I get the client secret ID? Please help
someone
I did not understand how the LLM (OpenAI) uses the embeddings stored in the DB.
LangChain: "chaining" resulted in two answers for each prompt: "I don't know" and the headlines. The first answer came from OpenAI's LLM; the second answer (the headlines) came from the vector DB (Astra/Cassandra) that she set up outside OpenAI. LangChain was the bridge between the two.
It's a simple little example without much relevance, but it shows the bones. There is a lot more work to make something useful.
For instance, you could use a pre-trained LLM to perform the organizational tasks and composition (the language parts) using current data from a real-time source. For instance, "what kind of activities would be good at Los Angeles beaches today?"
The LLM could contextualize the meaning of the question using pre-trained data (an understanding of what constitutes beach sports and the conditions necessary for each is something that won't much change over time), and then use an external source (weather channels, surf sites, diving data sites, sailing sites, etc.) to search for real-time conditions at LA beaches. The LLM, using the current real-time data, could then look for nearness matches based on how the conditions match up to certain sports.
So instead of a generic pre-trained answer like, "people like to sunbathe, swim, dive and surf at beaches" you can get a specific answer such as, "The conditions at Redondo Beach suggest it's a good surf day, but rip tide warnings suggest it is a bad day for people to be swimming. There is a water quality alert for bacteria in Santa Monica Bay."
The LLM used the external real-time data to give accurate point-in-time suggestions that it would otherwise never have using data from training months earlier. That's where LangChain can help - merging "new" or custom data into the pre-trained contextual LLM model.
Hope this helped.
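To make the retrieval half of that flow concrete, here is a toy, self-contained sketch of the step LangChain automates. The `embed()` here is a bag-of-words stand-in, not a real embedding model, and the documents are invented; a real setup would call an embedding model (e.g. OpenAI's) and a vector DB such as Astra/Cassandra.

```python
import math
from collections import Counter

def embed(text):
    # Stand-in "embedding": a sparse word-count vector.
    # Real embeddings are dense vectors from a trained model.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[k] * b[k] for k in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Invented "real-time" documents standing in for the external data source
docs = [
    "Rip tide warnings issued for swimmers at Redondo Beach",
    "Water quality alert for bacteria in Santa Monica Bay",
    "New traffic rules announced for downtown Los Angeles",
]

question = "Is it safe to swim at Redondo Beach today?"
q_vec = embed(question)

# Retrieve the most relevant document by vector similarity
best = max(docs, key=lambda d: cosine(q_vec, embed(d)))

# Ground the LLM in the retrieved context instead of its stale training data
prompt = f"Answer using this context: {best}\n\nQuestion: {question}"
print(best)
```

The chain then sends `prompt` to the LLM, which is exactly how the "new" data gets merged into the pre-trained model's answer.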
help with this error on this part of code anyone?
llm = openai(openai_api_key=OPENAI_API_KEY)
TypeError: 'module' object is not callable
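That TypeError means the lowercase `openai` module itself is being called like a function. You can reproduce the mechanism with any module:

```python
import math

# Reproduce the error: calling a *module* object like a function.
# This is the same mistake as `openai(...)`: `math` and `openai` are modules.
try:
    math()
except TypeError as e:
    error_message = str(e)

print(error_message)  # mentions that a 'module' object is not callable
```

The fix, assuming the LangChain version used around the time of this video, is to instantiate the wrapper class rather than call the module: `from langchain.llms import OpenAI` and then `llm = OpenAI(openai_api_key=OPENAI_API_KEY)` (note the capitalization; check your installed LangChain's docs, as the import path has changed across versions).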
It seems the secure bundle (20:36) option is no longer available. Can someone confirm whether that is so?
Yeah, it seems to have moved location: open your table, and on the right side under 'Database Details' > 'Region', click the three dots and select download SCB :)
so fine
I used the Python code to get the embedding; it ran with no errors but returned no results?
Embeddings vs Fine tuning?
Too bad nowadays OpenAI won't let you use an API key unless you are a paying customer
I was going to say, I cannot run the initial vector embedding program because of billing issues.
please guys make video on laravel react js
Vectors of 2704 table matrix
Id like for her to embed my vector
thank you! sadly you don't go deep into the needed data... how big the documents are, etc... but still good, thanks!
Can a non-coder take this course?
did anyone run into this error
llm = openai(openai_api_key=OPEN_API_KEY)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: 'module' object is not callable
Does anyone know cheaper alternatives to OpenAI's GPT APIs?
Google bard API
Make video about godot for unity users
How to use chat GPT API key
lol I am not able to find the API keys and client secret keys for Astra
4:40 "...Joe is 38 on the 0 to 100 scale... so -.4 on the -1 to 1 scale...": How is that? I get -.24. If it's -.4 on the -1 to 1 scale, that's 30 on the 0 to 100 scale. Please fix my math.
Context has two flavors, near and "not near". Joe is 38% near. Maybe Alice is 40% "not near", which would equate to a negative value (-.4).
So context is more than "this one is like the other", it's also "but it's not like this other thing". If we just used a single dimension, then literally everything would be "like" everything else, which makes it a little difficult to differentiate.
"The school bus is yellow. A banana is yellow. A bus is NOT LIKE a banana." Dimensions in dimensions.
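On the arithmetic itself: a plain linear rescale from the 0-100 scale to the -1 to 1 scale is v/50 - 1, so 38 does land at -0.24, and it is 30 that maps to -0.4, which matches the question above. A quick check, assuming a simple linear mapping:

```python
def to_unit_range(v, lo=0.0, hi=100.0):
    """Linearly map v from the [lo, hi] scale onto [-1, 1]."""
    return 2 * (v - lo) / (hi - lo) - 1

print(round(to_unit_range(38), 2))  # -0.24: Joe's 38 on the 0-100 scale
print(round(to_unit_range(30), 2))  # -0.4: the value that actually maps to -0.4
```

If the video's figure really was -0.4 for 38, it was either rounded loosely or used some mapping other than a linear one.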
Is anybody else having the problem of "You exceeded your current quota, please check your plan"?
Great video, but @AniaKubow, if you do not mind, you could have had a very successful career in modelling. Refreshing to see that you chose computing and specifically AI.
perfect 👌
There are a lot of gaps in this tutorial, especially in the programming part. Good for conceptual learning, but don't recommend the implementation.
What's wrong with her explanation?
She doesn't explain the basics. What is the terminal that she uses?
you can use a vscode terminal
@@chidiebere I'm using VS Code and I'm getting errors about the API key I created. Is there a way to validate a key?
Hello, please help me: how can I run a .jar file from HTML/JS? Please
It made me really laugh how she speaks about word and calls it "text". LOL
I don't code, don't know how to; I am here due to the thumbnail
👸🌟💝🌹💕🌹🌹🌟🌟
🎯 Key Takeaways for quick navigation:
00:14 📉 *Sam Altman was fired by OpenAI's board for not being consistently candid in his communications, leading to implications of lying by omission.*
01:11 🤯 *Various theories circulated, including speculation about dangerous AI developments, financial ties with Saudis, and a letter from former employees alleging dishonesty.*
01:53 🌐 *OpenAI employees expressed discontent, with over 500 threatening to quit, potentially joining Microsoft to dominate the AI space.*
02:21 🔄 *After negotiations with Microsoft failed, Altman and Brockman formed a new AI research team at Microsoft, but Altman eventually returned to OpenAI as CEO on November 21st.*
03:02 ❓ *Uncertainty remains about the true reasons behind Altman's firing, with speculation about conflicts of interest, AI commercialization, or a possible publicity stunt involving the board, Microsoft, and Altman.*
Made with HARPA AI
It is disappointing that you have taken the route of OpenAI which is far from open - not only have they not open-sourced their models, their payment methods are limited to a few options and setting up billing is required for their API to work. It would have been much better to use free, open-source models to demonstrate LangChain.
Ania looks fake. I don't know if it is the lighting or the background or what but it just looks fake. It's freaking me out. The voice sounds natural at least.
you did not explain cosine similarity and why this is a good definition of similarity when it comes to comparing vectors. Without understanding this, everything you say after it is incomprehensible
Very educative. Thank you for this video. We have prepared a Russian version: ruclips.net/video/q5dx9jiYFPM/видео.html.
How did you prepare a Russian version?
are you Ai generated or an actual person.... say hi if you are AI and say dhinchak-dhinchak if you are actual persona...yeah audio will be better
yawn why is modern programming just gluing together packages. not this channel or the creator of the video's fault but i just hate what it's all become
most of this stuff is expensive to create
It takes time to create some of them if you were to code them from scratch. Also, you don't want to "reinvent the wheel"
Imagine kissing her
I need a program that remembers geo locations to work with satellites for live location accuracy hit marks