Deep Learning Chatbot R&D

sentdex

71K views

Published: Oct 4, 2024

Comments • 175

@sentdex 4 years ago ⁺¹²
Sample chat inputs and responses, along with my scoring that you can choose to work on top of, or not, if you want to take part: github.com/Sentdex/chatbotrnd/blob/master/chat_question_examples.py
@jt80995 4 years ago ⁺²
Long form is interesting. There are videos out there that are 4 to 8 hours long of straight coding. I consume them! Improves my coding skills
@ismailashraq9697 4 years ago ⁺¹
sentdex, is there a problem with the video? It's always crashing at 1:37. Couldn't watch it 😩.
@USAIsrUKEUVngrdBLRckOccupiedUA 4 years ago
Someone could train their own chat AI on people's conversations from social networks.
@briarmario9463 3 years ago
@Matteo Trace definitely, been using flixzone since March myself :D
@jacquesthibodeau1665 4 years ago ⁺⁶⁷
Hey Sentdex, these videos are actually exactly what I'm looking for. The problem with teaching stuff (especially in the programming world) is that things are always way too sterile. Like, OK OK OK, we've had enough clean dataset machine learning. There is a lack of R&D videos in ML. More videos like this help in two main ways: 1) we get to see how great programmers problem solve 2) it makes it less demoralizing seeing someone else also struggle and allows us to cultivate a mindset of pushing through hurdles.
Keep it up!!
@merth17 4 years ago ⁺²
Jacques Thibodeau this comment is underrated
@mechanicalmonk2020 4 years ago
Some of the videos I make (different account) are like this, and they get absolutely no traction. Sterile videos get views even though I want to make detailed videos, mistakes and all.
@muslim2k 1 year ago
@@mechanicalmonk2020 Make those videos, get your channel going, and afterwards you can upload research videos
@raduciurca 4 years ago ⁺⁵⁹
I was wondering what Snowden is up to in quarantine
@randallwalkerdiaz1002 4 years ago ⁺⁹
He has always been in quarantine
@benjamin2713 3 years ago
@@randallwalkerdiaz1002 yes because he leaked too much
@IAmCandal 2 years ago
He does look like Snowden lol
@hXrsmurf 4 years ago ⁺⁴
I definitely enjoy this format because it allows me to learn how you think through problems!
@pafnutiytheartist 4 years ago ⁺¹
Please do more videos like that. You are completely right about me sitting at home and begging for content.
@jeanjacquesstrydom 4 years ago ⁺³⁵
Sentdex: "Are you a machine?"
Machine: "No, I'm dad!"
🤣🤣🤣🤣🤣🤣🤣
@BigTylt 4 years ago
Reddit's always gonna Reddit.
@WilliamWonker 3 years ago ⁺³
10:38 “I’m a software engineer. I’m a software”
It has become self-aware, you accomplished more than you ever could have imagined.
@AlFredo-sx2yy 1 year ago
@Arbaazbaby way to kill the joke
@axu6207 4 years ago ⁺¹
I love watching these kinds of videos, it gives so much insight into these kinds of projects.
Also for some of my more casual projects, these are insanely motivating, and they really help me want to code more just for fun.
Great work! Keep it up!
@setukumari5485 4 years ago ⁺³
A legendary Jedi Master in the sci-fi movie series Star Wars, Obi-Wan Kenobi was a noble man and gifted in the ways of the Force. He trained Anakin Skywalker, served as a general in the Republic Army during the Clone Wars, and guided Luke Skywalker as a mentor.
@rushirajparmar9602 4 years ago ⁺⁵
I just see a notification from sentdex and I watch it on the spot 😂😂 Great content man!!
@adempc 4 years ago
Thanks and thanks.
Haven't had time to watch any recent videos. But it's great they are here during this time.
@DSharty1337 2 years ago
7:22 “I’ve been constantly running train” Chad sound byte
@ErnolDawnbringer 3 years ago ⁺³
You know you have found the right chatbot when you type "hello" and it responds: general kenobi
@Kman3107 4 years ago
I came in confused. When the video was done I couldn't even leave, because I'm now completely lost.
I'm basically just here to listen to Sentdex have fun. Living vicariously through you, Sentdex.
@Tom-kg6qe 4 years ago
Someone has probably commented this before but it's a Star Wars meme.
Obi-Wan Kenobi drops from the rafters in front of General Grievous. Surrounded by Grievous's troops, he says: "Hello there!"
General Grievous retorts with "General Kenobi" and proceeds to draw his lightsabers to fight.
Also, could you split strings by punctuation and then check for each string in the list if the string occurs more than once in the list?
42:41 - "Sorting dics"
Intentionally misquoting Sentdex
53:00 - Possible reason why the answer isn't changing is because you're not retraining the models?
I was doing some Data Science stuff and one of the projects the tutor was speaking about doing was creating a Spam Filter. He was using Pandas DataFrames so that might be a more efficient method of sorting answers.
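A minimal sketch of the split-by-punctuation idea from the comment above, not the code from the video. The function name repeated_segments and the sample response are assumptions made for illustration.

```python
import re
from collections import Counter


def repeated_segments(response):
    """Return the segments that occur more than once after splitting on punctuation."""
    # Split on common punctuation, then strip and lowercase each piece so
    # "Black" and " black" count as the same segment.
    segments = [s.strip().lower() for s in re.split(r"[.,!?;:]+", response)]
    # Ignore the empty strings that split() leaves around trailing punctuation.
    counts = Counter(s for s in segments if s)
    return {seg: n for seg, n in counts.items() if n > 1}


# Hypothetical bot response used only to exercise the function.
print(repeated_segments("Black, Black, Black, I like black."))
```

A non-empty result could then be used to lower a response's score, though how much to penalize it is a separate choice.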
@marx427 4 years ago
sentdex you are awesome. I discovered your CNN series with the cat vs dog classification. Now I'm obsessed with Computer Vision and I can finally say that my math bachelor's is useful, FINALLY !
@vinayreddy8683 4 years ago
My concentration always goes towards that EEG helmet behind you!!! That's what my master's thesis was about but, here I'm hanging on NLP.
@iamparadox1750 4 years ago ⁺⁵
Hey sentdex, here's an idea I had, and sorry if it's wrong: what if there are 2 chatbot models chatting with each other, and every time one replies to the other's sentence/question there's an understanding score that measures how good that reply was, meaning how well the other chatbot was able to understand it? It's kind of like GANs, where two neural networks compete with each other to get better. Because we humans are like that: the more understandable someone's question/sentence is, the better we are able to reply to it. Again, sorry if it sounds totally foolish.
@ataadevs 4 years ago
Great video, love the chatbot. One thing I noticed is that when the answer to one of your questions tended to be a short one, the long answers were not good ones. So maybe penalize answers by length in such scenarios, especially when the question is not too common (didn't occur in the dataset) or if the model is overfitting on some of the common words, like 'living' for example. I guess there are tons of things one could try and test on this amazing chatbot!!!! We are waiting and hungry for more videos
@offgridvince 6 months ago ⁺¹
Great job! I loved it
@benjaminbianchi1804 4 years ago ⁺²
Great video :)
If you love working with bots, I recommend the Rasa library; it's great and has pretty solid documentation.
@antoniorivero1434 4 years ago
This content is great!! Thanks so much for all your videos! Just clicked the notifications button :D Oh and instead of having the if statement for checking if i_count is less than zero, you can just do max(0, i_count-1) woo
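A small sketch of the max() suggestion above. The video's exact code is not reproduced here, so the if branch is an assumed reconstruction that steps i_count back by one and clamps the result at zero; both function names are hypothetical.

```python
def previous_index_if(i_count):
    # Assumed shape of the original: step back one position,
    # then reset to zero if that went negative.
    prev = i_count - 1
    if prev < 0:
        prev = 0
    return prev


def previous_index_max(i_count):
    # The same clamp written as a single expression.
    return max(0, i_count - 1)


for i in range(-2, 4):
    print(i, previous_index_if(i), previous_index_max(i))
```

Both versions agree for every integer input; the max() form just removes the branch.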
@RorretsNet 4 years ago ⁺²
"I'm a software engineer. I'm a software" it knows too much!
@ndiayemodou3474 4 years ago
You are making the world a better place. Thank you !!!!!!
@hitesh_valecha 4 years ago
Sentdex, you provide some of the best material for Python (I watch a lot of Python videos online, so trust me --- other viewers)
Keep up the good work :)
@sentdex 4 years ago
Thanks!
@SubsNoVideoChallenge-wl7mz 3 years ago
THIS MAN HAS EVERY TUTORIAL I WANT
@collinxsmith 4 years ago
Did I like this video?
> No, I loved this video. More plz.
@Alaska-mk4ok 4 years ago ⁺²
I tried to make the same chatbot also using Seq2Seq using the Cornell corpus dataset but google colab ran out of RAM :(
Edit: Also my model had a 1 layer encoder and a 1 layer decoder each with 500 neurons LOL
@frog706 4 years ago ⁺¹
Great video, very interesting.
@maxhouseman3129 4 years ago ⁺¹⁷
"I'm a meth addict". WTF the source is Reddit, right? 😅
@coenkuijpers1800 4 years ago
I would love to see a vid about the preparation of the input and the structure of the network along with labeling.
@techycompute3636 4 years ago ⁺¹²
Is it just me or the sound is really low?
@sentdex 4 years ago ⁺¹
Might be lower than normal, but I can turn it up to be obnoxiously loud still.
@rose123998 4 years ago
yes it is
@samarthnagpal1697 4 years ago
Niceee. Quick question, when will neural network from scratch videos be made? Great work anyway!
@NicolaMastrandrea 4 years ago
Keep making videos like this.
It's awesome and lets us focus on something.
Greetings from Italy!
Be safe.
@sentdex 4 years ago ⁺¹
Thank you, you too!
@hupa1a 4 years ago
Nice Video! I like that format
@ayior 4 years ago ⁺⁴
Here for the Algorithm now, I'll be back for the video later!
@jayksahani9215 4 years ago
sentdex you are a genius man
really....
@sifiso5055 4 years ago ⁺¹
Your latest content is amazing🙌
@ssasdaftt3 4 years ago ⁺¹
@Sentdex you are a great man, I have learnt a lot from you :) Lots of love from Pakistan. Stay safe.
@sentdex 4 years ago ⁺¹
You too!
@MrYUPIYU 4 years ago ⁺²
1:03:15 "como estas" which means "how are you", and the answer "Gracias por favor" means "thank you please", which is not ok, and it does not have any meaning in spanish. Anyway, great video
@carsonmcmanus8825 4 years ago
I can vouch for how good spaCy is. You can pick and choose what features you want to use. Highly recommend
@azira583 4 years ago ⁺¹
I'm a noob coder, but I have a question: can't it keep learning from new questions and answers, or do we have to train it all from scratch every time we want to add more answers?
@andrewm4894 4 years ago
Have you thought about using something like Mechanical Turk to help score responses in some way? Or a tutorial on some other data labelling solutions out there, could be an interesting series too.
@camdenparsons5114 4 years ago
I am building a chatbot too. What I find most curious is encoding/decoding representations of meaning into words or sentences. I'm assuming that the encoding and decoding for the model you are using were trained from scratch with the rest of the model. I wonder what happens if the encoding/decoding is trained separately, perhaps on a dictionary. I guess this falls under the discussion of whether it's best to create models that do end-to-end learning tasks, or connect separate modules trained for sub-tasks.
@kasper1374 4 years ago ⁺⁵
"Yes, I am satoshi nakamoto"
"Yes, bitcoin cash is bitcoin"
The second one scares me
Wtf is the source of the data
@dcastudios7185 4 years ago
Sentdex: What's your favourite color?
Bot: Black, Black, Black, Black, Black, Black, Black, Black
@DarkRedman31 3 years ago ⁺¹
32:18 I guess you're not yet a "power user" of Sublime Text. I would have selected "ans", pressed Ctrl+D twice to select the other two occurrences, and typed answer
34:39 Passing a space as the first argument to split is optional, since splitting on whitespace is the default.
38:10 Instead of a list comprehension you can fix it with a set comprehension. Well, no: you can use set(words) on line 19, like for word in set(words)
Btw I was surprised by the results, because in your test it doesn't print "up" twice before it shows the list, and looking at your code I don't get why it's filtered, but once fixed it should give the result 3 for total repetitions.
44:27 In the sorted function call you can add the argument reverse=True
49:37 I faced this issue and found a solution: you can do from collections import Counter and print(Counter(my_dict).most_common(5)) to get the top 5 values. If you want the index of the best value (when not sorted), you can convert your dict to a list of tuples with list(my_dict.items()), take the second column (the values) with numpy slicing, and call numpy.argmax() on it
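The tips above can be sketched with the standard library alone. The word list and the scores dict are made-up stand-ins for the video's data, and max(scores, key=scores.get) is swapped in here as a plain-Python alternative to the numpy.argmax() route.

```python
from collections import Counter

# Hypothetical response; split() with no argument splits on any whitespace.
words = "up up and away up".split()

# Iterating over set(words) visits each distinct word once,
# so a repeated word is only reported a single time.
repeats = {w: words.count(w) for w in set(words) if words.count(w) > 1}

# Hypothetical response scores keyed by response id.
scores = {"a": 0.2, "b": 0.9, "c": 0.5}

# sorted() with reverse=True orders the items from highest to lowest value.
ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

# Counter.most_common(n) returns the n highest-valued (key, value) pairs.
top2 = Counter(scores).most_common(2)

# The key of the single best value, without sorting anything.
best = max(scores, key=scores.get)

print(repeats, ranked, top2, best)
```

Note that most_common is a method on Counter rather than a function importable from collections.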
@utpalbandyopadhyay1633 4 years ago ⁺¹
You're awesome dude...!!
@nguyenthichduyduc 4 years ago
Thank you
@fuba44 4 years ago
Love the style!
@setukumari5485 4 years ago ⁺¹
Hi, but can you make a video again on how to make a chatbot from scratch with TensorFlow 2?
@matts3414 4 years ago ⁺¹
If the scoring is so important and complex, why not turn it into a machine learning task? You could create a dataset of acceptable sentences and unacceptable sentences and get a model to predict the probability that each sentence is either acceptable or unacceptable, or something
@myildizhan 4 years ago
Hi which OS are you using and which screen recording app you are using. Thanks for great videos by the way :)
@sentdex 4 years ago ⁺¹
Ubuntu for OS, OBS to record.
@myildizhan 4 years ago
@@sentdex thanks 👍🏻
@electricalcoder3025 4 years ago
Awesome, fantastic video dude, you are my inspiration to make tutorials in programming and new innovative projects... may god bless you...
@lucasb.2410 4 years ago
Amazing video *as always*, What do you think about Evolved Transformer Networks ? Google developed Meena on it apparently and seems very promising. Good luck and happy coding !
@atineshs 4 years ago
I have read the paper; it's a crazy model
@sentdex 4 years ago
Haven't looked into them yet, planning to probably after NNFS. We'll see :)
@archaywakodikar6734 4 years ago
love from india :3
great work and effort, i wanna be like you -
*could potentially make something that can be world changing*
"ahaha nice responses cool, hey guys its me!"
@maxhouseman3129 4 years ago
Question: wouldn't it be better for a score to use only multiplications and divisions, because of the impact of the single parameters?
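One way to read the question above: in a product, a single very low factor drags the whole score down, while in an average it is diluted by the others. A rough sketch under that reading; the factor values are invented and do not come from the video's scoring code.

```python
from math import prod


def additive_score(factors):
    # Average of the factors: one bad factor is softened by the good ones.
    return sum(factors) / len(factors)


def multiplicative_score(factors):
    # Product of the factors: one near-zero factor acts almost like a veto.
    return prod(factors)


# Hypothetical per-response factors in [0, 1], e.g. relevance, fluency, non-repetition.
good = [0.9, 0.8, 0.9]
one_bad = [0.9, 0.8, 0.05]

print(additive_score(good), multiplicative_score(good))
print(additive_score(one_bad), multiplicative_score(one_bad))
```

Whether the veto behaviour is desirable depends on how noisy each factor is; a single mis-measured factor can sink an otherwise good response.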
@vivek5147 4 years ago
Very cool bro
@oliverli9630 4 years ago
have you tried ChatterBot module but with much larger training data like those you have?
@5pellcast3r 4 years ago
omg ...!!! YAAASSS been waiting for this ... but holy shit u crazy with those 2 RTX 8000 .... meanwhile me out here using a GTX 1050
@umdbest001 4 years ago
love you sir......thanks a lot!!!
@ulissemini5492 4 years ago
pep8 is in tears
@atineshs 4 years ago
Once you finish experimentation, would you publish your ideas and experiment results?
@sentdex 4 years ago
Most likely, yeah
@akashtyagi7182 4 years ago
So libcublas does load successfully. After so many hours on a non-GPU system, I thought it was only there to throw a warning 🤷🏻‍♂️
@SeucheAchat 4 years ago
Thanks for this upload :)
@spyrosroum5986 4 years ago
When you want to make a new file in linux you don't have to open it with nano.
You can just do `touch filename.something` or even `sublime filename.something` though I'm not sure about the second
@sentdex 4 years ago
Oh right, I like to forget about touch, thanks!
@mohankrishna-vt8xd 4 years ago
Can't believe Sentdex didn't know General Kenobi :")
@sentdex 4 years ago ⁺¹
Never really got deep into star wars, my apologies :D
@morthim 4 years ago
1:05:45 the moment you fail a turing test.
@sovitrath4735 4 years ago
Just a quick question. Does your new book use NumPy, or everything from scratch using Python only?
@sentdex 4 years ago
Both. We first show how to do it in raw python, then make use of Numpy since raw python can get quite silly overall.
@bamitsmanas 4 years ago
I thought sentdex was so good he didn't need Stack....but we're all human
@sentdex 4 years ago
Just a regular person like you :D
@bamitsmanas 4 years ago
@@sentdex 👍👍I love your videos man
@wktodd 4 years ago
Just wondering if it would be possible to give the chat-bot a sense of 'self'? i.e. based on the answers it has previously delivered. It would make the bot much more convincing if its answers were consistent.
@atineshs 4 years ago
It is difficult to create chatbot model for Multi turn conversations, I would love to see how chatbots handle multi turn conversations in future
@andytang7725 4 years ago
Hi Sentdex,
Here are some recent papers that you may find helpful regarding non-greedy decoding.
- (nucleus / top-p sampling) arxiv.org/pdf/1904.09751.pdf
- (self-terminating training) arxiv.org/pdf/2002.02492.pdf
tl;dr: Beam search (along with similar maximization schemes) does NOT seem to work very well during decoding, provided that you've trained using maximum likelihood (MLE).
This is even true for extremely expressive generative models such as GPT-2, whose training was based on MLE and did not incorporate a penalty against repetition. The second paper recommends a different learning objective that incorporates such penalties and seems to work well for RNN-type models.
Instead, consider using either top-k sampling or nucleus sampling during decoding, as they are agnostic to model architecture (i.e., can be adapted to RNNs or transformers, and can be used regardless of the learning objective(s)). It is also easier to implement since you do not need to re-train on 400mil pairs using fancy objectives. They have shown great promise in recent packages like Microsoft's DialoGPT (dialog version of GPT-2), which was also trained on Reddit convos.
Thanks for your videos -- you make some great content.
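A minimal sketch of the top-k and nucleus (top-p) sampling mentioned above, written over a plain dict of next-token probabilities rather than any particular model's output. The function names and the toy distribution are assumptions for illustration, not code from the linked papers or from DialoGPT.

```python
import random


def top_k_filter(probs, k):
    """Keep the k most probable tokens and renormalize."""
    top = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    total = sum(p for _, p in top)
    return {tok: p / total for tok, p in top}


def top_p_filter(probs, p):
    """Keep the smallest set of most probable tokens whose total mass reaches p."""
    kept, mass = [], 0.0
    for tok, prob in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        kept.append((tok, prob))
        mass += prob
        if mass >= p:
            break
    return {tok: prob / mass for tok, prob in kept}


def sample(probs, rng=random):
    """Draw one token according to the given probabilities."""
    tokens = list(probs)
    return rng.choices(tokens, weights=[probs[t] for t in tokens], k=1)[0]


# Hypothetical next-token distribution at one decoding step.
next_token_probs = {"black": 0.5, "blue": 0.3, "green": 0.15, "dad": 0.05}

print(sample(top_k_filter(next_token_probs, 2)))
print(sample(top_p_filter(next_token_probs, 0.9)))
```

In a real decoder this filtering would run at every step on the model's softmax output; because it only touches the output distribution, it needs no retraining.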
@omeraiman2983 4 years ago
I think if it is a chat bot functioning like siri, you need to add a random function after getting the results, other than that, you will need to supervise some stuff. Just a humble opinion😁
@ahmedhassankotb8785 4 years ago
Can you explain MediaPipe hand tracking? I need it for my graduation project
@iLinked 4 years ago
17:50 general kenobi
@thedosiusdreamtwister1546 4 years ago
Are you still using the 2015 comment corpus, or have you updated the dataset?
@sentdex 4 years ago
Updated, using like 400 million pairs now. He tends to think it's 2019 or 2018 ATM.
@soufianefariss 4 years ago
We are here. From notifications!
@sentdex 4 years ago ⁺¹
Welcome!
@gomenaros 4 years ago
How are you coping with the lockdown harrison? Things must seem very ordinary.
@cedric1731 4 years ago
For translation try deepL... It is amazing!
@datmesay 4 years ago
@sentdex am I the only one that can’t launch the video from an iPhone ? I am locked in the « something went wrong tap to retry » loop ! It’s been like this for the last 2 days !
@adeolaogunleye7965 4 years ago
Hello can I define a temporary dict and check if the word is already there for repetition?
@farookjintha9436 4 years ago
I was just waiting for this. I got stuck on my chatbot project using TF. I took a course on Udemy to build a chatbot; the author did it in TF1, whereas I would like to do it in TF2 with an attention mechanism. But I continued with the course, and at one point I got an error like Module not found: he used TF 1.4, and I couldn't install that version since it is no longer available, not even to mimic what he's doing. So what to do? How can I build a chatbot using a Seq2Seq model in TF2 with attention mechanisms? (Note: not an NMT model to translate, but a chatbot model that responds to user queries.)
@josephrejive4081 4 years ago
I've trained a chatbot on 3M pairs, but my responses are nowhere as good as the ones yours outputs. How many pairs did you train on?
@techknowledge6808 3 years ago
How can I train my chatbot with multilanguage???
@wazed9468 4 years ago
Every video of you I saw includes spamming the console. I lowkey enjoy its suffering
@mrjt6404 3 years ago
Why "Brave" browser against chrome/FF ???
@alexandrodisla6285 4 years ago
Dex. Please show us how to bring machine learning inside an application. Any application
@wadyn95 4 years ago
Do you like Sentdex R&D video style?
> Yes, please do more!
> Hello General Sentdex
> Yes, this is exactly what I'm looking for
> Yes, keep making them!
> Yes, sorry for my English level
@akashsmaran3485 4 years ago
Is there any chance of open sourcing Daniel's ensemble code? :)
@sentdex 4 years ago ⁺¹
There's a chance. No ETA on that though :D
@aladaris 4 years ago ⁺³
28:10 do you even touch? ;P
touch test.py
@roostertechchan 4 years ago
Do you work on a standing desk?
@sentdex 4 years ago ⁺¹
Yes
@Q_QQ_Q 4 years ago
it says 2 minutes ago uploaded but some comments says 1 day ago .
@ekrem_dincel 4 years ago
They are channel members. Video is probably uploaded yesterday but it was private.
@tamizhazhagan-jaishreekris2199 4 years ago
I like sentdex 💓☺
@ekrem_dincel 4 years ago
Why you prefer Brave?
@sentdex 4 years ago
Good browser where my data isn't the business model.
@ekrem_dincel 4 years ago
@@sentdex Yes I agree, I use it too!
@ishfaqkhawaja6394 4 years ago
Okay, idk if this is the right question to ask here, but anyway I'm going to ask:
I have Ubuntu 18.04 installed on my PC but can't install the driver for my NVIDIA GeForce MX150 GPU. When I run a command like nvidia-smi, it shows "NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running". Help me if you can, please @sentdex
@Mrsparky492 4 years ago
I think there are a ton of STEM related responses because those kinds of people never shut up about it on Reddit! They don't necessarily have a disproportionate representation in Reddit users. Just a disproportionate representation in responses.
@cartoonchan182 4 years ago
Hey, thanks a lot for the video... I have a question: with a huge dataset, how do you train the model? I'm using a 1050 Ti and my GPU always crashes when either the dataset or the NN is big, so can you tell me what you would do in that case? If you're using Colab, can you do a tutorial on that and how to access files from it?
@atineshs 4 years ago
It depends on the model, but you can try reducing the batch size
@MarsLanding91 4 years ago
What does R&D stand for in this case?
@sentdex 4 years ago
Research and development
@aman.upadhyay5798 4 years ago
I want to implement a self-learning, generative chatbot
for HR policies; I have PDF files of the HR policies.
How can I implement it? Please share links and ideas.
I don't have data in question-answer form.
If I upload any document, it should automatically train on that data and give replies.
@aman.upadhyay5798 4 years ago
Please reply by email
@LapisGarter 4 years ago
Are you a genius
@sentdex 4 years ago
Doubtful.
