Email Spam Classifier | SMS Spam Classifier | End to End Project | Heroku Deployment

CampusX

350K views

Published: Jan 2, 2025

Comments • 591

@kausikkar2587 10 months ago ⁺³³
I find your channel and Krish Naik's to be the ones that actually help us build industry-grade ML projects. The rest are just calling model.fit without any explanation.
@aaquibtayyabi 10 months ago ⁺⁵
Also, it's insane how similar they both look lol
@anirudhrapratap3005 1 month ago
Are these projects good for putting on a resume? Please reply.
@radhekrashna2148 1 year ago ⁺¹⁰
Well explained, with many topics covered in a single 90-minute video.
It's one of the best channels to learn data science free of cost, with very good content.
@muhammadtayyabtahirqureshi7186 1 year ago ⁺¹²
Many thanks for this amazing video... It may be a single video, but it covers all the steps involved in machine learning development: data cleaning, EDA, pre-processing, model training, optimization, and good practices. Everything has been packed into just one video. ❤
@aakashgupta50 3 years ago ⁺¹⁰⁰
One day this channel is going to have 1M subscribers ❤🔥
@tusharbedse9523 3 years ago ⁺¹
Within 1 year, I guess
@dheerajrajput7058 2 years ago ⁺¹
1B
@codingworld6151 1 year ago
We need a recommender system from scratch
@Nothing-iu1uy 11 months ago
Ofc, can't wait for it
@rudrakshpotghan801 1 month ago
!One day
@handhikayp 3 years ago ⁺¹⁵
Cool, this is the most complete tutorial I have seen, from the beginning until deployment.
Keep up the great work sir!
@JaySharma-ob8nw 3 years ago ⁺¹⁴
Amazing content!! I don't understand why this channel has such a small reach; this content is better than any paid course.
@junaidyousaf4602 3 years ago
Are you an NLP expert?
@TechnicalDrMusic 10 months ago ⁺⁶
100% easy to understand, really enjoyed it 😅 One small thing is that in the middle of the video you cut some of the analysis, but it was totally fine. Loved it, now I've got my project to make.
@jooyeonsimp 5 months ago
Hi, can you help? df.corr() isn't working for me; it raises a ValueError: could not convert string to float.
@Kaushik-f9h 1 month ago
Brother, could you share your GitHub repo for this project? I am facing this issue: NotFittedError: This TfidfTransformer instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator
@sachinjain458 4 months ago
I am so glad that I found this channel. I recommend it to many of my friends. Amazing content
@Malayalam_learner 2 months ago
Thanks to this channel I'm getting exposure to projects. Hope by the end of this playlist I'll make one.
@ahsannaseer2775 2 years ago ⁺⁹
Awesome method of teaching. Learned a lot from you. Very insightful content. Keep up this good work.
@hussainktk8411 1 year ago ⁺¹
Thumbs up for you sir. All the other videos stop at model training. You have shown the full workflow. Thanks!
@robinabraham1801 2 years ago ⁺²
You are the best person I have ever come across. Along with great feature engineering, I got to learn many things about model deployment too.
@mcchickencrispy 2 years ago ⁺¹⁹
Thanks for the walkthrough! Just a small remark: when defining the word-cloud object, you can set the collocations parameter to False to avoid duplicates in the final plot.
@campusx-official 2 years ago ⁺⁵
Thanks, will use it the next time
@Human12358 1 year ago ⁺¹
@@campusx-official but how do we extract data from mail into Jupyter?
@DashingData66666 1 year ago
Sir, in which language are you making this project?
@@campusx-official
@abhinabachakraborty5202 1 year ago
@@Human12358 download the file to your system, then use read_csv or read_excel to load it into your dataframe
@TanishkaSharma-e8z 5 months ago ⁺¹
I am facing an error reading the CSV file, can you help? UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 606-607: invalid continuation byte
@LockLad 1 year ago
I haven't even started watching your videos, but I can feel it in my heart that I've landed in the right place
@rashidsiddiqui4502 1 year ago ⁺⁶
Thank you very much, Nitish sir. 😊
Very helpful playlist. Thank you for helping us learn ML, DL, and NLP with such interest and enthusiasm.
@TechnicalDrMusic 10 months ago ⁺¹
Wow! I finally made one project, sir. Thank you so, so much, it really paid off for me.
@2076TanmoyAnik 3 months ago
What a tutorial... clean & refreshing... I appreciate your work... respect💜
@Kaushik-f9h 1 month ago
Brother, could you share your GitHub project link? I am facing this issue: NotFittedError: This TfidfTransformer instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator
@chiradipbiswas7339 1 year ago ⁺¹
Awesome teaching!! Not just using libraries for text conversion, but helping us understand the process under the hood and building the functions and conditions from the ground up. Very nice video
@SachinModi9 2 years ago ⁺¹
I'm loving it!! Just watching it got me motivated :)
@SurajYadav-xj1yf 2 years ago ⁺¹
Thank you for uploading this thorough end-to-end project... this really helped me.
@manujkumarjoshi9342 1 year ago
The best part is you also cover app development and deployment.
@manujkumarjoshi9342 1 year ago
Awesome method of teaching. Learned a lot from you. Very insightful content.
@atul7965 1 year ago
Thanks, sir, for such amazing tutorials on machine learning projects. The concepts you taught were crystal clear.
@ahmadaliahmadali8670 2 years ago
Good, nice, sir. Your teaching method is the best for beginners.
@BhuvanaMalla 9 months ago ⁺³
Why didn't you show us that you dropped the 'text' column in order to make corr() work in the EDA stage? It cost me around 30 minutes to figure out.
@adwaithpradeep5677 1 year ago ⁺⁵
12:28 Build an email spam classifier website with data cleaning, EDA, feature engineering, model building, evaluation, and deployment.
24:56 EDA is performed to understand the distribution of spam and ham messages and analyze the number of characters, words, and sentences used in the SMS.
37:24 Data preprocessing steps for text data
49:52 The main problem was the error caused by a silly mistake in the code.
1:02:20 Performing Exploratory Data Analysis (EDA) on spam and ham messages
1:14:48 Naive Bayes with TF-IDF vectorization and max features of 3000 performs best
1:27:16 The best performing model for spam classification is Multinomial Naive Bayes with a precision of 99.1% and accuracy of 98.1%.
1:39:39 Built and deployed an email spam classifier using NLTK and Streamlit
Crafted by Merlin AI.
@TanishkaSharma-e8z 5 months ago
I am facing an error reading the CSV file, can you help? UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 606-607: invalid continuation byte
@ashishmalhotra2230 11 months ago
It's difficult to match your depth of knowledge and teaching style. So happy to have found this channel. Keep up the great work!
@letstry2854 1 year ago ⁺¹
Wow!! You're doing an amazing job, sir!! It helps us a lot. Keep uploading such end-to-end videos in the days to come too!!
@neeraj.kumar.1 2 years ago ⁺²
Finally completed this project with a lot of learning... thank you boss 🙌
@rinalzankar2812 2 years ago ⁺¹
I could not load the dataset. Can you help?
@neeraj.kumar.1 2 years ago
@@rinalzankar2812 hey Rinal,
Did you try to download the dataset from Kaggle?
@akuljoshi7943 2 years ago
@@neeraj.kumar.1 did you deploy your website on Heroku?
@neeraj.kumar.1 2 years ago
@@akuljoshi7943
Yeah
@abhishekshukla5747 1 year ago
@@neeraj.kumar.1 can you please send the link to your website?
@abinashnath2481 2 years ago ⁺¹
Thank you for the awesome content. Your method of teaching is really incredible
@Abhinav00788 5 months ago
Craziest EDA I've ever seen ❤❤
@ashwinidikonda2146 2 years ago ⁺²
Great job sir👏👌👌👌 amazing explanation
@poojithareddy8777 1 year ago
Is he running every cell before going to the next cell?
@poojithareddy8777 1 year ago
How is he going to the next cell?
@meenalsinghthakur8705 5 months ago ⁺⁵
NotFittedError: This MultinomialNB instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator. I am getting this error. How do I resolve it?
@mallikagupta7643 5 months ago
@@meenalthakur1440 hey, I am getting the same error but for TF-IDF! Please help!
@Kaushik-f9h 1 month ago
Same, I am also facing this issue
@Kaushik-f9h 1 month ago
Were you able to solve this error?
@DakshBhardwaj-j4n 12 days ago
You have to first create an object of the MultinomialNB class, like
base_model = MultinomialNB()
Then you can pass this model to another multiclass classifier, like
model = OneVsOneClassifier(base_model)
@pankajbeldar9799 2 years ago ⁺¹
You deserve 100 billion subscribers
@karachilawyer 2 years ago
You Are The Best On YouTube ❤️❤️❤️
@hridhanpatel3987 4 months ago
Sir, at 35:18, why did you copy y to text instead of simply using y itself?
@sonamatulya 7 months ago
Fabulous work Guru ji🙏
@KnockEdu 3 years ago ⁺³
Brother, I cannot count the number of characters in each message because of an error: "object of type int has no len()". Do you have any solution to this?
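One possible cause of this error is that some rows in the text column are not strings (for example, a message consisting only of digits that was parsed as an integer). The sketch below assumes a DataFrame with a `text` column as in the video; the sample rows and the `num_characters` column name are made up for illustration.

```python
import pandas as pd

# Hypothetical sample: the second "message" was parsed as an integer.
df = pd.DataFrame({"text": ["Free entry in 2 a wkly comp", 12345, "Ok lar..."]})

# len() fails on an int, so cast every value to str before measuring it.
df["num_characters"] = df["text"].astype(str).apply(len)

print(df["num_characters"].tolist())
```

If the error persists, checking `df["text"].map(type).value_counts()` may show which non-string types are present.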
@krishnakanthmacherla4431 2 years ago ⁺⁴
Hi Nitish,
Why are we fitting CountVectorizer on the data before splitting it? Doesn't this lead to a data leakage problem, thereby affecting our metrics?
Please answer this.
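One common way to address the leakage concern raised here is to split first and let a scikit-learn Pipeline fit the vectorizer on the training texts only. This is a minimal sketch, not the video's code: the tiny corpus and labels are invented, and TfidfVectorizer with max_features=3000 plus MultinomialNB is used only because the thread mentions that combination.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

# Tiny made-up corpus, for illustration only (1 = spam, 0 = ham).
texts = [
    "win a free prize now", "free cash call now",
    "claim your free reward", "urgent winner call today",
    "are we meeting for lunch", "see you at home tonight",
    "can you send the notes", "happy birthday my friend",
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

# Split the raw texts BEFORE any vectorizer sees them.
X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, random_state=2, stratify=labels
)

# fit() learns the vocabulary from X_train only;
# predict() merely calls transform() on the unseen texts.
clf = Pipeline([
    ("tfidf", TfidfVectorizer(max_features=3000)),
    ("nb", MultinomialNB()),
])
clf.fit(X_train, y_train)
preds = clf.predict(X_test)
print(preds)
```

A side benefit of this design is that the whole pipeline can be pickled as one object, so the app cannot load a vectorizer and a model that were fitted separately.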
@shu03bh 4 months ago ⁺¹
At 53:30 it's showing the same bar graphs for both the ham and spam corpus
@skywalker9390 8 days ago
Make sure that in the sns plot, target == 0
@SupratimMukherjee-st1pi 7 days ago
Thank you Nitish sir.
@aryannijhawan8448 2 years ago ⁺²
18:25 Can I use "df['num_words']=df['text'].apply(lambda x: len(x.split()))" instead? I mean, what is the difference between split() and word_tokenize?
@yashgourav 2 years ago ⁺¹
split() uses whitespace as its default delimiter. There may be some words separated by a "," (comma) or some other character. In those scenarios, word_tokenize can be used.
For further details, you can check how nltk works under the hood in its source code.
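The difference described above can be illustrated without NLTK. In this sketch the regular expression is only a simplified stand-in for nltk's word_tokenize (it is not NLTK's actual algorithm), and the sample message is made up; the point is that a tokenizer that separates punctuation yields a different word count than str.split().

```python
import re

message = "Free entry, call now!"

# str.split() only breaks on whitespace, so punctuation stays glued to words.
whitespace_tokens = message.split()

# Simplified stand-in tokenizer: words and punctuation marks become separate tokens.
regex_tokens = re.findall(r"\w+|[^\w\s]", message)

print(len(whitespace_tokens), whitespace_tokens)
print(len(regex_tokens), regex_tokens)
```

Either approach gives a usable num_words feature; the counts simply measure slightly different things.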
@sampannamishra1178 2 years ago ⁺²
Amazing lecture. Learned a lot from your project explanation!
@jooyeonsimp 5 months ago ⁺¹
Please suggest a few basics to work through before doing this project. A roadmap or something.
@HemaSanthoshiAkula 2 months ago ⁺²
The project was awesome and I thoroughly enjoyed the explanation, but one doubt: if I don't want to create a website and need the output in the notebook itself, what should I do? Appreciate your help.
@campusx-official 2 months ago
You can directly call the prediction function in your notebook.
@Kaushik-f9h 1 month ago
Brother, could you share the GitHub link of this project? It will help me. I am facing this issue: NotFittedError: This TfidfTransformer instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator
@harisumanth 3 years ago ⁺⁷
Hi, I followed the video and used TF-IDF with the default settings (not max_features=3000). While predicting for all sorts of inputs, I am getting "Not Spam". What might be the reason?
@akshaybhat3556 3 years ago
@@gokulakannanb4103 Have you dealt with the data imbalance?
@srujanagundam2520 2 years ago
Were you able to solve it? Because I am getting the same issue too
@nikithajosstephen7754 2 months ago
Were you able to solve it?
@sulthanashaik1980 3 months ago
Very good explanation, thank you so much
@himanshuyadav126 6 months ago ⁺²
Sir, after running the ipynb file from your GitHub, the model gets changed and it does not run. After clicking the Predict button, it shows "NotFittedError: This MultinomialNB instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator." Help me fix this.
When cloning your code from GitHub and running Streamlit, it shows "NotFittedError: This TfidfTransformer instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator."
@mallikagupta7643 5 months ago
Hey, did it get resolved?
@LITHISHLOCHANCKPSGiTECH-qj1jw 2 months ago
Brother, I don't get it, but I'm facing the issue you have mentioned.
Can you tell me how to resolve this?
@ChandanKumar-ku7wf 1 month ago
Yeah, facing the same error
@humerashaikh3618 3 years ago ⁺²
Thank you so much. Very good content sir. I'll definitely try this project.
@gokulakannanb4103 3 years ago
It worked?
@vijaykiran3404 3 years ago
@@gokulakannanb4103 Not for me; the model file being created is only 1 KB in size, and thus it's giving an error. Something is missing from the original code as well.
@kamathprajna 2 years ago
@@vijaykiran3404 omg
@anuradhabalasubramanian9845 1 year ago ⁺¹
As usual, your explanation is magical. Keep up your great work, sir! Thank you so much
@MeeraDevi-mm4cr 3 years ago ⁺¹
Hi, please create more videos like this, but also keep in mind more industry-relevant projects. Please🙏
@vanarraja1940 3 years ago
Loved the way you orate :)
@Kaushik-f9h 1 month ago ⁺¹
Facing this issue when I click on the Predict button, please help: NotFittedError: This TfidfTransformer instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator
@Salsalbull 1 month ago
Same, did you fix it? Please help me
@patnalapravallika6770 1 year ago ⁺⁴
I couldn't get the heatmap; I'm getting this error: could not convert string to float: 'Go until jurong point, crazy.. Available only in bugis n great world la e buffet... Cine there got amore wat...'
@Kalyan1143 9 months ago ⁺²
Same here, I am facing that error.
Could you resolve this error in the code?
Please share the fixed code.
Please reply as soon as possible😢😢
@StudentLinker 5 months ago
Because sir is applying it to num_char, num_word, and num_sent, while we are using all 4 columns @@Kalyan1143
@Malayalam_learner 2 months ago
Same, I did not understand the heatmap
@anjanatoppo9728 1 year ago
Surprisingly this really worked out 💯
@divyamishra4040 1 year ago
Thank you so much sir, I finally completed it...
@sohamjiddewar4059 5 months ago ⁺²
Hi CampusX, import seaborn as sns runs fine, but when I run histplot it plots the graph and shows a FutureWarning about the use_inf_as_na option. Please tell me how to remove it. Also, when running the heatmap it gives the error "could not convert string to float: 'Go until jurong point, crazy.. Available only in bugis n great world la e buffet... Cine there got amore wat...' "
@SpaceShip933 5 months ago ⁺¹
Same problem
@ritilranjan7369 3 months ago ⁺¹
Exactly bro, no one in the comments is talking about it. The fact is that even the code he himself uploaded on GitHub suffers from the same issue. It's really magic how he escaped it
@titan_471 2 months ago
@@ritilranjan7369 can you give the timestamp?
@palakpopat2069 18 days ago
numeric_df = df.select_dtypes(include=['number'])
sns.heatmap(numeric_df.corr(),annot=True)
Just add these lines after generating the pairplot.
It gives the error because there is text present in the df; we can plot the heatmap using only numeric columns.
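The fix above can be checked on a small frame. Since pandas 2.0, DataFrame.corr() no longer drops non-numeric columns by default, which is a likely reason the same notebook worked in the video but fails on newer installs. The sketch below uses made-up rows and hypothetical column names; the resulting `corr` frame is what would be passed to sns.heatmap.

```python
import pandas as pd

# Made-up rows mimicking the notebook's frame: one text column plus numeric features.
df = pd.DataFrame({
    "target": [0, 1, 0, 1],
    "text": ["Go until jurong point", "Free entry", "Ok lar", "WINNER!!"],
    "num_characters": [21, 10, 6, 8],
    "num_words": [4, 2, 2, 1],
})

# Keep only numeric columns so corr() never sees the raw message text.
numeric_df = df.select_dtypes(include="number")
corr = numeric_df.corr()

print(corr.columns.tolist())
```

On pandas 1.5 or newer, `df.corr(numeric_only=True)` should give the same result in one call.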
@nigiledwin4784 3 years ago
Awesome teaching👏👏
@SpaceShip933 5 months ago ⁺¹
Instead of Heroku we can use Render, but what should the Procfile, setup.sh, .gitignore, and requirements.txt be?
@SandeepSharma-yi1jt 7 months ago ⁺¹
When we try to find the correlation, it shows "could not convert string to float". How can I overcome that?
@ameybhuvad8461 3 months ago
Sir, how do I create a directory? 4:09
@abdulqadar9580 2 years ago ⁺¹
Thank you Sir for your great efforts
@lisamathur9206 3 months ago
Hello, thank you for the videos. But I have a question. Do we really use ML like this in a professional job?? Not using any particular model, like naive Bayes, or maybe simply using CountVectorizer to get the BOW, and it can remove the stop words as well. Not sure about stemming, maybe we can do it on the do like we are doing it now. So we don't use all that and take this do-it-all-manually approach?
I am not doubting your approach, but I thought that's how we do it in professional projects. Just want to know out of curiosity what's correct so that I can lean toward that approach more. Thanks
@raj4624 3 years ago ⁺³
Top notch ..🔥🔥Quality of Project.. Tysm Professor
@byotikram4495 1 year ago ⁺³
Hi sir, the precision score is calculated as TP / (TP + FP). But if we calculate it from the confusion matrix, the precision score that we get is different from the one returned by the precision_score function. Why is that, sir? This happens for all three NB cases (gnb, mnb, and bnb).
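One possible reason for such a mismatch is reading the confusion matrix in the wrong orientation. scikit-learn's confusion_matrix puts true labels in rows and predicted labels in columns, so for labels [0, 1] the layout is [[TN, FP], [FN, TP]], and precision for spam uses the second column. The sketch below rebuilds that layout by hand on made-up labels.

```python
# Made-up labels for illustration (1 = spam, 0 = ham).
y_true = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]
y_pred = [0, 0, 1, 0, 1, 1, 0, 0, 1, 0]

# Build the matrix in scikit-learn's layout: rows = true class, columns = predicted class.
cm = [[0, 0], [0, 0]]
for t, p in zip(y_true, y_pred):
    cm[t][p] += 1

tn, fp = cm[0]
fn, tp = cm[1]

precision = tp / (tp + fp)  # uses the "predicted spam" column
recall = tp / (tp + fn)     # uses the "true spam" row

print(cm, precision, recall)
```

Dividing TP by the row sum instead of the column sum gives recall, which would explain a value that differs from precision_score.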
@jooyeonsimp 5 months ago
Hi, can you help? df.corr() isn't working for me; it raises a ValueError: could not convert string to float.
@harshitdhiman6748 2 years ago ⁺¹
Thank you so much, it really helped a lot.
@antukhan5592 2 years ago ⁺²⁴
For all those who are getting this error:
NotFittedError: This MultinomialNB instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator.
You need to put the line below after voting.fit(X_train,y_train):
mnb.fit(X_train,y_train)
Then re-run to regenerate model.pkl and vectorizer.pkl.
Or
you can also run gnb, mnb, and bnb.
Thank you
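A likely explanation for why this fix works: VotingClassifier fits clones of the estimators it is given, so the original mnb object stays unfitted unless it is fitted directly. The sketch below demonstrates this on a made-up corpus; the variable names mirror the thread, and the in-memory pickle round trip stands in for writing and reading model.pkl and vectorizer.pkl.

```python
import pickle

from sklearn.ensemble import VotingClassifier
from sklearn.exceptions import NotFittedError
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import BernoulliNB, MultinomialNB
from sklearn.utils.validation import check_is_fitted

# Tiny made-up corpus (1 = spam, 0 = ham).
texts = ["win a free prize now", "free cash call now",
         "see you at lunch", "send me the notes please"]
labels = [1, 1, 0, 0]

tfidf = TfidfVectorizer(max_features=3000)
X = tfidf.fit_transform(texts)

mnb = MultinomialNB()
bnb = BernoulliNB()
voting = VotingClassifier(estimators=[("mnb", mnb), ("bnb", bnb)], voting="soft")
voting.fit(X, labels)

# The ensemble trained internal clones, so the original mnb is still unfitted here.
try:
    check_is_fitted(mnb)
    fitted_before = True
except NotFittedError:
    fitted_before = False

mnb.fit(X, labels)  # fit the exact object that will be pickled

# Round trip through pickle, as the app does with its saved files.
model = pickle.loads(pickle.dumps(mnb))
vectorizer = pickle.loads(pickle.dumps(tfidf))
prediction = model.predict(vectorizer.transform(["win a free prize now"]))
print(fitted_before, prediction)
```

The same reasoning applies to the vectorizer: the object that gets pickled must be the one on which fit or fit_transform was called.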
@rjabhi_31 1 year ago
I am getting another error.
Can you please advise me on how to solve it?
NotFittedError: The TF-IDF vectorizer is not fitted
@swaragupta7932 1 year ago ⁺¹
Thank you. I spent 2-3 hours trying to solve this error, but your code magically helped me run my model.
@antukhan5592 1 year ago
@@swaragupta7932 cheers bro
@atharvatirkhunde4517 1 year ago
Thanks bro for solving the error
@antukhan5592 1 year ago
@@atharvatirkhunde4517 ty
@moni-q4l 2 years ago
Thanks a lottt...
Explained very clearly.
@shahnawazalam2495 1 year ago ⁺⁴
"NotFittedError: This MultinomialNB instance is not fitted yet. Call 'fit' with appropriate arguments before using this estimator."
This is the error I get when I click the Predict button. Has anyone resolved it?
@KartikMongia 8 months ago
Did you get any answer?
@ParthKakarwar-b7j 6 months ago
I got the same error. Did you resolve it? Please help....🤒
@abhishekbhardwaj6795 3 months ago
Same problem... any solution?
@shahnawazalam2495 3 months ago
@@abhishekbhardwaj6795 naa bro
@Sreyanskumar-v2m 5 months ago ⁺¹
It's showing the error "'list' object has no attribute 'transform'" while predicting spam or not. Can anyone help?
@wellbell23 1 year ago ⁺²
Following the same process, but after prediction it is only showing "Not Spam"
@mohaiminrahat4974 3 years ago ⁺¹
Thank you sir, it was really helpful
@tech_lec 2 years ago
Thank you, sir.
It was a great video and helped me a lot.
Many thanks ❤❤
@nikitapatil3019 5 days ago
Sir, which algorithm did you use?
@muhammadwaqasakram3018 1 year ago
Great 👍.. amazing video
@remoanil8532 1 year ago
Thanks for the video. As per my understanding, using a scaler before train_test_split will introduce data leakage; it is better to apply scaling after train_test_split.
@WaqasAhmad-0 2 years ago ⁺⁴
Hi, thanks for the detailed video, but I don't know why I am getting an application error whenever I try to open the Heroku link. Even your working demo link is giving the same error.
@Optimus_Gaming07 2 years ago
Same issue...
@zeinabshahraki5529 2 years ago
Hi, I have the same issue even though it seems to have deployed successfully. Could you solve the issue?
@nishitkashyap8748 11 months ago
While vectorizing the text to define X, it gives an error saying 'list' object has no attribute 'lower'.
@Kalyan1143 9 months ago ⁺³
I could not get the heatmap; I'm getting this error:
ValueError: could not convert string to float: 'Go until jurong point, crazy.. Available only in bugis n great world la e buffet... Cine there got amore wat...'
Could anyone please reply with how to resolve this error? 😊😊
@CodewithAbhi03 5 months ago ⁺²
Have you found a solution to this issue?
@Kalyan1143 5 months ago ⁺²
@@CodewithAbhi03 no
@mixupthings 2 years ago
You are the best, sir!! respect ++
@ViniPandla 3 months ago
Sir, I'm facing an error at the heatmap step saying the string data type is not supported in the target column. When I use include int64 it works, but then the target column is missing because strings are not included.
@siyasuryawanshi495 1 year ago
This works so well! Thank youuuuu
@neelesh621 1 year ago ⁺¹
One doubt: for the voting classifier and stacking classifier, are the X_train, y_train, X_test, y_test you are using the ones built with modified parameters like max_features=3000, or the normal ones without any modification?
@badalrathod2435 2 years ago
Great! Your teaching helps us understand
@AdityaSharma-vs5dl 1 year ago
Sir, you are the best, please keep it up. Can you make a video about internships for ML?
@Riya-zb1iz 1 year ago ⁺⁴
Hi Sir,
Firstly, thanks for this amazing content!
Secondly, I had a question: why did we use LabelEncoder here? Shouldn't we use OneHotEncoder instead?
@ok0855 1 year ago ⁺¹
As the name suggests, this is for labeling the data (here spam and ham are two labels, not sentences), whereas one-hot encoding is used to convert the entire sentence.
@muhammadtayyabtahirqureshi7186 1 year ago ⁺¹
LabelEncoder is used to encode output labels, while one-hot encoding is used to encode input features. That's the reason.
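To make the point about target labels concrete, here is a minimal sketch of LabelEncoder applied to a made-up list of ham/spam labels. With only two classes, a single 0/1 column is all a classifier needs as its target, whereas one-hot encoding would produce two redundant columns.

```python
from sklearn.preprocessing import LabelEncoder

# Made-up target column.
labels = ["ham", "spam", "ham", "ham", "spam"]

encoder = LabelEncoder()
encoded = encoder.fit_transform(labels)

# classes_ is sorted alphabetically, so "ham" maps to 0 and "spam" to 1.
print(list(encoder.classes_), encoded.tolist())
```

`encoder.inverse_transform` maps predictions back to the original strings if needed.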
@aiwithvanshikaa 1 year ago ⁺²
UnicodeDecodeError: 'utf-8' codec can't decode bytes in position 606-607: invalid continuation byte
Sir, I am getting the above error while reading the CSV file. Please help with this, sir. I downloaded the dataset from the Kaggle link you gave, but I am getting this error.
Kindly provide the solution.
@nikushekhawat5712 1 year ago
Change the column names 'v1' and 'v2' in the CSV file....
@kartikshrivastava582 1 year ago ⁺²
df = pd.read_csv('spam.csv', encoding = "ISO-8859-1")
@raazrobotics 1 year ago ⁺¹
pd.read_csv("spam.csv", encoding =('ISO-8859-1'), low_memory =False)
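The encoding fix suggested above can be reproduced without the actual file. In this sketch the bytes are a made-up stand-in for spam.csv, containing a "£" stored as the single byte 0xA3, which is valid ISO-8859-1 but not valid UTF-8; read_csv then fails with its default encoding and succeeds once the encoding is given explicitly.

```python
import io

import pandas as pd

# Made-up bytes standing in for spam.csv, encoded as ISO-8859-1.
raw = "v1,v2\nham,Ok lar\nspam,Win £1000 now\n".encode("ISO-8859-1")

try:
    pd.read_csv(io.BytesIO(raw))  # default encoding is UTF-8
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False

# Passing the file's real encoding lets the same bytes parse cleanly.
df = pd.read_csv(io.BytesIO(raw), encoding="ISO-8859-1")

print(utf8_ok, df.shape)
```

Which encoding is right depends on how the file was saved; ISO-8859-1 is simply the one the commenters report working for this dataset.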
@aliyagilani3468 5 months ago
Appreciated (y) very well done
@utkarshchalsey241 2 years ago ⁺¹
I did the same as you did, and I have double-checked as well, but I am still getting this error:
"y should be a 1d array, got an array of shape (1034, 6708) instead."
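The reported shape (roughly a 20% test slice by a vocabulary-sized width) suggests that a 2D feature matrix ended up where y was expected. One common way this happens, offered here only as a guess, is unpacking the result of train_test_split in the wrong order. The sketch below uses a made-up corpus to show the order the function returns.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.model_selection import train_test_split

# Made-up corpus (1 = spam, 0 = ham).
texts = [
    "win a free prize", "free cash now", "claim your reward", "urgent call now",
    "winner winner", "see you at lunch", "send the notes", "happy birthday",
    "call me later", "are you home",
]
y = [1, 1, 1, 1, 1, 0, 0, 0, 0, 0]

X = CountVectorizer().fit_transform(texts).toarray()

# train_test_split returns: X_train, X_test, y_train, y_test (in that order).
# Writing "X_train, y_train, X_test, y_test = ..." would silently bind y_train to X_test.
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=2)

print(X_train.shape, X_test.shape, len(y_train), len(y_test))
```

Printing the shapes right after the split, as done here, is a quick way to confirm that y_train is one-dimensional before calling fit.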
@easyscience2893 8 months ago
Hi, I have a question. Why is Heroku an option for deployment? Kindly discuss more options for deployment, and also tell us how to deliver the project: in which form do we deliver it? In the end, great content sir, much appreciated
@Kiran23456 1 year ago
Thank you so much brother 😊🙏
@rafaykhawjikzai1243 2 days ago
Is the code in the cut scenes compulsory or not? I am confused
@musafir8638 16 days ago
Man, I made this whole project, but when I tried inputting spam messages from my email and SMS, it shows them as Not Spam. I don't know if it lacks accuracy or what the issue is, but I tried a lot of messages and for most of them it shows Not Spam.
@chaithrashreecs35 6 months ago
Machine learning is a vast topic. Which machine learning technique is actually used for the classification here?
@dafnexxl 6 months ago
Instead of entering the emails manually, if I connect to my email and have it read my own emails to compare with the dataset, how can I do this? I urgently need help.
@prashanthdeva2770 3 years ago
Thank you for sharing this👍👍
@HustleWithShubham01 10 months ago
Please bring a crash course on machine learning, sir, it's really needed
@pranjaldangwal1425 6 months ago
I've already defined df as in this video, but it is giving me NameError: df is not defined. Sometimes it runs, but mostly it gives me the error.
@uditpandey2573 2 years ago
In Streamlit, when you're taking the user's message, how is it getting converted to vectors? That is not a document of messages. When I tried it out on a single string, I got an error saying "iterable over raw documents expected, string object received".
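The error quoted here arises because scikit-learn vectorizers expect an iterable of documents rather than one bare string. A minimal sketch, using a made-up corpus and message, is to wrap the single message in a list before calling transform.

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Made-up training corpus.
corpus = ["free prize call now", "see you at lunch", "win cash now"]
tfidf = TfidfVectorizer()
tfidf.fit(corpus)

message = "call now to win"

# transform() expects an iterable of documents, so pass a one-element list.
vector = tfidf.transform([message])

print(vector.shape)
```

The result has one row per document, so a single message yields a matrix with one row that can be passed straight to model.predict.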
@alphaadil4028 3 years ago ⁺¹
Sir, I have followed all the steps but I got an error:
RuntimeWarning: 'nltk.downloader' found in sys.modules after import of package 'nltk', but prior to execution of 'nltk.downloader
Please guide me... I have tried both Flask and Streamlit; the same error occurs.
@areeshaanjum3396 2 years ago ⁺¹
Were you able to solve the error? I am getting the same one while deploying on Heroku... and the app shows a FileNotFoundError for the vectorizer and model files.
@alphaadil4028 2 years ago
@@areeshaanjum3396 No, I was unable to debug it.
@surajJoshiFilms 2 years ago
If you guys find the solution, please share it with me...
