Introduction to NLP | Bag of Words Model

Поделиться
HTML-код
  • Опубликовано: 4 ноя 2024

Комментарии • 61

  • @অচিনমানুষ-জ৪খ

    I am Bengali as well. I am glad that you are doing this and feeling proud of you brother.

  • @balipavankalyan5008
    @balipavankalyan5008 2 года назад +2

    Rocked in very less time. This can only come with very keen knowledge.

  • @fahdciwan8709
    @fahdciwan8709 4 года назад +9

    excellent vids brother ... keep going. You've compressed months & months of learning into a easy-to-learn videos ... please don't stop

    • @NormalizedNerd
      @NormalizedNerd  4 года назад +1

      Thanks a lot for appreciating the effort :D

  • @dimaelzein9908
    @dimaelzein9908 3 года назад +1

    I like the video. Simple and straight to the point. Keep it up!

  • @shivanineeli5392
    @shivanineeli5392 3 года назад +1

    My 4 months struggle to learn nlp got done in 23 min. Thanks a ton bro!

  • @rajaroy4731
    @rajaroy4731 3 года назад

    thanks a lot man, after two days of such boring videos on youtube finally found a good one to learn NLP, great work man keep going on machine learning and other data science topic, it's really rare to find such a great video.

  • @my_day_my_day
    @my_day_my_day Год назад

    Amazing. W8ing for more video😊

  • @sumitchhabra2419
    @sumitchhabra2419 4 года назад +3

    I am at present studying the Machine Learning A-Z course from Udemy. Trust me, the tutor has no Idea how to explain stuff and it is the best selling course on Udemy for Data Science.
    I scratched my head while I was going through his tutorial, then I came here and all my doubts were cleared. Coding in python is not a difficult task but understanding the concept is the most important thing. And I got that understanding of Bag of words Model from your tutorial.
    Thanks a lot for your help.
    P.S. One who understands the concept and has strong Fundamentals has the ability to explain stuff in the simplest manner possible. Please keep it Simple like this in upcoming tutorials also. ALREADY SUBSCRIBED.

    • @NormalizedNerd
      @NormalizedNerd  4 года назад +2

      Thanks a lot @Sumit Chhabra. I'll try my best to maintain this level of simplicity :)

  • @dien2971
    @dien2971 2 года назад

    Thank you for your best explanation!

  • @akalankagamage8112
    @akalankagamage8112 4 года назад +1

    Really Amazing and very clear . Keep this up . New subscriber .

  • @vindyaabeygunawardena6222
    @vindyaabeygunawardena6222 2 года назад

    excellent. thanks Normalised Nerd

  • @thegamooo7310
    @thegamooo7310 3 года назад

    Very nice explanation also covered much things in very less time. Keep it up Man👍

  • @soumadeepacharya9513
    @soumadeepacharya9513 Год назад

    Thanks brother❤

  • @christinavalavani1561
    @christinavalavani1561 3 года назад

    Please make a video or two about neural machine translation. With an example.

  • @sumitchhabra2419
    @sumitchhabra2419 4 года назад +1

    What does CountVectorizer do in this model ?
    Does it convert the words in the instances/Document into 0/1 please suggest....
    Thanks for your help in advance.

    • @NormalizedNerd
      @NormalizedNerd  4 года назад

      It'll generate the feature matrix that I started drawing at 7:06

  • @shaminmohammed672
    @shaminmohammed672 3 года назад

    Thank you sir 🙏

  • @assiaben1220
    @assiaben1220 2 года назад

    Hello Normalize Nerd, i've got an error in y(target) y = data.as_matrix(['Review_class'],
    AttributeError: 'DataFrame' object has no attribute 'as_matrix',
    By the way, thanks 💚 the tutorial is very clear and well explained . 👏Bravo

  • @oludotunode
    @oludotunode 2 года назад

    Shouldn't you initialize the regular expression outside the loop?

  • @kanchangandhi1200
    @kanchangandhi1200 4 года назад

    Very informative video , thank you for uploading the NLP series , having one queries how can we use auto text summary generator in other human lanaguages text like(Japanese , chinese & Korean)????? your reply would be very helpfull.

    • @NormalizedNerd
      @NormalizedNerd  4 года назад

      Unfortunately there's no library for that can summarize every language. However, you'll find many github repos where people have built text summarizers for other languages using the same method!

  • @prajnapn5946
    @prajnapn5946 3 года назад

    Well explained !!!

  • @thetensordude
    @thetensordude 4 года назад

    chalie jao dada!!!

  • @elirhm5926
    @elirhm5926 3 года назад

    Thank you so much

  • @sumitchhabra2419
    @sumitchhabra2419 4 года назад +1

    One suggestion here. Please ZOOM-IN your screen while you are explaining the coding part. Press Ctrl and scroll up from your mouse, it will zoom in. It puts strain on our eyes and understanding the coding part becomes a punishment.

  • @roopeshroope5988
    @roopeshroope5988 3 года назад

    We first split the data and do Preprocessing right why you perform on whole dataset

    • @NormalizedNerd
      @NormalizedNerd  3 года назад

      Yes, ideally we should first split then preprocess. Here, the text preprocessing will remain the same for both train and test set so I did them together. However, I also formed the BOW model on the whole data; which is not the correct way. We should build it only on the training set then apply it on the test set. I did it just to make things a little easier.

  • @user-jz3wo1om2c
    @user-jz3wo1om2c 4 года назад

    you are great.

  • @dhusor9633
    @dhusor9633 3 года назад +1

    plz a make tutorial how nlp is work on Bangla text datasets.

  • @elirhm5926
    @elirhm5926 3 года назад

    how can we go back to the original sentence from X_test? I mean how can I see what sentences the algorithm doesn't classify correctly?

    • @lpereira95
      @lpereira95 3 года назад

      Compare y_pred and y_test. The indices where they don't match are the mistaken samples. Then use that indices to access the sentences in X_test. I hope it helps.

  • @user-or7ji5hv8y
    @user-or7ji5hv8y 3 года назад

    Is X_train the bag of words?

  • @apoorvshrivastava3544
    @apoorvshrivastava3544 4 года назад

    Great, New subscriber

  • @doyourealise
    @doyourealise 4 года назад

    amazing keep posting videos :)

  • @rakeshmanathana
    @rakeshmanathana 4 года назад

    Is it 'Lov' or 'Love' the root?

    • @NormalizedNerd
      @NormalizedNerd  4 года назад

      Good question. The thing is...stemming should give us 'lov'. But, Porter stemmer gives us 'love'. I guess the reason lies in the details of Porter stemmer's implementation.

  • @apoorvshrivastava3544
    @apoorvshrivastava3544 4 года назад

    why we use deimitor as '/t'

    • @NormalizedNerd
      @NormalizedNerd  4 года назад

      In the .txt file, the values are separated by tab('/t') just like the values are separated by a comma in a .csv file. In pandas we have the function read_csv(reads .csv files by default). We need to pass the parameter to read tab-separated files.

  • @thamizhansudip6644
    @thamizhansudip6644 4 года назад

    powerpoint could have been used for nice presentation

  • @10xGarden
    @10xGarden 4 года назад

    I would have loved it if it was good. 😀😅Is this positive?

  • @user-or7ji5hv8y
    @user-or7ji5hv8y 3 года назад

    How can we access the text files used?

    • @NormalizedNerd
      @NormalizedNerd  3 года назад

      I've provide the link in the video description

  • @mdarifulislam2736
    @mdarifulislam2736 4 года назад

    ভাই আমিও বাঙালি.! ❤❤❤

    • @NormalizedNerd
      @NormalizedNerd  4 года назад +2

      বেশ ভালো লাগলো আপনার কমেন্টটি পেয়ে। চ্যানেলটিকে আপনার পরিচিতদের মধ্যে শেয়ার করার অনুরোধ রইল। ❤️