Bag of Words - Feature Extraction in Natural Language Processing (BoW in NLP)

Поделиться
HTML-код
  • Опубликовано: 29 сен 2024

Комментарии • 9

  • @Socratica
    @Socratica  8 месяцев назад +3

    We worked these examples using the Wolfram Language. Socratica offers a pro course, 'Mathematica Essentials,' providing key concepts for mastering Wolfram products:
    www.socratica.com/courses/mathematica-essentials

  • @kirbymarchbarcena
    @kirbymarchbarcena 8 месяцев назад +7

    I didn't expect The Foundation, The Adventure of Sherlock Holmes, and War and Peace to be in this video as examples.

  • @jagadishgospat2548
    @jagadishgospat2548 8 месяцев назад +3

    Good one team, it's about time we learn about algorithms before they take over.

  • @jim4859
    @jim4859 8 месяцев назад +4

    I think this is really interesting.

  • @Her_Lovely_Tentacles
    @Her_Lovely_Tentacles 8 месяцев назад +1

    "because cats are not vegan they should eat meat"
    vs
    "because cats are vegan they should not eat meat"
    Bag of Words: "It's the same sentence 🤷"
    In seriousness: is there a way around situations like this, for example by binding the "not" more tightly, or is this simply out of scope for this approach, and the only relevant features are cats and whether or not they are vegan, but with no conclusion if they actually are vegan?

  • @DasIllu
    @DasIllu 8 месяцев назад +1

    I just wrote a small tokenizer to fit my needs, now i feel like i have to expand it massively.
    Thanks for the video.

  • @chlupatazarovka8201
    @chlupatazarovka8201 8 месяцев назад

    What about lemmatization? It isn't used?

  • @samson6707
    @samson6707 8 месяцев назад

    WordCount[text]. Where you taking these functions from?

    • @Socratica
      @Socratica  8 месяцев назад +4

      This is a built-in function in the Wolfram Language.
      WordCount["string"] gives the total number of words in string.