Rasa Algorithm Whiteboard - BytePair Embeddings

Поделиться
HTML-код
  • Опубликовано: 27 дек 2024

Комментарии • 14

  • @faangsde
    @faangsde 4 года назад +2

    This channel is a gold mine! Thank you very much for sharing your insights!

  • @distrologic2925
    @distrologic2925 Год назад

    How does the algorithm know when to stop merging tokens?

  • @87456100
    @87456100 4 года назад +1

    Great video! I think the video description lacks a word in the sentence "They need way memory..."?

    • @distrologic2925
      @distrologic2925 Год назад

      they were masking that word to test your listening comprehension

  • @piyalikarmakar5979
    @piyalikarmakar5979 3 года назад

    Thanks sir.. one query.. what's the difference between byte pair and wordpiece tokenization?

    • @RasaHQ
      @RasaHQ  3 года назад +3

      (Vincent here)
      Great question! My impression is that they are very similar in practice but that the way for merging letters is slightly different. I could be wrong but I think workpiece uses a likelihood heuristic while bytepair uses counts.

  • @alanliang9538
    @alanliang9538 Год назад

    thanks bro, best explaination i can find.

  • @wibulord926
    @wibulord926 2 года назад

    thanks for usefull tutorial

  • @shahzadmalik96
    @shahzadmalik96 4 года назад

    First comment

    • @gorgolyt
      @gorgolyt 3 года назад +4

      congratulations, you win nothing.