LLM-powered Topic Modeling

  • Published: Sep 8, 2024
  • Content summary: Topic modeling extracts the most salient topics from large text corpora, such as collections of notes or articles. Traditional topic modeling methods, like Latent Dirichlet Allocation (LDA) and the Structural Topic Model (STM), often fail to account for the context in which words appear. Advances in large language models (LLMs), which can interpret text in context, have led to context-aware topic modeling techniques such as BERTopic and TopClus. In this talk, we discuss and demonstrate these techniques. The objectives are:
    1. Explaining what topic modeling is and how it can be applied in our research.
    2. Discussing how LLM-powered topic modeling techniques, like BERTopic and TopClus, differ from traditional methods, particularly in their use of pretrained LLMs to generate context-aware topics.
    3. Providing a live demonstration of these techniques.
    Presenter: Charles Alba
    Code and materials used in this video can be downloaded from GitHub:
    240127_BERTopic.zip; 240127_bertopic.pdf
    github.com/Dre...
    Hashtags: #topicmodeling #artificialintelligence #machinelearning #deeplearning #python #pythonprogramming #pythontutorial #aitutorial #coding #neuralnetworks #neuralnetwork #pytorch #computervision #nlp #naturallanguageprocessing #scikitlearn

Comments • 8

  • @54LZ 4 months ago +1

    An interesting and great presentation. Thanks for sharing.

  • @ColabCorgi 4 months ago +1

    Excellent content. Just what I was looking for! Any tips on how to optimize the topic modeling process using GPT models from OpenAI?

    • @giacomocassano1439 4 months ago

      Hello! I'm a researcher at Politecnico di Milano and the University of South Australia. I'm trying to do the same thing; maybe we can have a chat!

    • @ColabCorgi 4 months ago

      @giacomocassano1439 Sure, how can I reach you?

  • @joshed790 4 months ago

    Could you give an example of how to merge the topic modeling output with our original dataset for further analysis and report creation?

    • @rolandabi2848 1 month ago

      Hey Josh, did you find a way to do this?
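One possible way to approach the question in this thread, offered as a minimal sketch and not as the presenter's method: BERTopic's `fit_transform` returns its topic ids as a list with one entry per input document, in input order, so the ids can be attached to the original rows by position. The `records`, `topics`, and `topic_labels` values below are made-up illustrative data, and `attach_topics` is a hypothetical helper name.

```python
from collections import Counter

# Hypothetical original dataset: one dict per document.
records = [
    {"id": 1, "text": "The central bank raised interest rates again."},
    {"id": 2, "text": "The striker scored twice in the final."},
    {"id": 3, "text": "Inflation is slowing, the bank said."},
    {"id": 4, "text": "The home team won the cup on penalties."},
]

# Assumed model output: one topic id per document, in the same order as the
# texts that were passed to the model.
topics = [0, 1, 0, 1]

# Hypothetical human-readable labels for each topic id.
topic_labels = {0: "monetary policy", 1: "football"}


def attach_topics(rows, topic_ids, labels):
    """Return copies of the rows with 'topic' and 'topic_label' fields added."""
    if len(rows) != len(topic_ids):
        raise ValueError("need exactly one topic id per record")
    return [
        {**row, "topic": t, "topic_label": labels.get(t, "unlabeled")}
        for row, t in zip(rows, topic_ids)
    ]


merged = attach_topics(records, topics, topic_labels)

# A simple report: number of documents per topic label.
report = Counter(row["topic_label"] for row in merged)
```

If the dataset lives in a pandas DataFrame instead, the same positional alignment applies: assigning the list as a new column (`df["topic"] = topics`) works as long as the rows are in the order in which the texts were passed to the model.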

  • @romanonugia8180 1 month ago

    Topic -1 is the outlier topic and should be ignored.
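In BERTopic, topic -1 collects the documents that the clustering step did not assign to any cluster. A minimal sketch of how that advice might be applied when summarizing results, using made-up topic ids (the `topics` list and the variable names are illustrative assumptions):

```python
from collections import Counter

# Assumed per-document topic ids; -1 marks documents left unassigned.
topics = [0, 1, -1, 0, 2, -1, 1, 0]
OUTLIER_TOPIC = -1

# Drop the outlier topic before counting topic sizes.
assigned = [t for t in topics if t != OUTLIER_TOPIC]

# Share of documents that ended up in the outlier topic.
outlier_share = 1 - len(assigned) / len(topics)

# Topic sizes, largest first, with the outlier topic excluded.
topic_sizes = Counter(assigned).most_common()
```

Reporting `outlier_share` next to the topic sizes can be useful, since a large share of unassigned documents may indicate that the clustering settings deserve a second look.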