NODES 2023 - Using LLMs to Convert Unstructured Data to Knowledge Graphs

  • Published: 19 Nov 2024

Comments • 20

  • @joshuacunningham7912
    @joshuacunningham7912 9 months ago +4

    Thank you very much for this helpful and inspiring presentation!

  • @tacticalforesightconsultin453
    @tacticalforesightconsultin453 7 months ago +3

    I built and presented a project like this, with more transparency, over 5 years ago, and completed it within a few weeks. The only concern was polysemy (words with multiple parts of speech).
    It really helped to condense the information down and easily see implications across the documents.

  • @capri300
    @capri300 8 months ago +2

    Nice talk. Concise, with just the right amount of information. A massive thank you for using animations in your slides; they helped tremendously with your flow. Trying the GitHub repo as we speak.

    • @neo4j
      @neo4j 8 months ago

      Thank you

  • @ahmed_hefnawy1811
    @ahmed_hefnawy1811 7 months ago +4

    Chunking is one of the most important steps in building a stable RAG flow. KGs will change the RAG game.

  • @kennethnielsen3453
    @kennethnielsen3453 9 months ago +7

    Surprised you didn't use the Matrix movies instead :D

  • @Manu-m8w6m
    @Manu-m8w6m 9 months ago +4

    Quick question: let's say we are working with maybe hundreds of files to create the graph. Wouldn't it be too costly to use an LLM?

    • @MrRubix94
      @MrRubix94 9 months ago +1

      That's the real question

    • @Manu-m8w6m
      @Manu-m8w6m 9 months ago

      @MrRubix94 Any idea how we can solve it?

    • @MrRubix94
      @MrRubix94 8 months ago

      @Manu-m8w6m No idea. I have yet to dive into the subject myself.

    • @MrGara1994
      @MrGara1994 8 months ago +1

      I think what you do there is pre-index the vector database and, before sending the request, preload the top n chunks. You would most likely also optimize the knowledge graph by limiting the number of tokens per chunk to whatever works best for each task.

    • @Manu-m8w6m
      @Manu-m8w6m 8 months ago

      @MrGara1994 If that's the case, this might be a dumb question 😅, but if we are using vector search to get the top n chunks, is there any difference between using a KG and doing a normal vector search?

  • @divyaburri-z5j
    @divyaburri-z5j 5 months ago

    Can you provide information regarding seed from URI for the Azure storage seed provider?

  • @BasuSaptarshi
    @BasuSaptarshi 3 months ago

    Does anyone develop applications for production this way? What about ontology?