Leann Chen
Leann Chen
  • Видео 5
  • Просмотров 51 788
You Need Better Knowledge Graphs for Your Graph RAG
RAG (Retrieval-Augmented Generation) has become the hype of Generative AI applications, so are knowledge graphs. You see lots of graph-based LLM apps out there and you're probably building one too. However, how you construct knowledge graphs determines the quality of your LLM-based application. Solely relying on GPT-4 for extracting entities and relationships without thorough evaluation will give you the garbage-in-garbage-out effect.
To get prepared for Data Day Texas 2024, I built a Graph RAG AI assistant using Diffbot API for both web scraping and knowledge graph construction. You'll see how I built it while monitoring the results throughout the video. Diffbot offers transparency in the...
Просмотров: 35 718

Видео

Build an Advanced RAG Chatbot with Neo4j Knowledge Graph
Просмотров 14 тыс.10 месяцев назад
Advanced RAG (Retrieval-Augmented Generation) and knowledge graphs make AI chatbots more powerful and context-aware. Your chatbot can digest more data sources than just one document. We feed the chatbot with different text data regarding the event of Sam Altman's surprising exit and return to OpenAI. This video walks you through how to build the system with LLM tools. 0:00 Intro 0:42 Load wiki ...
Vector Search (RAG) Decodes Inside Out
Просмотров 83411 месяцев назад
🎬 Inside Out Data Adventure! In this video, we take a fun look at Pixar's 'Inside Out' and see what data can tell us about the movie's 5 emotions: 😄😢😠😱😒. We're using cool AI to understand the characters better and build a simple app to show you the story in a whole new way. What's Inside: - Cool ways to turn the 'Inside Out' script into pictures and graphs. - A chatbot that helps us dig into th...
Meet the Inside Out AI Chatbot! 🤖🎬
Просмотров 53011 месяцев назад
Curious about the emotions from the movie Inside Out? I've built an AI chatbot to explore them! This video is a quick peek at how we decode the movie's feelings with AI technologies. Check out the app: inside-out-character-explorer-wdyr3tx8maxqwgjrh2bnz3.streamlit.app What to Expect: A sneak peek at using AI to chat about 'Inside Out's characters. A teaser of our data science journey with Joy, ...

Комментарии

  • @sanjaybhatikar
    @sanjaybhatikar 28 дней назад

    Simply using OpenAI tools is not interesting

  • @auro284
    @auro284 Месяц назад

    very insighful, would love to see more such videos.

  • @rockstar-lt8rg
    @rockstar-lt8rg Месяц назад

    How to return neo4j subgraph image when stremlit's response?

  • @Shaan_Suri
    @Shaan_Suri 2 месяца назад

    Hi, since you mentioned pricing, especially how expensive GPT-4 is, is using the Diffbot API free? Thanks.

  • @nschul4
    @nschul4 2 месяца назад

    I've just completed some experiments using Microsoft's GraphRAG and your description describes my results exactly, "garbage-in-garbage-out". Without a consistently solid knowledge graph there's not much an LLM can have knowledge of. Thanks for sharing your project. I'll take a look at that.

  • @AIGOAT-z4m
    @AIGOAT-z4m 2 месяца назад

    I feel like a Noob in the field of AI, after seeing your video. But the video is great

  • @rephechaun
    @rephechaun 3 месяца назад

    Hello, thank you very much for posting the video, I am very interested in the part where you also show the graoh with in the chatnot, what python packahe is that please?(,y apologies if the question is redundant, I couldn't find it in other comments)

  • @zaubermanninc4390
    @zaubermanninc4390 3 месяца назад

    Subbed for Mr. Beast counting 💗

  • @riyashah1161
    @riyashah1161 3 месяца назад

    you're so pretty

  • @mohammedshuaibiqbal5469
    @mohammedshuaibiqbal5469 4 месяца назад

    Can you make a video on how to generate knowledge graphs for pdf books like DSM 5

  • @mandraketupi5
    @mandraketupi5 4 месяца назад

    Hi Leann Thank you! Your videos are an amazing source of inspiration! keep up the good work!

  • @enkhbatenkhjargal9447
    @enkhbatenkhjargal9447 4 месяца назад

    Thanks for this video, subscribed! :)

  • @senthilkumarpalanisamy365
    @senthilkumarpalanisamy365 4 месяца назад

    Excellent video, clear explanation, please do post more in the gen ai and knowledge graph space

  • @markquinsland8385
    @markquinsland8385 5 месяцев назад

    go to the presentation by Amy Hodler (and tell her I said hello)

  • @AP-fu3bj
    @AP-fu3bj 5 месяцев назад

    Are you creating embeddings on top of the knowledge graph for RAG??

  • @AP-fu3bj
    @AP-fu3bj 5 месяцев назад

    Can you please share the code for the application you built to visualize the knowledge graph ?

  • @idk-kv9hg
    @idk-kv9hg 5 месяцев назад

    Hey Leann, first of all great explanation with some insights (specially on Diffbot). You got a new subscriber 👍 I'm going to work on a RAG based project which will use Neo4J as a Graph Database. I've went through other comments and your answers to them. But still wanted to know few things: 1. Here you took the example of speaker and what they have spoken (and their interest/expertise etc...) which is working fine. But what if I have some PDF docs of roughly 50-70 pages with some rules and regulations and want to use them as a custom knowledge base from my RAG project? Is knowledge graph database is good choice? why not simple Vector DB (such as milvus db)? 2. Assuming I must use Graph database, how do I efficiently chunk the PDFs and store into graph notes and relations? So that if users asks any query then user should get correct answer. 3. If the docs are related to rules and regulations, then what will be the nodes and relationships between them? Because here in your example, nodes were speaker, their expertise etc... I understand that you might not have perfect answer for all of these above but I'd like to have some point of view. Hope you find my comment and reply me once you get a time. Thanks for reading and your time.😊

    • @Abstract.x
      @Abstract.x 3 месяца назад

      Hey great questions, I'm also considering working on such. Did you figure out answers to your questions? they will help me better understand the usability of this as well

  • @linlinlau6785
    @linlinlau6785 5 месяцев назад

    英文發音清楚。速度剛好。 如果內容文字部分再放大些。會更清晰喔

  • @linlinlau6785
    @linlinlau6785 5 месяцев назад

    太棒了。 已訂閱追蹤

  • @cristhiamtovar9003
    @cristhiamtovar9003 5 месяцев назад

    Thanks for this video

  • @BenjaminKing1
    @BenjaminKing1 5 месяцев назад

    It's nice to see my old co-worker Michelle randomly popping up in a video. I hope you were able to meet her. She is great!

    • @lckgllm
      @lckgllm 5 месяцев назад

      I did meet her and was able to talk to her personally! It was definitely great. Joining her session presented with Amy Hodler will make you realize you don't want to miss another one! 😊

  • @stanTrX
    @stanTrX 5 месяцев назад

    I have tested for few critical document to get some answer using standard RAG and to be honest, didnt enjoy the performance so much.

    • @lckgllm
      @lckgllm 5 месяцев назад

      Thanks for sharing your experience! As you referring to standard RAG (purely vector-based) or graph-based? Purely vector-based RAG is not great, while graph-based RAG could return more reliable results. But I also have to be honest that prototypes are generally cute and are very far from production-ready. That's why we need a lot of testing/evaluations, and I'm currently gearing towards making videos containing production-oriented testing :) Here's a video that I did some testing: ruclips.net/video/mHREErgLmi0/видео.html

  • @vbywrde
    @vbywrde 6 месяцев назад

    Yes, to physically present yourself in multiple locations at the same time is quite challenging. My understanding is that it requires you to achieve presence on the fourth dimension. Once there, you can then enter multiple three-dimensional spaces at the same time. I wish I could do that, though I suspect it would be really disorienting at first! Best wishes! Also, I learn something new with every one of your videos! Thank you! I really like your approach!

    • @lckgllm
      @lckgllm 5 месяцев назад

      I'm surprised and also thrilled that finally someone takes my not-so-funny joke in the video seriously! 😂 Love you concise and scientific explanation of the multi-dimensional space, which makes me dream more about having that superpower. 😉 Thanks for the encouragement once again. I'm learning a lot from you guys too and have been enjoying the journey with you all!

    • @vbywrde
      @vbywrde 5 месяцев назад

      @@lckgllm Oh good. When you make it to the fourth dimension, please give a holler! :) It would be fun to see you in two places at the same time. Three even! XD. In the meantime, please keep us posted as to your coding progress. I find your videos really helpful. Thanks!

  • @darkhydrastar
    @darkhydrastar 6 месяцев назад

    You are an excellent presenter. Thank you. We do however need to find you a better background music. It's giving pharmaceutical commercial and the levels are a little too high over your voice. Still great though. You have excellent stage presence and a clear voice.

    • @lckgllm
      @lckgllm 5 месяцев назад

      Totally agree with you :) I have since upgraded to epidemic sounds for music and be more mindful that the music volume should not distract the viewer when I'm speaking. I'm trying to learn and become better after every video, so really appreciate seeing feedback like this for improvement!

  • @ramdeeproy6853
    @ramdeeproy6853 6 месяцев назад

    Can we implement the azure open ai creds like api key, model name, endpoint, type and version in the ipynb file and run it? Also please mention the dependency libraries of the functions.py file as visual, Node, Edge, Cypher_graph are not getting initialised in VSC while running the file....

  • @cemery50
    @cemery50 6 месяцев назад

    I'm interested in creating a Little Logical Model based upon the command structure of an application and then using agents take voice to text and text to cmd. maybe with a coresponding graph view updated with current information avaiable in another window on another display screen.

  • @MrBekimpilo
    @MrBekimpilo 6 месяцев назад

    This is very insightful Leann.. cheers from South Africa

    • @lckgllm
      @lckgllm 5 месяцев назад

      Thanks for the encouragement! 😊

  • @alexandreturlier5464
    @alexandreturlier5464 6 месяцев назад

    Great content! I am a complete beginner: I have a Neo4j db already populated, I want to "only" do the chatbot portion connected with GPT4. Would you mind guiding me on which .py I should use in this usecase? In the meantime, I am getting a "UnboundLocalError: local variable 'nodes' referenced before assignment". Not sure what to do... Thanks!!

  • @krisograbek
    @krisograbek 7 месяцев назад

    This is so good, Leann! I'm just jumping into the field of Knowledge Graphs! This will be huge for RAG applications! Why did you stop publishing videos?

    • @lckgllm
      @lckgllm 7 месяцев назад

      Hi Kris! Thanks for the encouragement :) I didn't stop posting. Instead, I'm posting videos on another channel I work for (e.g. ruclips.net/user/shortsoJVRWGfqfjQ) I will try hard to post more videos about knowledge graphs and LLMs.

    • @krisograbek
      @krisograbek 7 месяцев назад

      @@lckgllm I didn't know about the other channels. Thanks for letting me know!

  • @kingmouli
    @kingmouli 7 месяцев назад

    Thank you for amazing short video, I am eagerly waiting for you to make a video on how to convert csv data into knowledge graphs and answers questions on the csv files

  • @kingmouli
    @kingmouli 7 месяцев назад

    thank you for the crisp view on KG+RAG, can we create KG on multiple csv files , currently csv agents were lacking behind to answer questions based on content, they only search for matching column for the question rather content passed.

    • @lckgllm
      @lckgllm 7 месяцев назад

      Love your idea! It’s totally possible to create KG based on multiple csv files, but can you say more about what “content” means in your case?

    • @kingmouli
      @kingmouli 5 месяцев назад

      @@lckgllm What I mean by content is csv data here

  • @souzajvp
    @souzajvp 7 месяцев назад

    Thanks for the awesome video! I was trying to reproduce your code but got an error because the "text_for_kg()" function was not defined. Any chance you can help me understand where this functions comes from? Great content and great editing! Thank you

    • @ajeeshsunny4592
      @ajeeshsunny4592 7 месяцев назад

      Same problem for me. Trying to implement text_for_kg.

    • @lckgllm
      @lckgllm 7 месяцев назад

      Hello! Sorry for the late reply, been busy with work. I just realized that text_for_kg() somehow was deleted from the notebook, but it should be the same thing as diffbot_nlp.nlp_request(). I just updated the notebook in the girhub repo. Let me know if it doesn't work. I'll do my best to fix it. Thanks for point this issue out! @souzajvp

  • @mohammedmahinuralam2796
    @mohammedmahinuralam2796 7 месяцев назад

    Great! Waiting for more of your videos!

  • @deniowork7084
    @deniowork7084 7 месяцев назад

    Love this!

  • @SunMai93
    @SunMai93 7 месяцев назад

    Diffbot sets a pretty high bar for entering this project, any thought/plan to utilise open source project instead? Thanks!

    • @lckgllm
      @lckgllm 7 месяцев назад

      Yes I have previously used spacy-llm in my last video:ruclips.net/video/mVNMrgexxoM/видео.html However, from the results generated by spacy-llm in my GitHub, you can see that there are still errors in the output, and I need to further pass the results to ChatGPT-4 for refining: github.com/leannchen86/openai-knowledge-graph-streamlit-app/blob/main/openaiKG.ipynb I hope future LLMs (regardless of closed source and open source) will enable us to see the confidence score for the output as I experienced with Diffbot's APIs.

    • @SunMai93
      @SunMai93 7 месяцев назад

      thank you@@lckgllm ! I will have a look @ the video and the notebook. Might come back for discussion again. have a good one!

  • @SunMai93
    @SunMai93 7 месяцев назад

    useful content, no word wasted!

  • @SunMai93
    @SunMai93 7 месяцев назад

    Very nice content! support support 🇹🇼

  • @AdityaSharma24091994
    @AdityaSharma24091994 7 месяцев назад

    Can RAGs become efficient enough to do data analysis over text tables and csvs? I'm planning to build one so wanted to know if this is possible.

    • @lckgllm
      @lckgllm 7 месяцев назад

      Yeah I think so! That's a great idea for a new video :)

    • @AdityaSharma24091994
      @AdityaSharma24091994 7 месяцев назад

      @@lckgllm yes. I would be glad to collaborate on such project.

  • @MehdiAllahyari
    @MehdiAllahyari 7 месяцев назад

    Great video! However, I would completely replace DiffBot with an open source solution. There are many NER models, SpanMarkerNER to name one, since most of the entities you showed in the video are Person, Location, and Org, which libraries like SpaCy and setFit are pretty good for them. Using LLM with few shot learning would be another option. Overall, very nice video.

    • @lckgllm
      @lckgllm 7 месяцев назад

      Thanks for the feedback! I have previously used spacy-llm in my last video:ruclips.net/video/mVNMrgexxoM/видео.html However, from the results generated by spacy-llm in my GitHub, you can see that there are still errors in the output even if examples are included in the prompts, and I needed to further the pass the results onto ChatGPT-4 for refinement: github.com/leannchen86/openai-knowledge-graph-streamlit-app/blob/main/openaiKG.ipynb I hope future LLMs (regardless of closed source and open source) will enable us to see the confidence score for the output as I experienced with Diffbot's APIs.

    • @MehdiAllahyari
      @MehdiAllahyari 7 месяцев назад

      @@lckgllm If you'd like to have confidence score using llms, a simple hack is, add that into the prompt, so llm returns the result with scores. :)

  • @pouet4608
    @pouet4608 7 месяцев назад

    It can be done by hand, but automatisation of this human feature is impressing. Good video!

  • @IdeationGeek
    @IdeationGeek 7 месяцев назад

    It's cool! Considering the current long context window of LLMs that pass the "Needle In A Haystack" (NIAH) test with flying colors, makes creating Neo4j graphs rather a human domain-learning and verified knowledge collection activity, useful for science and formal domain exploration, rather than ad-hoc knowledge explorations. One thing, Neo4j is not enough to represent complex hierarchical relationships that need hypergraphs. Check Dgraph (open source software).

    • @lckgllm
      @lckgllm 7 месяцев назад

      Just checked DGraph out! Didn't know about that before, thanks for sharing the info :)

  • @andydataguy
    @andydataguy 7 месяцев назад

    Great video!

  • @CK.23.
    @CK.23. 7 месяцев назад

    I just had to be the 1,000th.. Congrats..

    • @lckgllm
      @lckgllm 7 месяцев назад

      Thanks! 🥳

  • @goofballbiscuits3647
    @goofballbiscuits3647 7 месяцев назад

    Cool! New sub from me 😊

  • @BobaQueenPanda
    @BobaQueenPanda 7 месяцев назад

    Content was good but found face filters visually distracting.

    • @lckgllm
      @lckgllm 7 месяцев назад

      What face filters? I literally talked in front of my MacBook Pro 14. I did have makeup on which I admit.

    • @Armoredcody
      @Armoredcody 7 месяцев назад

      @@lckgllm you're fine do not worry about it. however, the audio sounds like it has a low bitrate.

    • @lckgllm
      @lckgllm 7 месяцев назад

      Definitely going get a mic for better voice quality. Thanks for the feedback, folks!@@Armoredcody

  • @vanderstraetenmarc
    @vanderstraetenmarc 7 месяцев назад

    I'm a newbie on these matters discussed here, but I really do appreciate the way your MyGraph RAG AI Assistant work, responding with text AND graph. Can you tell me a bit more on how you did accomplish this? (I'm especially interested in the graph that got generated!). Hope that's not a stupid question?

    • @lckgllm
      @lckgllm 7 месяцев назад

      Definitely a great question! I didn't include the process in this video and plan to make another video about this, but let me show you the details via email :)

    • @vanderstraetenmarc
      @vanderstraetenmarc 7 месяцев назад

      @lm Would be highly appreciated! 🙏 Didn't get it yet though...

    • @justindehorty
      @justindehorty 4 месяца назад

      ​@@lckgllm Hi Leann, I had the same question. Isn't this just an implementation of streamlit-agraph? Is there any reason why you left this out of the GitHub repo you shared? It would be incredibly helpful/instructive to see the implementation.

  • @manfyegoh
    @manfyegoh 7 месяцев назад

    nice sharing

  • @thomaskaminski2187
    @thomaskaminski2187 7 месяцев назад

    KG are key for providing context to RAG. Still, I see the OWL/RDFs path outperforming LPG as it enables the user to explicitly define semantics and infer knowledge

  • @Jeganbaskaran
    @Jeganbaskaran 7 месяцев назад

    Really awesome, Thank you for this video

  • @malipetek
    @malipetek 8 месяцев назад

    Thanks.