Local GraphRAG with LLaMa 3.1 - LangChain, Ollama & Neo4j

Поделиться
HTML-код
  • Опубликовано: 26 окт 2024

Комментарии • 109

  • @garic4
    @garic4 2 месяца назад +11

    This was EXACTLY what I've been looking for. Thank you for the great walkthrough, you just got yourself a new subscriber. Cheers

  • @eloiseidlitz7062
    @eloiseidlitz7062 2 месяца назад +2

    Smart video ! It could be a good way to access easily relationships between stakeholders in any kind of business.

  • @mgmanzanillo
    @mgmanzanillo 2 месяца назад +3

    Has any tried this code for a large amounts of documents?
    How does it scale?

  • @miguelprctba
    @miguelprctba 2 месяца назад +3

    Congratulations, great approach! I was wondering if you could consider using a different embedding method instead of OpenAI's, as it requires payment. What do you think?

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +2

      Ollama also offers an embedding model and to be honest, I think you should probably use it. You can use the same approach to use an env variable to use the ollama embeddings class like I did for the model

  • @mgmanzanillo
    @mgmanzanillo 2 месяца назад +2

    Could you also create sample code for instances where multiple documents are stored in a directory in either .txt or a compressed format like Parquet?
    How would it work then? Would you transform each document separately and combine them as a graph?
    Would like to get your insight.
    Thank you for the video!

  • @effectory
    @effectory 12 дней назад

    Great approach. As far as I understood, you still need to build a vector store for the queries, so it is not possible to get an answer only with Neo4J graphdatabase?

    • @codingcrashcourses8533
      @codingcrashcourses8533  12 дней назад

      You do! Its normal to use a hybrid approach use use results from neo4j and similarity search.

  • @charlesthompson8938
    @charlesthompson8938 15 дней назад

    Awesome video! The notebook is slightly different from the video. Could you please make some comments here to reflect the departure away from the video?

    • @codingcrashcourses8533
      @codingcrashcourses8533  15 дней назад

      I incorpoated issues people had with the Notebook which. Logically nothing should have changed

  • @Weotcs
    @Weotcs Месяц назад +2

    Hello, I had some trouble with :
    graph.add_graph_documents(
    graph_documents,
    baseEntityLabel=True,
    include_source=True
    )
    Apparently some labels and relationships were empty, which is a problem for neo4J, I handled it by making sur that at least every node and relationship had a label. Why do I have that issue you did not had ?

    • @codingcrashcourses8533
      @codingcrashcourses8533  Месяц назад

      @@Weotcs some people had this with llama. Did it also happen with gpt 4o Mini?

  • @yashbajpai231
    @yashbajpai231 18 дней назад

    Really informative and to the point tutorial! I do have 2 questions though.
    Q1. Did this approach store two sets of embeddings for the documents? One in the graph database and the other in a normal vector database?
    Q2. If I want to add more data to this graph, do I have to recompute the nodes every time or is there a way to just extend the original graphs with new nodes from the newly added content?
    Thanks!

    • @codingcrashcourses8533
      @codingcrashcourses8533  18 дней назад +1

      1. Yes, that approach is called hybrid approach
      2. Yes, at least as I know, there is no way around that

    • @yashbajpai231
      @yashbajpai231 17 дней назад

      @@codingcrashcourses8533 thanks 🙏

  • @i2c_jason
    @i2c_jason 2 месяца назад +2

    Are the graph visualizations mostly used for development, to double check what the system is 'thinking' and look for valuable clustering?

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      The visualization will probably used for development, the LLM won´t use it (a vision model won´t be able to deal with it, if you think of that)

    • @i2c_jason
      @i2c_jason 2 месяца назад +1

      @@codingcrashcourses8533 I may have missed it from the video, but is Neo4j open source / free to use always? I looked at their site and was confused. Or are you using free-tier for the demo? Would love to use this for development. Thanks!

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +2

      @@i2c_jason It´s free to use with Docker, you can also download a Desktop version. If you want to use it for your enterprise app, it might be different

    • @alishaw9999
      @alishaw9999 2 месяца назад

      @@codingcrashcourses8533 can you demonstrate it with desktop version of neo4j

  • @kshitijdesai2402
    @kshitijdesai2402 8 дней назад

    Very nice!

  • @Mohidthevloger
    @Mohidthevloger Месяц назад

    Great video! One thing I would be interested to know that were you be able to compare the results between the two approaches Graph or Vector search? Which one is retrieving the most relevant results?

    • @codingcrashcourses8533
      @codingcrashcourses8533  Месяц назад

      It depends on the question :). You probably want to use a Hybrid approach

  • @Yes-lm9dq
    @Yes-lm9dq 22 дня назад

    I tried following the tutorial and encountered some issues (I suspect the issue stems from the openai fallback?). Anyways, I have created a pull request on your github. Hope that helps anyone who wants to only use local models and encountered a similar problem.

  • @bhaibhai-qe8tt
    @bhaibhai-qe8tt 3 дня назад

    I am having this error "no validator found for , see `arbitrary_types_allowed` in Config"

  • @sandeepgajula7635
    @sandeepgajula7635 Месяц назад

    I have created a Neo4j graph but querying is getting difficult. So, Here the retrieval function varies for different inputs(like this retrieval function will work for this example only if I change the input it mayn't give me the proper results). How to overcome this problem. Can you give us a solution for querying the neo4j graph with different llms (like gemini,openai,ollma etc..)
    Thank you hoping for proper response.

  • @yashbajpai231
    @yashbajpai231 16 дней назад

    When i use the convert to graph document function, for some reason it's not creating any nodes. Am i missing a step? From my understanding it is the LLM which will decide the nodes and the relationship between them right?

  • @Transforming-AI
    @Transforming-AI 2 месяца назад +1

    You mentioned you received a key error at one point. I am getting that key error as well. Please share how you worked around the key error. Following your notebook, my key error is, KeyError: 'head'.

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +1

      @@Transforming-AI had the same issue a few times. I think the 8k Model produced this, bit not always. Or did you try the 70b or 405b Model?

    • @Transforming-AI
      @Transforming-AI 2 месяца назад

      @@codingcrashcourses8533 I have the 8b installed, will try the 70b. Thanks.

    • @saadsafi6073
      @saadsafi6073 2 месяца назад

      @@codingcrashcourses8533 had the same, seems to be caused by the double quotes in dummytext.txt, if you replace them (say, with single quotes) then the json will not be broken. key error was to do with missing 'head' which is one of the node keys.

    • @mingjenchen2975
      @mingjenchen2975 2 месяца назад

      @@codingcrashcourses8533 I tried it with llama3.1:70b and it took 13 minutes on my Mac Studio.

    • @carlospareja7951
      @carlospareja7951 2 месяца назад

      @@codingcrashcourses8533 I'm also running into this issue when running with the 8B model.

  • @sylap
    @sylap 2 месяца назад

    Thank you for this video! 👏 I've been eager to see someone cover this topic. I have a couple of questions I hope you can help me with.
    In this approach, is there a way to rerank the retrieved documents?
    Did I understand correctly that you first query the graph database, and based on the retrieved entities and relationships, it then retrieves pieces of documents from the vector store?

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +2

      Regarding reranking: I don´t know to be honest. I could not think of any metric to do that.
      Regarding retrieval: No! I used a hybrid approach there, but they are independent approaches. The LLM will use docs from both approaches.

    • @sylap
      @sylap 2 месяца назад

      @@codingcrashcourses8533 Honestly, not sure if reranking is even needed for Graph based RAGs, as they are very accurate. 😄
      Thanks! 👊

  • @QuocNguyen-se7vi
    @QuocNguyen-se7vi 10 дней назад

    please help me, i start to run the second comand line (from langchain_core.runnables import RunnablePassthrough......) and the result is false, no .evn file apppears. Please help me to fix thiss. Thank you.

    • @codingcrashcourses8533
      @codingcrashcourses8533  10 дней назад

      @@QuocNguyen-se7vi you have to create it yourself first ;).

    • @QuocNguyen-se7vi
      @QuocNguyen-se7vi 9 дней назад

      @@codingcrashcourses8533 thank you, I see LightRAG has a better constructive knowledge graph than GraphRAG, would you take a look at it!

  • @maxlgemeinderat9202
    @maxlgemeinderat9202 2 месяца назад

    Interesting! Is Neo4j language dependent? So is it capable of finding relations of multilingual data or does Neo4j not care about the language and only the LLM to create the knowledge graph should be multilingual?

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +2

      the LLM is responsible for creating the entites and documents, you will just save them in the required format in the database. Neo4J has got many libraries with different languages. The format of the documents matter.

  • @moellerseo
    @moellerseo 2 месяца назад

    Nice video. 🎉 Thanks

  • @HarshBhatnagar-i1v
    @HarshBhatnagar-i1v 19 дней назад

    docker compose is not working
    any suggestions ?

  • @tuba-inxs
    @tuba-inxs 2 месяца назад

    Is this really an RDF Knowledge Grap, or just a Neo4J property graph. That doesn't look like a SPARQL query in your code.

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      Hm I am not a Graph Expert to be honest. I had go through some tutorials myself to learn how to construct a query first, since I mainly worked with the RAG part so far. How would it look like in your opinion?

  • @hitlab
    @hitlab Месяц назад

    does anyone know if there is a graphics package for python that works the way this one does but not for Jupyter notebook?
    Looking to build an app an want to be able to show the same kind of graph viz in a web browser.

    • @codingcrashcourses8533
      @codingcrashcourses8533  Месяц назад

      I can do a Video on that if you want to

    • @hitlab
      @hitlab Месяц назад

      @@codingcrashcourses8533 that would be great!

  • @geniuscva
    @geniuscva 19 дней назад

    I am encountering an error when running the command
    print(graph_retriever("Who is Nonna Lucia?"))
    Generated Query: Nonna~2 AND Lucia~2
    ClientError: {code: Neo.ClientError.Procedure.ProcedureCallFailed} {message: Failed to invoke procedure db.index.fulltext.queryNodes: Caused by: java.lang.IllegalArgumentException: There is no such fulltext schema index: entity}"

  • @oluwaseunakinropo6318
    @oluwaseunakinropo6318 2 месяца назад

    Hii, amazing content as usual man, i am trying to implement this noe4j graph on my pc after downloading your code, i went to neo4jaura and downloaded a .txt file containing the database credentials but whenever i try it from your code i get this error message below, please help
    " Could not connect to Neo4j database. Please ensure that the url is correct"
    it is exactly the same thing i downloaded from their website

  • @giantworks1366
    @giantworks1366 2 месяца назад

    Nice. Can I only use llm ollama3.1 without openAi? Thanks

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      @@giantworks1366 yes, i wrote the Code in a way that you can switch from one to another via the llm_type variable. But the 8b param models seems sometimes to have issues to create the docs in the required way

  • @SilviaRicci13
    @SilviaRicci13 Месяц назад

    I try this approach with Ollama but It seesm when execute the row "llm_transformer.convert_to_graph_documents" it run for more time and I need to stop the run..can help me? What is it the problem? The graph DB neo4j connection?

    • @codingcrashcourses8533
      @codingcrashcourses8533  Месяц назад

      @@SilviaRicci13 any errors?

    • @SilviaRicci13
      @SilviaRicci13 Месяц назад

      @@codingcrashcourses8533 No errors!

    • @SilviaRicci13
      @SilviaRicci13 Месяц назад

      @@codingcrashcourses8533 After 118 minutes with ollama end with this error: ValidationError: 2 validation errors for Node
      id
      none is not an allowed value (type=type_error.none.not_allowed)
      type
      none is not an allowed value (type=type_error.none.not_allowed). Error is generated from the line: graph_documents = llm_transformer.convert_to_graph_documents(documents)

    • @VanshMaurya-b3b
      @VanshMaurya-b3b Месяц назад

      @@SilviaRicci13 so what did you do?

  • @Matthew-rb7rs
    @Matthew-rb7rs Месяц назад

    LangChain --version ?

  • @PassingTheDog
    @PassingTheDog 2 месяца назад +2

    Running this error over and over in the "graph_documents = llm_transformer.convert_to_graph_documents(documents)" cell:
    731 parsed_json = self.json_repair.loads(raw_schema)
    732 for rel in parsed_json:
    733 # Nodes need to be deduplicated using a set
    --> 734 nodes_set.add((rel["head"], rel["head_type"]))
    735 nodes_set.add((rel["tail"], rel["tail_type"]))
    737 source_node = Node(id=rel["head"], type=rel["head_type"])
    TypeError: string indices must be integers

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +1

      Hmhmh, some people seem to get this unfortunately. Did you use the 8B model? That could explain it, since it might create empty documents or so.

    • @Wingly113
      @Wingly113 2 месяца назад

      @@codingcrashcourses8533 getting the same error as well

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      @@Wingly113 also with 8b? I did not run in any errors with 70b of gpt4-small yet

    • @PassingTheDog
      @PassingTheDog 2 месяца назад

      @@codingcrashcourses8533 Yup, 8b. I can't handle 70b tho...

    • @sidharthgarg2348
      @sidharthgarg2348 2 месяца назад

      Llama 3.1 8B can also handle it, instead of ollama use llama.cpp and download llama 3.1 8B.gguf model and customise prompt template and you are good to go.i For reference you can use this repo
      github.com/s3dhanth/GraphRAG-with-Hermes-2.5-Pro-LLM-using-neo4j-database

  • @DePhpBug
    @DePhpBug 2 месяца назад

    I just finished learning the ChromaDB , can the GraphRAG works with ChromaDB instead of Neo4J?

  • @CptSnunu-ii4um
    @CptSnunu-ii4um 2 месяца назад

    How much VRAM and RAM do you think it takes to run the 70b model at minimum?

    • @Yes-lm9dq
      @Yes-lm9dq 22 дня назад

      The amount of VRAM you would need is roughly 70 x 2 = 140gb

  • @truliapro7112
    @truliapro7112 2 месяца назад

    How can I use this for my code project files. I need to convert whole php project with python.

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +1

      The code is in the description, it´s free, do whatever you want with it :).

  • @Sundaysun9633
    @Sundaysun9633 2 месяца назад

    Cool 🤙

  • @federicosalvati2454
    @federicosalvati2454 Месяц назад

    where is the .env file in the repository?

    • @codingcrashcourses8533
      @codingcrashcourses8533  Месяц назад

      @@federicosalvati2454 its not there. You need to use your own api Keys and Passwords of course

    • @federicosalvati2454
      @federicosalvati2454 Месяц назад

      @@codingcrashcourses8533 thank you so much I am approaching coding with python and RAGs for the first time and all of this helps a lot

  • @shyamvai
    @shyamvai Месяц назад

    I tried to git clone your repo.. it does not work.

    • @codingcrashcourses8533
      @codingcrashcourses8533  Месяц назад +1

      why? Cloning should always work if you dont have an error in your git configuration

  • @kevli6373
    @kevli6373 2 месяца назад

    Thank u for great video.
    Is it possible to still use ollama via langchain? I got a error when i use "if llmtype=="olllama"
    -- error message : Received unsupported message type for Ollama.
    (my_path) : LangChainDeprecationWarning: The method `BaseChatModel.__call__` was deprecated in langchain-core 0.1.7 and will be removed in 0.3.0. Use invoke instead.
    warn_deprecated(

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +1

      Did you change something in the codebase? Which line exactly produces this error? I have used the invoke method, to perform calls to the LLMs, so this kind of warning should not appear

  • @rgolanng
    @rgolanng 2 месяца назад

    look like llama cant generate low resource language like Vietnamese

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      no, but to honest most models have issues with that. You probably have to create the english documents first and then try to translate it

    • @rgolanng
      @rgolanng 2 месяца назад

      @@codingcrashcourses8533 when i first tried it since release, it failed to generate low resource language, but i just tested in Poe, it seems to work well now. Btw thank you, your content really help.

    • @rgolanng
      @rgolanng 2 месяца назад

      @@codingcrashcourses8533 I really appreciate your content. When i first test with togetherAI, it only responded with English, it seems work well now when i tested again

  • @mahoanghai3364
    @mahoanghai3364 2 месяца назад

    Cool. Thank a lot

  • @tawakalitolamide578
    @tawakalitolamide578 2 месяца назад

    Thank you for this video, I have been looking for a way to use local graphrag until I see this. Thank you so much.
    But I encountered an issue while using the neo4j browser. It keeps saying "Cannot load from URL 'file:///import/test_container.csv': Couldn't load the external resource at: file:///import/test_container.csv ()"
    I have tried all means to resolve this issue but the error keep persisting. Please I need your help.

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад +1

      You got this issue when running my code? I don´t use a test_container.csv file

    • @tawakalitolamide578
      @tawakalitolamide578 2 месяца назад

      @@codingcrashcourses8533 No, I did not get the issue while running your code. I already solved it. Thank you

  • @kevli6373
    @kevli6373 2 месяца назад +1

    for entity in entities.names:
    response = graph.query(
    """CALL db.index.fulltext.queryNodes('entity', $query, {limit:2})
    YIELD node,score
    CALL {
    WITH node
    MATCH (node)-[r:!MENTIONS]->(neighbor)
    RETURN node.id + ' - ' + type(r) + ' -> ' + neighbor.id AS output
    UNION ALL
    WITH node
    MATCH (node)

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      @@kevli6373 did you try openai models too? Or just the small 8b llama model?

    • @kevli6373
      @kevli6373 2 месяца назад

      @@codingcrashcourses8533
      I used only openai model.
      i got a message :
      ClientError: {code: Neo.ClientError.Procedure.ProcedureCallFailed} {message: Failed to invoke procedure db.index.fulltext.queryNodes: Caused by: java.lang.IllegalArgumentException: There is no such fulltext schema index: entity}
      Output is truncated. View as a scrollable element or open in a text editor. Adjust cell output settings...
      😂

    • @kevli6373
      @kevli6373 2 месяца назад

      @@codingcrashcourses8533 thank you reply. i used only openai model. i didnt try llama yet.

    • @kevli6373
      @kevli6373 2 месяца назад

      @@codingcrashcourses8533 I used only OpenAI gpt 4o mini not llama. In your case, Does it work well for both ?

    • @codingcrashcourses8533
      @codingcrashcourses8533  2 месяца назад

      @@kevli6373 I had issues with Llama 8B. Everything else worked fine :/. The LLM seems not be able to create/extract entities