@@abhishekkrthakurreality is they won’t beat closed source if they continue using closed source for data tagging . You and I both know , since you are with HF, you won’t admit it .. may be focus on doing something better instead of replying me
@@prafullsharma8029done with better things. thanks for the suggestion. now its time to do the useless thing and respond to your useless comment. i believe you didnt read the paper where you can read what kind of data was it trained on. anyways, do something useful, its been a year and you are stuck on this without any proper references 🤣🤣🤣
@@abhishekkrthakur right , I believe you haven’t read the same, most of these models are just boasting with useless Benchmarks…. Using benchmarks according their model’s suitability.
@abhishek Thank you for this tutorial. Can you please suggest what should be added in the parser if the pdf file is know to have tables with data only?
I am confused with the fine tuning llama 2 video and %100 pdf video you have. I have hundreds pages of pdf that I want to make query and get correct answer. Do I need to follow your fine tuning llama2 video for peft and then follow this pdf video? Or just use this pdf video as reference? What is your opinion?
can you explain in the line of if he pdf document contains a mixture of tables and text in sequence such that the tables and text paragraphs make sense when are embedded together?
Thanks for this video. You say that you have the 7B model running on your local machine, can you please specify the specifications of your local machine?
I couldn't ask this on time during the stream xD. Could you share your thoughts about the debate of using proprietary APIs vs deploying your own open source LLM (I mean, taking into account costs, privacy, etc.)? Thanks!
Quite curious to know your local machine configuration. Is that something that can be disclosed? Or have you already mentioned it in the previous videos? The reason I am asking is currently I am operating with a 8 GB ram on windows 10 and that feels like config from a different era :) on which most of the things won't run.
@@abhishekkrthakur What are your views on Azure Open AI vs Open AI. My knowledge is Azure uses Open AI models as well, so is there a chance that the data on Azure Open AI has the same privacy issues as Open AI?
raise ConnectionError(e, request=request) requests.exceptions.ConnectionError: HTTPConnectionPool(host='0.0.0.0', port=6000): Max retries exceeded with url: / (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 111] Connection refused')) please help me with this
Please subscribe and like the video to help me keep motivated to make awesome videos like this one. :)
What a genius statement: Open source models are going to beat the closed source models soon, can't wait.💫
in dreams
@@prafullsharma8052 still in dreams? :)
@@abhishekkrthakurreality is they won’t beat closed source if they continue using closed source for data tagging . You and I both know , since you are with HF, you won’t admit it .. may be focus on doing something better instead of replying me
@@prafullsharma8029done with better things. thanks for the suggestion. now its time to do the useless thing and respond to your useless comment. i believe you didnt read the paper where you can read what kind of data was it trained on. anyways, do something useful, its been a year and you are stuck on this without any proper references 🤣🤣🤣
@@abhishekkrthakur right , I believe you haven’t read the same, most of these models are just boasting with useless Benchmarks…. Using benchmarks according their model’s suitability.
Amazing content, love it when the videos are long and Abhishek takes the time to explain and code everything.
Great video, I liked seeing you put this together and explain along the way. Very interesting to a novice like me. Thanks so much!
Crazy video! Subscribed!
If it could be i would like that in the first minute you show the result for knowing better what will you do in 38minutes 😂
ill take care of it in future videos :)
Very clean explanation!!!!
@abhishek Thank you for this tutorial. Can you please suggest what should be added in the parser if the pdf file is know to have tables with data only?
The `sentences.append(window)` in the `embed` function should go before the `if`.
Wow. Keep it up.
Amazing! Thank you.
I am confused with the fine tuning llama 2 video and %100 pdf video you have. I have hundreds pages of pdf that I want to make query and get correct answer. Do I need to follow your fine tuning llama2 video for peft and then follow this pdf video? Or just use this pdf video as reference? What is your opinion?
can you explain in the line of if he pdf document contains a mixture of tables and text in sequence such that the tables and text paragraphs make sense when are embedded together?
Abhishek, great videos! I know you are using rtx 3090 for your local setup. How are you able to load Falcon40B on it?
open killed the closed star 😎
Thanks for the tutorials, could you please do tutorial on multiple instance learning using Neural networks ?
Hidden gem. Can we also use this for other languages?
So cool!👍
Thanks for this video. You say that you have the 7B model running on your local machine, can you please specify the specifications of your local machine?
Great work, Abhishek! Any idea how to prevent randomly generated output by 7B model - Is prompt engineering the only solution?
Even I have the same question. Wasted a whole day trying to figure this out but without any luck.
Retreival Augmented Generation. The generated outputs should be checked against a vector DB to prevent hallucinations.
Sir, What is cheaper option? Using open ai API or deplying falcon 40 b on cloud?
how to get this kind of detailed terminal?
Want to work for hunggingface, how can I apply and get noticed? Awesome content btw!
I couldn't ask this on time during the stream xD. Could you share your thoughts about the debate of using proprietary APIs vs deploying your own open source LLM (I mean, taking into account costs, privacy, etc.)? Thanks!
I was about to ask the same question. Would really love to know from you Abhishek sir
this !, i'm considering making my thesis on this
Quite curious to know your local machine configuration. Is that something that can be disclosed? Or have you already mentioned it in the previous videos? The reason I am asking is currently I am operating with a 8 GB ram on windows 10 and that feels like config from a different era :) on which most of the things won't run.
i run code for my videos on a machine with 32gb ram and 3090 gpu.
hi. please help me. how to create custom model from many pdfs in Persian language? tank you.
Can we replace my gpt 4 calls in my deployed flask app with this? Offering clients far more usage for same price?
how i could add a webui to this code?
do you store the code in github?
Thank you very much sir
Can't we reduce the hallucination by reducing the temperature value??
sure. playing around with parameters is upto the viewers and is almost always helpful :)
thank you master.
Nice flexibility on offer here.
Can you please post the code as well
updated in description :)
@@abhishekkrthakur What are your views on Azure Open AI vs Open AI. My knowledge is Azure uses Open AI models as well, so is there a chance that the data on Azure Open AI has the same privacy issues as Open AI?
@@soodisin Let me help to answer, for Azure they will capsulate your data and won't gather it. Your data will not move anywhere.
@abhishekkrthakur this is not running on M2 Macbook locally. I am not using Docker.
raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPConnectionPool(host='0.0.0.0', port=6000): Max retries exceeded with url: / (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 111] Connection refused'))
please help me with this
did u get it resolved ?