Jason Liu
Jason Liu
  • Видео 44
  • Просмотров 29 222
Text Chunking in RAG: Essential Guide with Anton from ChromaDB
Anton from ChromaDB delivers a comprehensive guide on text chunking strategies for RAG applications, explaining why chunking remains essential even as LLM context windows expand and demonstrating how different approaches impact retrieval performance. He shares two key principles: maximize embedding model context window usage while avoiding grouping unrelated information, and emphasizes his core message that developers must "always, always look at your data" rather than rely on defaults. The presentation concludes with practical insights on measuring chunking effectiveness through recall metrics and how to tailor retrieval systems to specific tasks and datasets.
TIMESTAMPS
00:00 Introduction...
Просмотров: 672

Видео

Building Resilient RAG Systems for Large-Scale Data | Office Hours
Просмотров 1467 часов назад
Jason leads an office hours session exploring practical challenges in implementing AI systems, from clinical trial subject selection to organizational chatbots and production retrieval systems. He demonstrates relevancy labeling using the Instructor library, showing participants how to improve model reasoning and data labeling processes. The discussion concludes with his insights on memory syst...
Building Dynamic AI Memory Systems: Sam Whitmore's Approach to Personalization
Просмотров 1,7 тыс.12 часов назад
Sam Whitmore, CEO of New Computer, reveals how her team built a dynamic memory system for their AI assistant Dot. She breaks down their approach to making AI remember both specific details and abstract concepts about users, explaining their innovative pre-computation strategy and router structure for memory retrieval. Sam shares hands-on insights about optimizing for low latency, using syntheti...
anyone: ..., every good ai engineer:
Просмотров 29628 дней назад
improvingrag.com
Everything I know about AI Consulting
Просмотров 1,4 тыс.2 месяца назад
This is a video about Everything I know about AI Consulting 00:00 Introduction and Purpose of the Stream 00:30 Common Mistakes in Consulting 01:40 Focusing on Outcomes, Not Tasks 03:01 Creating Effective Landing Pages 03:44 Understanding Customer Pain Points 08:24 The Importance of Content and Free Calls 20:04 Structuring Proposals for Success 22:22 Pricing Strategies and Accountability 24:27 T...
The Overlooked Mistake in AI Development
Просмотров 1172 месяца назад
This is a video about The Overlooked Mistake in AI Development 00:00 Challenges in Evaluating Generation
The Painful Truth About Scaling Your Business
Просмотров 622 месяца назад
This is a video about The Painful Truth About Scaling Your Business 00:00 The Painful Truth About Scaling Your Business
The Realities of AI Consulting: Lessons Learned and Future Directions w/ Jason and Hamel
Просмотров 2222 месяца назад
This is a video about The Realities of AI Consulting: Lessons Learned and Future Directions w/ Jason and Hamel 00:00 Introduction to the Course 00:15 Understanding Biases in Teams 00:38 Challenges in Evaluating Generation 03:34 Systematizing Data Analysis 04:04 Segmentation and Clustering Methods 07:17 Consulting and Value-Based Pricing 14:19 The Reality of AI Consulting 19:10 The True Cost of ...
Hamel on LLM as a Judge
Просмотров 582 месяца назад
Hamel on LLM as a Judge
Eugene Yan on Using LLMs as Judges: Insights, Challenges, and Best Practices
Просмотров 6692 месяца назад
Eugene Yan on Using LLMs as Judges: Insights, Challenges, and Best Practices
My prediction on the future of RAG
Просмотров 1,6 тыс.3 месяца назад
My prediction on the future of RAG
Systematically Improving RAG applications
Просмотров 5883 месяца назад
Systematically Improving RAG applications
The rag playbook.
Просмотров 1,3 тыс.6 месяцев назад
The rag playbook.
Values vs Benefits
Просмотров 2906 месяцев назад
Values vs Benefits
working out
Просмотров 3206 месяцев назад
working out
Freedom
Просмотров 2076 месяцев назад
Freedom
Trascending the medium
Просмотров 2066 месяцев назад
Trascending the medium
Hiring and Talent
Просмотров 1,3 тыс.6 месяцев назад
Hiring and Talent
Picking Metrics, Setting Goals
Просмотров 7336 месяцев назад
Picking Metrics, Setting Goals
AI Agents: Looping vs Planning
Просмотров 3 тыс.6 месяцев назад
AI Agents: Looping vs Planning
Instructor Streaming
Просмотров 35810 месяцев назад
Instructor Streaming
Streaming in Instructor
Просмотров 40910 месяцев назад
Streaming in Instructor
Using Netflix to think about the future state of RAG
Просмотров 57510 месяцев назад
Using Netflix to think about the future state of RAG
You need to classify documents before trying to extract data
Просмотров 58710 месяцев назад
You need to classify documents before trying to extract data
Routing for RAG
Просмотров 50710 месяцев назад
Routing for RAG
Importance of Routing
Просмотров 82910 месяцев назад
Importance of Routing
Extracting Tables out of Images using GPT-Vision and Instructor
Просмотров 1,3 тыс.10 месяцев назад
Extracting Tables out of Images using GPT-Vision and Instructor

Комментарии

  • @lLvupKitchen
    @lLvupKitchen 3 часа назад

    This was one of the questions I couldn’t find answers to. Great video Jason

  • @EkShunya
    @EkShunya День назад

    please please please make more content we need competent ppl to make content cuz these marketers are flooding my feed with BS

  • @joesligo1516
    @joesligo1516 День назад

    There are so many glitches happening at once, please

  • @uzairmirza6964
    @uzairmirza6964 День назад

    More please!!!

  • @Jay-wx6jt
    @Jay-wx6jt 2 дня назад

    Excellent office hour. Thanks for sharing

  • @pixelperfectpravin
    @pixelperfectpravin 2 дня назад

    Such a great video - thanks Sam

  • @KevinKreger
    @KevinKreger 3 дня назад

    So much importance rides on the quality of the data even though the hand labeling is tedious! Love it.

  • @nithingeorge9025
    @nithingeorge9025 3 дня назад

    What happened to her right eye?

  • @Jay-wx6jt
    @Jay-wx6jt 4 дня назад

    Nice intro, and good way to share the rag course content possibly. Thanks Jason.

  • @moslehmahamud
    @moslehmahamud 26 дней назад

    Nothing beats data

  • @kennedygrimes6582
    @kennedygrimes6582 2 месяца назад

    Here are the top 10 takeaways: 1. Consultants vs. Contractors: Many people call themselves consultants but are essentially contractors, selling their time instead of outcomes. Successful consultants focus on outcomes rather than trading time for money. 2. Outcome-Based Selling: The key to scaling consulting is to move away from task-based selling to outcome-based pricing. Clients should pay for results, not tasks or hours worked. 3. Effective Landing Pages:A consultant's website or landing page should not focus on the consultant's skills but on the pain points of potential clients and how those can be resolved. It’s about framing problems and showing the outcomes. 4. Understanding Client Pain Points: To be successful, consultants need to identify and clearly articulate the specific pain points of their clients. This helps in crafting proposals that are relevant and outcomes-focused. 5. Multiple Pricing Options: Always offer multiple pricing options in a proposal. Each option should reflect different levels of commitment and accountability for the client’s outcome. This gives clients a choice and anchors higher pricing tiers. 6. Value-Based Pricing: Instead of attaching costs to time or tasks, structure pricing around the value delivered. Higher prices should reflect greater accountability and responsibility for delivering results. 7. Free Calls and Content: Jumping on free calls and producing content related to client problems is an effective strategy for gaining trust and positioning oneself as an expert. The content can serve as a sales flywheel. 8. Metrics and Accountability: To add value, it’s important to include metrics that show improvement and success. This helps clients see the impact of the work and builds trust in the consultant's ability to deliver. 9. Test and Iterate on Proposals: Proposals should not be static. Iterate and learn from them, offering tiers that provide different levels of service and accountability, and adjusting based on feedback and results. 10. Proof of Expertise: Building a library of case studies, testimonials, and content that proves past successes is crucial for long-term client trust and acquisition. Showing proof of concept and expertise through real-world examples drives conversions.

  • @michaelfloyd2525
    @michaelfloyd2525 2 месяца назад

    Would you change your discovery strategy on outbound vs inbound? It makes sense how you are able to ask all of those questions on quantifying the pains because you already know they want to work with you. If someone had no idea who you were and had never read your blogs or twitter, how would you approach that conversation? This content was awesome! Thanks

  • @calmcode-io
    @calmcode-io 2 месяца назад

    Interesting. From my experience with annotators, I found it was less about "firing people" who performed bad and perhaps more about "rewriting the guidelines". Sometimes an annotator takes the guidelines literally (actually not a bad thing) and as a result generates annotations that the guideline designer did not have in mind. This is also partially why it makes a tonne of sense for folks who write guidelines to also annotate on the task. It can also help to have an annotation interface where folks are able to flag a task/example as confusion so that it's easy to reflect. I have not tried it with LLMs, but my gut says that allowing the LLM to flag an example/task combo as confusing can also really help in designing a few solid prompts.

  • @giacomociarlini4557
    @giacomociarlini4557 2 месяца назад

    love this kind of video! learned so much

  • @alexwoxst
    @alexwoxst 2 месяца назад

    Jason, how come the your free content on yt etc is aimed at other consultants (competitors) and not your actual customers? It seems like a missed marketing opportunity unless your customers are actually AI consultants and not companies in industry?

  • @jamescoholan
    @jamescoholan 2 месяца назад

    If you think services aren't a great model, why don't you go full-time into SaaS?

  • @jamescoholan
    @jamescoholan 2 месяца назад

    This is great

  • @GualterioPulvirenti
    @GualterioPulvirenti 2 месяца назад

    man, this is a goldmine, thanks!

  • @MatijaGrcic
    @MatijaGrcic 2 месяца назад

    Great discussion, thanks.

  • @ajt_2023
    @ajt_2023 2 месяца назад

    Can you please add links to references (like Eugene's post) in the description? Great content!

  • @MatijaGrcic
    @MatijaGrcic 2 месяца назад

    Thanks, this was really good.

  • @pepingege
    @pepingege 2 месяца назад

    Great point. I'd love to hear examples of different decision-making reports in a future video!

  • @paulelliotco
    @paulelliotco 2 месяца назад

    So good!

  • @andrewwu2639
    @andrewwu2639 3 месяца назад

    Thanks, this really helps!

  • @GigaFro
    @GigaFro 3 месяца назад

    We have built exactly this. Happy to see someone who aligns with our hypothesis :)

  • @gReKusiek
    @gReKusiek 3 месяца назад

    Jason, excellent point on the higher level output for a RAG. Out of curiosity, what was the book you mentioned on sales engagement letters?

  • @elvissaravia
    @elvissaravia 3 месяца назад

    Interesting take on the report generation because of the obvious value for a company. Curious to hear from you how this would defer from using standard calls to a long-context LLM, if and when they get to a point of desent latency and long-retrieval understanding.

  • @coolboy4rocks
    @coolboy4rocks 3 месяца назад

    I thought someone was about to grab you from behind in the thumbnail 😭

  • @itsjustmeemman
    @itsjustmeemman 3 месяца назад

    turning the ambiguity of language and communication into concrete and well structured insights that can drive effective business decisions with the help of LLM's ?

    • @itsjustmeemman
      @itsjustmeemman 3 месяца назад

      though its kinda awesome to have a marketplace of these well structured "templates" made by professionals on their field of expertise compared to general purpose agents that might waste tokens (paths are already been built no need for an autonomous agent to search for one) or even maybe business books and the principles within it should include LLM templates as well lol books aren't interactive enough.

  • @BrianFernandez-Dili
    @BrianFernandez-Dili 3 месяца назад

    noticed this as well - general rag without domain expertise is lackluster. Using llm calls as a compute primitive to build extremely good verticalized software, much better.

  • @oneandonlyflow
    @oneandonlyflow 3 месяца назад

    repost?

  • @matthewrideout2677
    @matthewrideout2677 3 месяца назад

    I agree, but would replace "report" with "artifact" as a general output category.

  • @eyekey346
    @eyekey346 3 месяца назад

    Thanks for making Instrucor bro

  • @AmmarAdil
    @AmmarAdil 4 месяца назад

    Where can I find code for this

  • @richmond_a
    @richmond_a 5 месяцев назад

    Great converstation and insights guys!

  • @paulomtts
    @paulomtts 5 месяцев назад

    I just found Instructor. Cannot thank you enough! This is the most elegant way to program LLM interactions I've seen!

  • @divine7223
    @divine7223 5 месяцев назад

    who are coaching you when it comes to consulting?

  • @DolphinAnalyticsAI
    @DolphinAnalyticsAI 6 месяцев назад

    Its great to listen to your guys! I am a python / C# .Net guy, but the last 2 years been nothing but pure excitement learning everything i can about AI Agents and all the new design pattern and architetures that the LLM as the central magical black box. By the way your instructor class is wonderful... i have integrated that into my Agent Builder tools and its working great! thanks yall.

  • @shrimple-
    @shrimple- 6 месяцев назад

    What’s rag ?🙏

  • @Fishylocker
    @Fishylocker 6 месяцев назад

    slay

  • @asetkn
    @asetkn 6 месяцев назад

    Jason thank you for dropping all this wisdom on us! This short video format on separate topics is great.

  • @AGI-Bingo
    @AGI-Bingo 6 месяцев назад

    What do you think about Live-Updating RAG? Let's say some info changed in some client's watched doc, we would want to immediately prepare it for rag, remove the old outdated chunks, rechunk new data, and update the db/vdb. Ideally very quickly. I think it really sets apart fun-to-do projects and real world production systems. Would love it if you could cover this 💜 Thanks & All the best! ps - Your takes around benchmarking metrics are spot on. We need to develop better tooling for ourselves

  • @xumx
    @xumx 6 месяцев назад

    Is there any reference implementation on how to address routing/query classification? considering multi-turn, context aware conversations.

  • @AGI-Bingo
    @AGI-Bingo 6 месяцев назад

    Hey Jason, great takes! This whole thing almost sounds like trying to develop a timezones system from scratch haha. I'm guessing very soon RAG will be "solved" (sota foss) but we're still at the bleeding edge. I think most of the meta "non embeddings" questions you're asking could be answered by a small script over the db, if the llm knows the structure of you data and the right keys to search, it could write the extraction code and provide an answer with 2 steps. Also for the most common questions, the answers could be ragged and live updated

  • @kikithepupper6774
    @kikithepupper6774 6 месяцев назад

    Im not exactly sure where you were going with it either lol but i definitely agree with some things you said

  • @gibbz00
    @gibbz00 6 месяцев назад

    Very useful concepts to think about. Thank you for the video.

  • @3stdv93
    @3stdv93 6 месяцев назад

    This is so enlightened me, Thanks! ❤

  • @eduardovernier7628
    @eduardovernier7628 6 месяцев назад

    Hey Jason! I've had a lot of success using instructor with open ai models, but not much when using local models (such llama3 and Mi(s/x)tral via ollama for example). Any suggestions on how to get those to work well with structured data? Thanks!

  • @slyracoon23
    @slyracoon23 6 месяцев назад

    Amazing advice, all of these things struck internal conversations I have had

  • @simonayotte50
    @simonayotte50 6 месяцев назад

    Absolutely loved this one. 🫡