Evaluate LLMs with Language Model Evaluation Harness

Поделиться
HTML-код
  • Опубликовано: 11 май 2024
  • In this tutorial, I delve into the intricacies of evaluating large language models (LLMs) using the versatile Evaluation Harness tool. Explore how to rigorously test LLMs across diverse datasets and benchmarks, including HellaSWAG, TruthfulQA, Winogrande, and more. This video features the LLaMA 3 model by Meta AI and demonstrates step-by-step how to conduct evaluations directly in a Colab notebook, offering practical insights into AI model assessment.
    Don't forget to like, comment, and subscribe for more insights into the world of AI!
    GitHub Repo: github.com/AIAnytime/Eval-LLMs
    Join this channel to get access to perks:
    / @aianytime
    To further support the channel, you can contribute via the following methods:
    Bitcoin Address: 32zhmo5T9jvu8gJDGW3LTuKBM1KPMHoCsW
    UPI: sonu1000raw@ybl
    #openai #llm #ai
  • НаукаНаука

Комментарии • 11

  • @TheIITianExplorer
    @TheIITianExplorer 2 месяца назад +3

    I love you man, ❤
    You are awesome, keep uploading 😊

  • @joserfjunior8940
    @joserfjunior8940 2 месяца назад

    I LIKE THIS... nice job man !

  • @Techonsapevole
    @Techonsapevole Месяц назад

    Thanks, great LLM tips

  • @bdoriandasilva
    @bdoriandasilva Месяц назад

    nice! thank you for the video!

  • @muhammedajmalg6426
    @muhammedajmalg6426 2 месяца назад

    nice work

  • @sagarbhaskar8688
    @sagarbhaskar8688 13 дней назад

    do we have to add any dataset?

  • @A7_-
    @A7_- 8 дней назад

    Can i do it on llava model

  • @abhijoy.sarkar
    @abhijoy.sarkar 27 дней назад

    How to do it on whole mmlu?

  • @krishnapriya9881
    @krishnapriya9881 2 месяца назад

    PackageNotFoundError: No package metadata was found for bitsandbytes. I am getting this error even though bitsandbytes is installed and my cuda version is 12.1, please help me with this

  • @saumyajaiswal6585
    @saumyajaiswal6585 2 месяца назад

    What about langsmith?It does the same thing right?

  • @araara2142
    @araara2142 2 месяца назад +1

    I need rag chatbot part 2 video, please release, my exam is coming