Have you guys looked at the next generation of quantisation, e.g. ternary / 1.58-bit quantisation? It's a different technique from conventional quantisation because the weight matrices contain only -1, 0, and 1, so you eliminate matrix multiplication almost entirely. The intuition is that the combination may not bring quite as many benefits, but it might be interesting to see how it performs on CPU architectures, for instance.
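For intuition, here is a minimal sketch (plain NumPy, illustrative names, not any particular library's API) of why ternary weights remove multiplications: with entries restricted to {-1, 0, +1}, a matrix-vector product reduces to sums and differences of the inputs.

```python
import numpy as np

def ternary_matvec(W, x):
    """W: ternary weight matrix with entries in {-1, 0, 1}; x: input vector.

    Each output element is computed with additions and subtractions only:
    add the inputs where the weight is +1, subtract where it is -1, skip zeros.
    """
    out = np.zeros(W.shape[0], dtype=x.dtype)
    for i in range(W.shape[0]):
        row = W[i]
        out[i] = x[row == 1].sum() - x[row == -1].sum()
    return out

W = np.array([[1, 0, -1],
              [0, 1, 1]])
x = np.array([2.0, 3.0, 5.0])

# Sanity check: matches an ordinary matrix multiply on the same data.
assert np.allclose(ternary_matvec(W, x), W @ x)
```

A real kernel would vectorise this rather than loop per row, but the point stands: no multiply units are needed, which is why people are curious about CPU (and even ASIC) performance.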
Thanks! How did you manage to remove the surrounding text of the LLM response?
It's a side effect of fine-tuning on outputs that contain only the JSON, without any other text.
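To illustrate the point (this is a hypothetical training record, not the actual dataset used): when every target completion in the fine-tuning data is bare JSON with no surrounding prose, the model learns to emit only the JSON.

```python
import json

# Hypothetical fine-tuning record. The key detail is that the completion
# is nothing but JSON -- no "Sure, here are the entities:" preamble.
example = {
    "prompt": (
        "Extract named entities (CoNLL++ labels) from: "
        "By the close Yorkshire had turned that into a 37-run advantage ..."
    ),
    "completion": (
        '{"person": ["Such"], "organization": ["Yorkshire"], '
        '"location": [], "miscellaneous": []}'
    ),
}

# Because the completion is valid JSON on its own, it parses directly.
entities = json.loads(example["completion"])
print(entities["organization"])  # ['Yorkshire']
```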
So we cannot achieve this without fine-tuning? Llama2 keeps adding it all the time 🥲 @pieromolino_pb
Nice !
FINE-TUNED MODEL RESPONSE
Named Entity Recognition (CoNLL++)
{"person": ["Such"], "organization": ["Yorkshire"], "location": [], "miscellaneous": []}
Yeah, I am not impressed with the result of this fine-tuning.
The input text is: By the close Yorkshire had turned that into a 37-run advantage but off-spinner Such had scuttled their hopes, taking four for 24 in 48 balls and leaving them hanging on 119 for five and praying for rain.
Yorkshire in this case is a sports team, so organization is correct, and Such is a player, so both of the model's predictions are indeed correct.
I'd suggest trying to understand better what is going on next time.
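One quick way to sanity-check a result like this (a sketch; the gold labels below are taken from the discussion above, and the comparison logic is illustrative):

```python
import json

# Raw model output from the thread above.
raw = ('{"person": ["Such"], "organization": ["Yorkshire"], '
       '"location": [], "miscellaneous": []}')

# Gold annotations for this sentence, per the discussion: Such is a player
# (person) and Yorkshire is a team (organization).
gold = {"person": ["Such"], "organization": ["Yorkshire"],
        "location": [], "miscellaneous": []}

pred = json.loads(raw)

# Exact match per entity type; collect anything that disagrees.
mismatches = {k: (pred.get(k, []), v)
              for k, v in gold.items()
              if pred.get(k, []) != v}
assert not mismatches  # every label agrees with the gold annotation
```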
Found the real solution, @tankieslayer6927: click your icon at the top right of the screen here, then Settings, Advanced settings, Delete channel. Then go over to Google and do the same for your account there. Problem solved!