Shapley Values Explained | Interpretability for AI models, even LLMs!

AI Coffee Break with Letitia

6K views


Published: Jan 24, 2025

Comments • 29

@AICoffeeBreak 8 months ago ⁺⁵
Maybe I should have mentioned this in the video: A huge problem in AI interpretability is faithfulness vs. plausibility. Users like *plausible* explanations which look right to them ("aha, this makes sense!"). But sometimes they see things that are counterintuitive or attributions that make no sense to them. Then, even if the explanations are *faithful* to the model's workings, they will seem alien and weird, and users will dislike such a model, or blame it on the interpretability method.
Why is feature attribution seldom used in production? Because it can help users game the system. 😅 If you know your credit score is low because you have two cars, you will sell that extra car and increase your score.
8 months ago ⁺¹
It's inevitable that some folks will try to exploit any system, no matter how well-designed it is. Recently, we've seen some clever algorithms try to game the benchmarks, but they often fail spectacularly in the real world. It would be great to have a little extra help to detect these kinds of frauds. Something like 'humans' or 'reasoning agents' could be a good place to start.
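To make the credit-score example from the pinned comment concrete, here is a minimal sketch of exact Shapley values, computed by averaging each feature's marginal contribution over all feature orderings. The scoring function `toy_credit_score`, its feature names, and its numbers are hypothetical and chosen only for illustration; this is not the method or code from the video.

```python
from itertools import permutations


def shapley_values(value_fn, features):
    """Exact Shapley values: average marginal contribution over all orderings.

    value_fn takes a frozenset of "present" feature names and returns a score.
    Cost grows factorially with the number of features, so this only suits toys.
    """
    names = list(features)
    totals = {name: 0.0 for name in names}
    orderings = list(permutations(names))
    for order in orderings:
        present = set()
        previous = value_fn(frozenset(present))
        for name in order:
            present.add(name)
            current = value_fn(frozenset(present))
            totals[name] += current - previous  # marginal contribution
            previous = current
    return {name: total / len(orderings) for name, total in totals.items()}


def toy_credit_score(present):
    """Hypothetical scoring rule, invented for this example."""
    score = 600
    if "high_income" in present:
        score += 80
    if "two_cars" in present:
        score -= 40
    if "long_history" in present:
        score += 30
    # Interaction term: a second car hurts less when income is high.
    if "high_income" in present and "two_cars" in present:
        score += 20
    return score


features = ["high_income", "two_cars", "long_history"]
phi = shapley_values(toy_credit_score, features)
for name, value in phi.items():
    print(f"{name}: {value:+.1f}")
```

In this toy setup the attribution for `two_cars` is negative, which is exactly the kind of signal a user could act on by selling the car. The attributions also sum to the difference between the full score and the baseline score with no features present, which is the efficiency property of Shapley values.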
@DerPylz 8 months ago ⁺¹²
It's always great to see "old" ideas getting used for solving new problems. I had heard about Shapley values and was hoping you'd make a video explainer about it. Thanks!
@vietvu9714 8 months ago ⁺²
The explanation was farrrr better than anything I expected :D very well done
@MyrLin8 8 months ago ⁺²
Very nice training vid. gj. useful info. good examples and references.
@juanmanuelcirotorres6155 8 months ago ⁺⁴
A series about interpretability would be awesome
@AGI-Bingo 8 months ago ⁺²
This is really cool, I can imagine in the future we'll have really good interpretability tools, for example marking a piece of text from the LLM output and it will highlight the tokens from the context that influenced it the most ❤
@SU3D3 8 months ago ⁺³
Excellent! Always providing the goods.
@AICoffeeBreak 8 months ago ⁺²
Thank you!
@manolisnikolakakis7292 8 months ago ⁺³
Thanks for another great explanation! Good luck with your thesis :)
@AICoffeeBreak 8 months ago ⁺¹
Thank you!
@dianai988 8 months ago ⁺³
Interpretability was the rabbit hole that got me into deep learning, would love to see more content on this topic (and if you need ideas on things to explore, lmk) ♥ (also, SHAP was one of the earliest interpretability techniques I came across after meeting the researcher working on it at the University of Washington at a poster session--so great to see how far this work has come since then!)
@md.enamulhoq9389 8 months ago ⁺⁴
best of luck with your Thesis. Stay sound. Love You
@Nif3 8 months ago ⁺²
This is really interesting and your explanation was excellent, but... did that coffee bean really just wink at me?
@AICoffeeBreak 8 months ago ⁺²
@abhishekshakya6072 8 months ago ⁺¹
Thanks for referencing the mathematical equations from research papers. It really validates the authenticity of your work. I felt the video was rushed a bit. I was probably expecting a longer video with more examples.
But I understand you might have a time crunch with your thesis. Good luck ✌
@MachineLearningStreetTalk 8 months ago ⁺⁶
🔥🔥🔥
@MachineLearningStreetTalk 8 months ago ⁺⁶
"Neat"! Best part haha
@AICoffeeBreak 8 months ago ⁺⁴
@Ben_D. 8 months ago ⁺⁹
Came for the AI commentary. Stayed for the god level lipstick.
@yannickpezeu3419 8 months ago ⁺²
thanks !
@harumambaru 8 months ago ⁺²
are those acoustic boards for walls? RLHF pretty easy to get all the words as another eastern european english speaker
@AICoffeeBreak 8 months ago ⁺¹
Yes, that is acoustic foam. Otherwise I sound like I'm speaking from a bathroom. 🤭
@sifonios 8 months ago ⁺²
Hm. I would have liked to watch this but the background music is far too loud and very distracting. ... Ah it does stop after a while. Yes it is very interesting and useful for me :)
@AICoffeeBreak 8 months ago ⁺²
I agree, I noticed that too in the final pass. Will make it better next time.
@DerPylz 8 months ago ⁺²
Sorry, that's on me (her editor). Something got messed up in the audio mixing and we didn't notice it before uploading. Luckily, it's only during the introduction, so the main part of the video should be fine 😅
@gordonfreeman4357 8 months ago
Not gonna lie, I think that this is basically useless on autoregressive models.
@AICoffeeBreak 8 months ago ⁺⁸
👀 Don't leave us hanging here, explain your statement. 😅
