Sébastien Bubeck on Phi-2 and the surprising power of small models

Synthetic Data with Digital Humans

Getting Modular with Language Models: Building, Reusing a Library of Experts for Task Generalization

DRAGON BALL: Sparking! ZERO - Fused Warriors Trailer [BUDOKAI TENKAICHI Series]

Sydnie Christmas blows Judges away singing 'My Way' | Semi-Finals | BGT 2024

I Mixed Every Cookie Into One Cookie

AI Forum 2023 | The Small Models Revolution

Microsoft Research

Просмотров 3,3 тыс.

Добавить в
- Мой плейлист
- Посмотреть позже
Поделиться

Поделиться

HTML-код

Размер видео:

Показать панель управления

Автовоспроизведение

Автоповтор

Опубликовано: 30 май 2024
I will discuss a new method we are pioneering at Microsoft Research to build smaller language models that exhibit many of the properties of the largest language models such as ChatGPT. The focus will be on our latest model, phi-1.5, which is a 1 billion parameters model that can rival competitor with 10 billion or more parameters.
Learn more about the AI Forum 2023 hosted by Microsoft Research Asia in collaboration with The University of Tokyo: www.microsoft.com/en-us/resea...
Наука

Комментарии • 5

@bwhit7919 2 месяца назад ⁺¹
This is brilliant. It’s tough to beat GPT-4. But if you make smaller, specialized models, I think it would be possible to beat GPT-4 on certain benchmarks. That’s what I hope the tech industry starts doing. Especially when 90% of the time I use Chat GPT it’s to write computer code.
@khangvutien2538 4 месяца назад ⁺²
1. I love the relax but precise style of this presentation
2. What we are learning here reminds me of my engineer thesis on PCA in 1975: to get significant eigen vectors, it is better to filter the data for meaningful samples 😅 or else there’s plenty of noise that waste computation time.
3. Question: in view of the lawsuit of the NYT against Microsoft and OpenAI, how can you make sure that the synthetic textbook-quality contents generated automatically by GPT-4 to train Phi-2 doesn’t contain litigious sentences?
@que_93 5 месяцев назад ⁺²
This is brilliant work and gives so much for us to think and work upon. I guess, "size doesn't always matter". Pun intended. And I am glad that you have made phi open-source. Thank you!
@ShubhamSinghYoutube 5 месяцев назад ⁺³
How do you ensure that the textbook quality scoring by GPT4 and GPT3.5 is reliable/ true?
@jeetmajumdar7588 5 месяцев назад ⁺¹
SLMs are good for individual purpose, but why not you building a gpt4 like llm model. Google just launched its gpt4 killer Gemini ai. Hope Microsoft will also come up with multimodal language model.

Следующие

Автовоспроизведение

Sébastien Bubeck on Phi-2 and the surprising power of small models

Sébastien Bubeck on Phi-2 and the surprising power of small models

Synthetic Data with Digital Humans

Synthetic Data with Digital Humans

Getting Modular with Language Models: Building, Reusing a Library of Experts for Task Generalization

Getting Modular with Language Models: Building, Reusing a Library of Experts for Task Generalization

DRAGON BALL: Sparking! ZERO - Fused Warriors Trailer [BUDOKAI TENKAICHI Series]

DRAGON BALL: Sparking! ZERO – Fused Warriors Trailer [BUDOKAI TENKAICHI Series]

Sydnie Christmas blows Judges away singing 'My Way' | Semi-Finals | BGT 2024

Sydnie Christmas blows Judges away singing 'My Way' | Semi-Finals | BGT 2024

I Mixed Every Cookie Into One Cookie

I Mixed Every Cookie Into One Cookie

Every Pixar Villain Ranked

Every Pixar Villain Ranked

Augmenting Human Cognition and Decision Making with AI

Augmenting Human Cognition and Decision Making with AI

How small Language Models in AI could reform Education | Roger Basler de Roca | TEDxSchaan

How small Language Models in AI could reform Education | Roger Basler de Roca | TEDxSchaan

Alien Megastructure Candidates - Not as Crazy as it Sounds!

Alien Megastructure Candidates – Not as Crazy as it Sounds!

The AI That's Changing Academia? Must-See for Researchers!

The AI That's Changing Academia? Must-See for Researchers!

15 crazy new JS framework features you don’t know yet

15 crazy new JS framework features you don’t know yet

What is Retrieval-Augmented Generation (RAG)?

What is Retrieval-Augmented Generation (RAG)?

Generative AI and Plural Governance: Mitigating Challenges and Surfacing Opportunities

Generative AI and Plural Governance: Mitigating Challenges and Surfacing Opportunities

Synthetic Data: Future of Data Science and AI

Synthetic Data: Future of Data Science and AI

GigaPath: Foundation Model for Digital Pathology

GigaPath: Foundation Model for Digital Pathology

Ультрабюджетный игровой ноутбук? 😮 Blackview Acebook 8

Ультрабюджетный игровой ноутбук? 😮 Blackview Acebook 8

Nokia 3310 versus Red Hot Ball

Nokia 3310 versus Red Hot Ball

Умные очки с камерой от RayBan и Meta #распаковка #умныйдом #техника #rayban #очки #meta #raybanmeta

Умные очки с камерой от RayBan и Meta #распаковка #умныйдом #техника #rayban #очки #meta #raybanmeta

Этот школьник ПО ПРИКОЛУ ПОЛОЖИЛ 6000 компов! 🖥️ #технологии #пк #вирус #хакер

Этот школьник ПО ПРИКОЛУ ПОЛОЖИЛ 6000 компов! 🖥️ #технологии #пк #вирус #хакер

AMD больше не конкурент для Intel

AMD больше не конкурент для Intel

Apple, как вас уделал Тюменский бренд CaseGuru? Конец удивил #caseguru #кейсгуру #наушники

Apple, как вас уделал Тюменский бренд CaseGuru? Конец удивил #caseguru #кейсгуру #наушники

Orzungizdagi telefon sizni kutmoqda! #shortvideo #smartphone #youtubeshorts#shorts

Orzungizdagi telefon sizni kutmoqda! #shortvideo #smartphone #youtubeshorts#shorts

ДЕШЕВЫЙ И НАДЕЖНЫЙ ЭЛЕКТРОСАМОКАТ ДЛЯ МОИХ 100кг

ДЕШЕВЫЙ И НАДЕЖНЫЙ ЭЛЕКТРОСАМОКАТ ДЛЯ МОИХ 100кг