Time-Interval Machine, ID-Aware Movie Descriptions, and Story Summarization | Multimodal Weekly 56
- Published: Oct 2, 2024
- In the 56th session of Multimodal Weekly, we have three exciting presentations across different video understanding tasks: action recognition, video description, and video summarization.
✅ Jacob Chalk and Jaesung Huh will discuss the Time Interval Machine (TIM), which addresses the interplay between the two modalities in long videos by explicitly modeling the temporal extents of audio and visual events.
Follow Jacob: jacobchalk.git...
Follow Jaesung: www.robots.ox....
TIM: jacobchalk.git...
✅ Haran Raajesh and Naveen Reddy Desanur will discuss the Movie-Identity Captioner (MICap), a new single-stage approach that can seamlessly switch between ID-aware caption generation and fill-in-the-blanks when given a caption with blanks.
Follow Haran: haran71.github...
Follow Naveen: dnaveenr.githu...
MICap: katha-ai.githu...
✅ Aditya Kumar Singh and Dhruv Srivastava will discuss their work "Previously on ..." From Recaps to Story Summarization, which tackles multimodal story summarization by leveraging TV episode recaps: short video sequences interweaving key story moments from previous episodes to bring viewers up to speed.
Follow Aditya: rodosingh.gith...
Follow Dhruv: www.github.com...
Recap Story: katha-ai.githu...
Timestamps:
00:10 Introduction
03:52 Jacob & Jaesung start
05:15 Audio and visual labels
06:22 Current recognition approaches fail to utilize the true context
07:20 Introducing TIM
09:00 TIM - the full picture
10:43 Encoding time intervals
11:25 Qualitative results
12:02 Recognition results
13:14 Adapting TIM for detection
14:43 Detection results
15:03 Analyzing time intervals
18:12 Q&A with Jacob
21:20 Naveen and Haran start
23:35 Audio descriptions
24:02 Identity aware captioning
25:35 Large-scale movie description challenge: Fill-in-the-blanks and full captioning tasks
26:14 Challenging example
26:48 Method overview: movie-identity captioner
27:32 Method (step 1: feature extraction)
28:20 Method (step 2: creation of captioning memory)
29:36 Method (step 3: causal shared decoder)
31:06 iSPICE
32:51 SoTA results
33:07 Attention analysis
34:35 Q&A with Naveen and Haran
43:03 Aditya and Dhruv start
43:50 Goal and key idea
44:44 Motivation
46:07 PlotSnap dataset
47:08 How to construct story-summary labels?
48:35 TaleSumm - our approach for story summarization
52:47 Experiments and ablations
54:32 Qualitative analysis
57:10 Q&A with Aditya and Dhruv
01:02:20 Conclusion
Join the Multimodal Minds community on Discord to receive an invite for future webinars.