Ramdas Satyan - Content Aware Encoding for low latency live streaming encoders using deep learning

  • Published: 1 Feb 2024
  • Livestreaming has emerged as a captivating medium that is reshaping the way we engage, communicate, and entertain. Platforms like Twitch, YouTube Live, Facebook Live, and others have become go-to destinations for audiences seeking real-time experiences, immediate interaction, and the thrill of being part of a live event. Real-time transcoding plays a crucial role in delivering high-quality, compatible, and optimized video content to these audiences across a range of devices and platforms.
    Most of this transcoding today occurs using a fixed adaptive bitrate (ABR) ladder: a predetermined set of bitrate/resolution combinations applied to every encoded stream. This static, one-size-fits-all approach is blind to content type, leading to inefficient use of bandwidth and suboptimal video quality (VQ); a hypothetical ladder is sketched below.
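    For illustration only, a fixed ladder might look like the Python sketch below; the rungs are hypothetical, not taken from any particular service:

```python
# Hypothetical static ABR ladder: the same (resolution, bitrate)
# rungs are applied to every input, whether it is a near-static
# talking head or a fast-motion game stream.
STATIC_LADDER = [
    ((1920, 1080), 6000),  # width x height, kbps
    ((1280, 720),  3000),
    ((854, 480),   1500),
    ((640, 360),   800),
]

for (width, height), kbps in STATIC_LADDER:
    print(f"{width}x{height} @ {kbps} kbps")
```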
    To overcome these limitations, Netflix pioneered content-aware encoding (or per-shot encoding) for the VOD use case. The video content is analyzed offline shot by shot, and efficient encoding decisions such as bitrate, resolution, and quantization level are chosen from the convex hull of each shot (the best quality-bitrate points drawn from an ocean of encodes with different parameters) to maximize VQ while minimizing bandwidth requirements; the hull selection is illustrated below.
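    As a rough illustration of the hull idea (a sketch, not Netflix's actual pipeline), the Python snippet below keeps only the candidate encodes lying on the upper hull of the rate-quality plane; every number in it is invented:

```python
def cross(o, a, b):
    # 2D cross product of vectors o->a and o->b (positive = left turn).
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def upper_convex_hull(encodes):
    """Keep only encodes on the upper hull of the rate-quality plane,
    i.e., the best achievable quality at each spend of bits."""
    hull = []
    for p in sorted(encodes):  # sweep by ascending bitrate
        # Drop points that fall below the hull once p is added.
        while len(hull) >= 2 and cross(hull[-2], hull[-1], p) >= 0:
            hull.pop()
        hull.append(p)
    return hull

# Candidate encodes for one shot: (bitrate_kbps, vmaf, label).
candidates = [
    (800, 72.0, "360p"), (1400, 80.0, "480p qp=30"),
    (1500, 84.0, "480p qp=28"), (2900, 88.0, "720p qp=28"),
    (3000, 91.5, "720p qp=26"), (6000, 95.0, "1080p"),
]
for bitrate, vmaf, label in upper_convex_hull(candidates):
    print(f"{label}: {bitrate} kbps -> VMAF {vmaf}")
```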
    Live streaming does not have the luxury of the "infinite" latency that VOD offers, and real-time transcoding at scale puts an additional cap on processing capacity. This is a challenging problem that has attracted quite a bit of research in recent times. There are several approaches to content-aware encoding for low-latency encoders; finding the best possible quality-bitrate trade-off in real time with the available compute, while maintaining latency, is the name of the game.
    In this talk, we showcase our work using deep learning (DL) to predict the "optimal" bitrate for incoming video in real time from input and encoder-lookahead data. We train a fully connected regression network on input statistics (luma histogram) and encoder lookahead statistics (SAD, motion vector (MV), and activity histograms). The ground truth for our purpose, the "optimal" bitrate, is the bitrate that achieves a minimum VMAF of 90 (the minimum quality bar) for each shot chosen during training; see the sketch after this paragraph.
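    The talk does not spell out the exact architecture, so the PyTorch sketch below is only a plausible shape: the histogram bin count, layer widths, and the dummy training step are all assumptions.

```python
import torch
import torch.nn as nn

N_BINS = 32  # assumed bins per histogram; not specified in the talk

class BitratePredictor(nn.Module):
    """Fully connected regression net over four concatenated histograms:
    luma (from the input) plus SAD, MV, and activity (from the encoder
    lookahead)."""
    def __init__(self, n_bins=N_BINS):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(4 * n_bins, 128), nn.ReLU(),
            nn.Linear(128, 64), nn.ReLU(),
            nn.Linear(64, 1),  # predicted "optimal" bitrate (kbps)
        )

    def forward(self, x):
        return self.net(x)

model = BitratePredictor()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

# Dummy batch: the real target is, per shot, the lowest bitrate whose
# encode reaches VMAF >= 90 (the ground truth described in the talk).
features = torch.rand(8, 4 * N_BINS)
target_kbps = torch.rand(8, 1) * 6000
opt.zero_grad()
loss = loss_fn(model(features), target_kbps)
loss.backward()
opt.step()
```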
    This regression network is very light on compute and runs efficiently without affecting the real-time performance or density of the encoder. It needs a minimum of four frames of lookahead data to produce high prediction accuracy. We have trained our network to achieve maximum savings on low-complexity content with negligible loss in video quality, and to bypass very high-complexity content, as in the gating sketch below.
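    Putting those rules together, a hedged sketch of the run-time gating might look like the following; the SAD-based complexity measure, the bypass threshold, and the helper names are illustrative assumptions, not the shipped logic:

```python
MIN_LOOKAHEAD_FRAMES = 4  # the network needs at least four frames

def mean_sad(lookahead):
    # Stand-in complexity measure: average SAD across lookahead frames.
    return sum(frame["sad"] for frame in lookahead) / len(lookahead)

def cae_target_bitrate(lookahead, ladder_kbps, predict_kbps,
                       sad_bypass=5.0e6):
    """Pick the encoder's target bitrate for the next segment."""
    if len(lookahead) < MIN_LOOKAHEAD_FRAMES:
        return ladder_kbps          # too little data: keep the ladder rung
    if mean_sad(lookahead) > sad_bypass:
        return ladder_kbps          # very high complexity: bypass CAE
    # CAE only ever lowers the bitrate below the static ladder rung.
    return min(predict_kbps(lookahead), ladder_kbps)

# Usage with a dummy predictor standing in for the trained network:
frames = [{"sad": 1.2e6}, {"sad": 0.9e6}, {"sad": 1.1e6}, {"sad": 1.0e6}]
print(cae_target_bitrate(frames, 6000, lambda f: 3800))  # -> 3800
```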
    We tested this algorithm on a variety of video clips downloaded from Twitch.tv, and here are some of our results:
    1. Bitrate savings of more than 30% with less than 1 VMAF point of degradation for easy content such as talking heads and other low-complexity material.
    2. Bitrate savings of 9% on average for medium complexity content with less than 1 VMAF point degradation.
    3. Negligible savings for high complexity content (as the algorithm knows lowering bitrate would cause VQ degradation).
    The reasons for using a deep learning-based approach to predict the CAE bitrate, rather than traditional heuristics, are twofold:
    1. The nonlinear function a DL model learns delivers precise bitrate savings without degrading VQ.
    2. DL models can be trained or retrained by the content distributor on proprietary, domain-specific content sets to maximize bitrate savings while maintaining high VQ.
    This approach is applicable to both hardware- and software-based encoders that have access to the encoder lookahead statistics mentioned above. These bitrate savings can translate into substantial CDN bandwidth and storage cost savings for content distributors.
    This talk was presented at Demuxed '23, a conference for video nerds in San Francisco featuring amazing talks like this one.