Track: WebRTC and Real-Time Applications |
Advanced Reference Structures for Scalable Video in Real-Time Applications |
Video bitstreams for RTC, especially when it comes to multi-way conferencing, often have temporal and/or spatial scalability which allows a back-end component to selectively forward a suitable frame rate and resolution to each receiver based on their current circumstances. This presentation outlines how such features are accomplished by setting up “reference structures”, i.e. how video frames use the reconstructed state from previous frames in such a way as to allow some frames to be skipped or facilitate efficient transmission of multiple resolutions. We’ll also cover the fixed set of structures that are supported by the WebRTC API today via “scalability modes”. Then we’ll have a look at what is coming down the line from the latest W3C working group discussions. In particular WebCodecs and RtpTransport that will allow an application to unbundle video encoding from the WebRTC monolith and create fully custom per-frame adaptive reference structures, unlocking many new use cases. This is somewhat of a paradigm shift, and we'd love to hear feedback from the developer community! |
|
Presentation Video |
Presentation Notes |
Sprang-AdvancedReferenceStructuresForScalableVideoInRealtimeApplications1.pdf |