Program

Sunday, 03 July 2022	Monday, 04 July 2022	Tuesday, 05 July 2022
	08:00 Registration and Preparation	08:30 Registration and Preparation
	09:00 Welcome and Keynote	09:00 Session 6: Image Coding
	10:00 Session 1: Video Compression	10:20 Coffee Break and Demos
	10:40 Coffee Break and Demos	11:00 Session 7: Film Look
	11:20 Session 2: SQoE	11:40 Session 8: 360° Video and Point Clouds
	12:00 Session 3: VVC Codec	12:20 Summary and Closure
	13:00 Lunch Break at Mensa	13:00 Networking Lunch at ML2
	14:00 Session 4: Image and Networks
	15:20 Poster Session + Coffee break	Departure
Arrival	16:20 Session 5: Video
	17:45 Networking Hike to Gabelbach
19:00 Pre-Event at BC club	19:45 Social Event at Gabelbach

Pre-School Event

The pre-school event will be held on Sunday, 03 July 2022 at BC-club (student club of Ilmenau).

Location. Starting time: 19:00

You will receive a Welcome Snack from the barbecue and will get free entry to the club’s Sunday event. This event is for PhD students and others to meet and greet and exchange before the actual summer school starts.

Keynote/Invited Lecture

Speaker: Christian Feldmann
Title: Video Coding in the Cloud – Challenges and Solutions
Abstract: After the rise of video streaming over the internet (OTT) more and more workflows for video transcoding and distribution are moving to the cloud. While this combination can be a match made in heaven, it is full of challenges like a multitude of different video and codecs, streaming container formats, subtitle formats, etc. The talk will give an overview of the transcoding landscape and the biggest challenges that we and our customers are facing. We will talk about codecs and device compatibility, cloud infrastructure, encoders and how some of the biggest names in streaming are trying to solve these problems.
About Christian Feldmann: He is a video coding engineer at Bitmovin working on next generation video coding technologies in the most recent video coding standards, such as HEVC, VVC and AV1. His main focus points at Bitmovin are practical application of video and transport standards in the scalable cloud transcoding solution. After his study of computer engineering at RWTH University Aachen, he completed his doctoral degree (PhD) at the Institut für Nachrichtentechnik (Institute for Communication Technologies) in Aachen. With his detailed experience in video coding, he is developing video coding technologies for the future of video coding. Christian participates in the standardization activities of the Moving Picture Experts Group (MPEG).

It is planned to have a 2 hour networking hike towards Hotel Gabelbach right after Session 5 on Monday. There, an included joint dinner will take place.

Session 1: Video Compression Performance

Christian Herglotz (FAU Erlangen-Nürnberg): Beyond Bjøntegaard: Limits of Video Compression Performance Comparisons
Sophie Pientka (Fraunhofer HHI): Rate-Distortion-Optimization for Deep Image Compression

Session 2: Sustainable QoE

Matthias Kränzler (FAU Erlangen-Nürnberg): Energy Efficient Video Decoding for VVC Using a Greedy Strategy Based Design Space Exploration
Geetha Ramasubbu (FAU Erlangen-Nürnberg): Investigation of the Impacts of Video and Encoding Parameters on the Encoding Energy of the HEVC Software Encoding Process

Session 3: Usage of the VVC Codec

Nicolas Horst (RWTH Aachen): New Reference Subpicture Padding Techniques for Use with Independently Decodable Subpictures
Martin Benjak (Leibniz Universität Hannover): Neural Network-based Error Concealment for B-Frames in VVC
Adam Wieckowski (Fraunhofer HHI): Encoding complexity analysis through the measurement of the search space

Session 4: Image and Networks

Simon Grosche (FAU Erlangen-Nürnberg): Applications of Non-Regular Image Sampling using Locally Fully Connected Reconstruction Network
Frank Sippel (FAU Erlangen-Nürnberg): Camera Array for Multispectral Imaging
Konstantin Schmidt (Fraunhofer IIS): Comparing Generative Adversarial Networks and Normalizing Flow for Super-Resolution
Ulrike Pestel-Schiller (Leibniz Universität Hannover): Determination of Relevant Hyperspectral Bands using a Spectrally Constrained CNN

Session 5: Video

Goluck Konuko (CentraleSupélec Paris): Ultra-low Bitrate Video Compression using Deep Image Animation
Marta Milovanovic (Telecom Paris): Hybrid Method for Patch-depth Transmission in MPEG Immersive Video
Smitha Lingadahalli Ravi (INSA Rennes): DIBR for Immersive Videos: Traditional Depth Estimators versus Learning-Based Depth Estimators

Session 6: Image Coding

Maximiliane Gruber (FAU Erlangen-Nürnberg): Domain Adaptation for Unknown Image Distortions
Shabhrish Reddy Uddehal (Leipzig University of Telecommunications): Image Segmentation for Improved Lossless Screen Content Compression
Marwa Tarchouli (INSA Rennes): Patch-based Image Coding with End-to-End Learned Codec using Overlapping
Daniel Gritzner (Leibniz Universität Hannover): Using Binary Space Partitioning for the Semantic Segmentation of Aerial Images

Session 7: Film Look

Dominik Keller (Technische Universität Ilmenau): Application of sensory evaluation techniques in film - How do expert and non-expert evaluations differ?
Zoubida Ameur (INSA Rennes): Deep-learning based film grain generation and removal

Session 8: 360° Video and Point Clouds

Viktoria Heimann (FAU Erlangen-Nürnberg): Point Cloud Upsampling Using Frequency-Based Models
Andy Regensky (FAU Erlangen-Nürnberg): Motion-Plane-Adaptive Inter Prediction in 360-Degree Video Coding

Poster Session

Fabian Brand (FAU Erlangen-Nürnberg): Rate-Distortion Optimized Deep Image Compression
Kristian Fischer (FAU Erlangen-Nürnberg): Boosting Neural Image Compression for Machines Using Latent Space Masking
Thibaut Meyer (RWTH Aachen): Point Cloud Attribute Coding using Graph Representations
Melan Vijayaratnam (Telecom Paris): Towards zero latency video transmission through frame extrapolation
Dominik Mehlem (RWTH Aachen): Entropy Coding of Neural Network Latent Space Coefficients for Point Clouds
Tomasz Konaszynski (AGH Krakow): Analysis of context factors affecting human perceptions of 360º video quality
Ashutosh Singla (Technische Universität Ilmenau): Holistic Assessment of 360° Video QoE

Demos

Hearing in Mixed Reality

A current trend in research and development is the extension of reality by adding artificial, virtual objects to realize mixed reality and augmented reality. The lab tour deals with methods and possibilities of measurement and auralization for the creation of virtual audio objects in the real listening environment. An audio system is presented in a practical way, in which all essential components for audio augmented reality are included, from recording to signal analysis and signal processing to playback. In addition to the headphone-based dynamic binaural audio reproduction, an (ambisonic-based) 22.2 loudspeaker reproduction will be presented, which gives an insight into the current loudspeaker-based sound reproduction of e.g. cinemas.

IVE for educational research

The Immersive Virtual Environments (IVE) presented in this demo are developed at TU Ilmenau to investigate the suitability of audio-visual IVEs for a “real-world” assessment of the influence of the visuospatial and acoustic environment on cognitive performance of adults and children in classroom-type environments. There are two different visual representations: 360° video and a CGI model, while we will compare both visualizations with regard to the influence on quality, listening effort in noise and role of spatial audio for different paradigms. The demo is part of a DFG-funded project within the SPP 2236 AUDICTIVE.

Co-Presence for Seniors

An overview of the goals of the CO-HUMANICS project funded by the Carl-Zeiss-Stiftung (www.co-humanics.de) and its current research stage will be given, focusing on the factors impacting QoE in AR/MR and robot-based telepresence.

Immersive Audiovisual Communication - Meeting in VR

IVEs promise to be an ideal medium for performing and studying group communication in a variety of situations. In an ongoing DFG project (as part of SPP 2236 AUDICTIVE), we aim to identify and gain a robust understanding of factors influencing plausibility and experience in multi-party audiovisual communication and interactive shared virtual environments. In this context, we assess plausibility, presence, quality as well as Quality of Experience (QoE) and develop methods for evaluation of such immersive systems. This demo shows a VR communication system that represents the basis for user studies conducted in our audiovisual test labs.

Mobile Groovability: How well does your phone groove?

In an industry-funded project from 2019 - 2021, we investigated the impact of smartphone acoustic reproduction on the perceived groove of music. Groove in this context is the human experience that motivates dance movements such as rhythmic nodding or foot tapping. In the present test, 8 different smartphones can be directly compared with each other based on various pieces of music.

HD studio direction in action

We are happy to show our fully functional HD studio suitable for offline and live production.

Student film productions

We encourage students to produce short films, movies, music videos, and audio productions in our labs. With regard to audio, voice-overs can be recorded and media productions can be edited. Some students projects are shown.

4K HDR Reference Screen

A professional reference monitor offers 4K HDR image quality, accuracy and consistency, making it perfect for demanding on-set, studio and post-production applications, including 4K digital film production. The capabilities are demonstrated with a few test sequences.