Overview

The learning objective is to analyze selected research papers published at top computer vision and machine learning venues, with a key focus on identifying and discussing open problems and novel solutions in this space. The seminar achieves this through several components: reading papers, technical presentations, written analysis and critique summaries, class discussions, and exploration of potential research topics. We will discuss state-of-the-art literature on human-centric computer vision topics including, but not limited to, human pose estimation, hand and eye-gaze estimation, and generative modeling of detailed human activities.


Goal

The goal of the seminar is not only to familiarize students with exciting new research topics, but also to teach basic scientific writing and oral presentation skills. The seminar will have a different structure from regular seminars to encourage more discussion and a deeper learning experience.

We will treat papers as case studies and discuss them in depth in the seminar. Once per semester, every student will have to take one of the following roles:

  1. Presenter: Give a presentation about the paper that you read in depth.
  2. Reviewer: Write a critical review of the paper following this template (use ETH credentials to access the file).

All other students read the paper and submit questions they have about the paper before the presentation.


Schedule

Wk.  Date        TA                 Paper
2    26.09.2024  Chengwei Zheng     3D Gaussian Splatting for Real-Time Radiance Field Rendering
2    26.09.2024  Zijian Dong        Neural Strands: Learning Hair Geometry and Appearance from Multi-View Images
3    03.10.2024  Gengyan Li         NPGA: Neural Parametric Gaussian Avatars
3    03.10.2024  Mert Albaba        Eureka: Human-Level Reward Design via Coding Large Language Models
4    10.10.2024  Zijian Dong        GaussianHair: Hair Modeling and Rendering with Light-aware Gaussians
4    10.10.2024  Tianjian Jiang     Zero-1-to-3: Zero-shot One Image to 3D Object
5    17.10.2024  Egor Zakharov      DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis
5    17.10.2024  Egor Zakharov      Sapiens: Foundation for Human Vision Models
6    24.10.2024  Manuel Kaufmann    TryOnDiffusion: A Tale of Two UNets
6    24.10.2024  Vanessa Sklyarova  NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads
7    31.10.2024  Lixin Xue          Single-Shot High-Quality Facial Geometry and Skin Appearance Capture
7    31.10.2024  Yufeng Zheng       Dream-in-4D: A Unified Approach for Text- and Image-guided 4D Scene Generation
8    07.11.2024  --                 no seminar
9    14.11.2024  --                 no seminar
10   21.11.2024  Gengyan Li         GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians
10   21.11.2024  Tianjian Jiang     TripoSR: Fast 3D Object Reconstruction from a Single Image
11   28.11.2024  Manuel Kaufmann    Physical Non-inertial Poser (PNP): Modeling Non-inertial Effects in Sparse-inertial Human Motion Capture
11   28.11.2024  Yufeng Zheng       HOOD: Hierarchical Graphs for Generalized Modelling of Clothing Dynamics
12   05.12.2024  Lixin Xue          DUSt3R: Geometric 3D Vision Made Easy
12   05.12.2024  Mert Albaba        Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
13   12.12.2024  Chengwei Zheng     4K4D: Real-Time 4D View Synthesis at 4K Resolution
13   12.12.2024  Vanessa Sklyarova  HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles
14   19.12.2024  --                 no seminar