Dr. Manuel Kansy

Associate Research Scientist, DisneyResearch|Studios

About Me

I am an associate research scientist at DisneyResearch|Studios. Specifically, I am part of the Facial VFX research team led by Dr. Derek Bradley.

Previously, from 2021 to 2024, I completed my PhD at the Computer Graphics Laboratory at ETH Zurich in cooperation with DisneyResearch|Studios, where I was supervised by Prof. Dr. Markus Gross and Dr. Romann M. Weber. My research interests span various areas of deep learning, especially generative modeling. I find it most fascinating and satisfying to work on projects that interact with the environment in some way, i.e., where I can see, hear, or feel their results. In recent years, I have focused mostly on face-related image and video generation tasks.

Before starting my PhD studies, I completed my master’s degree in Robotics, Cognition, Intelligence at the Technical University of Munich. During that time, I became really interested in deep learning and had the privilege of working on my master’s thesis and several other projects in Prof. Dr. Laura Leal-Taixé’s Dynamic Vision and Learning Group. Prior to that, I completed my bachelor’s degree in IT-Automotive at the Baden-Wuerttemberg Cooperative State University (DHBW) Stuttgart in cooperation with Robert Bosch GmbH.

Recent News

01.01.2025

I started a new position as Associate Research Scientist at DisneyResearch|Studios.

03.12.2024

I successfully defended my PhD.

Publications

Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion

Preprint

We propose motion-textual inversion, a general method to transfer the semantic motion of a given reference motion video to given target images. We thereby optimize a motion representation composed of a set of text/image embedding tokens using a frozen, pre-trained image-to-video diffusion model. Our method generalizes across various domains and supports multiple types of motions, including full-body, face, camera, and even hand-crafted motions.

Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Jacek Naruniec (DisneyResearch|Studios), Christopher Schroers (DisneyResearch|Studios), Markus Gross (DisneyResearch|Studios / ETH Zurich), Romann M. Weber (DisneyResearch|Studios)

Project Page   —   Paper

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models

Preprint

We show that applying classifier-free guidance (CFG) does not require any specific training procedure (e.g., inserting a null condition during training), and CFG can be extended to a more general method that is applicable to any diffusion model, including unconditional ones.

Seyedmorteza Sadat (DisneyResearch|Studios / ETH Zurich), Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Otmar Hilliges (ETH Zurich), Romann M. Weber (DisneyResearch|Studios)

Paper

Controllable Inversion of Black-Box Face Recognition Models via Diffusion

IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2023

We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e., the black-box setting). Our method, the identity denoising diffusion probabilistic model (ID3PM), leverages the stochastic nature of the denoising diffusion process to produce high-quality, identity-preserving face images with various backgrounds, lighting, poses, and expressions.

Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Anton Raël (ETH Zurich), Graziana Mignone (DisneyResearch|Studios), Jacek Naruniec (DisneyResearch|Studios), Christopher Schroers (DisneyResearch|Studios), Markus Gross (DisneyResearch|Studios / ETH Zurich), Romann M. Weber (DisneyResearch|Studios)

Project Page   —   Paper   —   Supplementary Material

Self-Supervised Effective Resolution Estimation with Adversarial Augmentations

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2023

The terms high-resolution and high-quality are not equivalent, and high-resolution does not always imply high-quality. In this paper, we motivate and precisely define the concept of effective resolution and propose a novel self-supervised learning scheme to train a neural network for effective resolution estimation. We demonstrate that our method outperforms state-of-the-art image quality assessment methods in estimating the sharpness of real and generated human faces, despite using only unlabeled data during training.

Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Julian Balletshofer (DisneyResearch|Studios), Jacek Naruniec (DisneyResearch|Studios), Christopher Schroers (DisneyResearch|Studios), Graziana Mignone (DisneyResearch|Studios), Markus Gross (DisneyResearch|Studios / ETH Zurich), Romann M. Weber (DisneyResearch|Studios)

Project Page   —   Video   —   Paper   —   Supplementary Material

University Projects

Multiple Object Tracking with Spatio-Temporal Proposals

Master’s Thesis (2020 - 2021)

  • Adapted Faster R-CNN to produce spatio-temporal object proposals (STPs)
  • Adapted Message Passing Network (MPN) tracking framework to jointly perform detection and data association given the STPs
  • Evaluated different design choices and data sets to improve detection performance

Used technologies: Python, PyTorch, Ubuntu

Deep Learning for Video Analysis

Course: Master Practical (2019 - 2020)

  • Created dataset with drone videos of different interestingness levels
  • Implemented novel neural network architectures to predict drone video interestingness
  • Developed pipeline to output edited composite video given several raw videos as input

Used technologies: Python, PyTorch, Ubuntu

Reinforcement Learning for Robotics

Course: Reinforcement Learning for Robotics (2019 - 2020)

  • Created simple environments: Inverted pendulum, cart-pole
  • Implemented classic function approximators: Tabular, variable resolution (constant, linear combination of basis functions), Gaussian mixture model
  • Implemented and compared reinforcement learning and optimal control (MPC) methods

Used technologies: MATLAB, Simulink, macOS

Edge Region Inpainting for Video Stabilization

Course: Advanced Deep Learning for Computer Vision (2019)

  • Created synthetic dataset of unstabilized videos by applying transformations to videos
  • Reimplemented and extended image inpainting architecture to inpaint video edge regions
  • Compared results qualitatively and quantitatively to state-of-the-art methods

Used technologies: Python, PyTorch, Ubuntu

Various Projects During Bachelor

(2015 - 2018)

  • Semester 6 (Bachelor Thesis): Development and Validation of a Maneuver Strategy to Avoid Collision with Oncoming Traffic
  • Semester 5+6 (Student Research Project): Development of a Concept to Optically Localize a Measuring Device
  • Semester 5 (Semester Project): Development of a Testing Concept to Visualize Accidents with Oncoming Traffic and Validation of the Sensors Necessary for the Corresponding Safety Function
  • Semester 4 (Semester Project, Thailand): Integrating and Evaluating the Reduced Order Model of the Magnetic Circuit of the Flow Control Valve of a High Pressure Pump in a Simulation Software
  • Semester 3 (Semester Project): Enhancement of a Software to Calculate the Fuel Benefit of a Vehicle Equipped with the Coasting Function
  • Semester 2 (Semester Project): Development of a VBA Tool to Automatize the Creation of Software Component Release Notes

Used technologies: Simulink, MATLAB, C++, C#, Unity, Windows, macOS, iOS, Embedded Linux, Vehicle Deployment

Student Supervision

If you are an ETH student and are interested in doing your thesis in our group, feel free to reach out to me via email.

Bastian Amrhein

Master’s Thesis (Fall 2024)

Andrea Bionda

Master’s Thesis (Fall 2023)

Joel Neuner-Jehle

Master’s Thesis (Fall 2023)

Mathias Vogel

Semester Thesis (Spring 2023)

Danny Camenisch

Bachelor’s Thesis (Spring 2023)

Burim Dervishaj

Master’s Thesis (Spring 2023)

Christopher Raffl

Master’s Thesis (Fall 2022)

Federico Mantovani

Bachelor’s Thesis (Fall 2022)

Nayanika Debnath

Bachelor’s Thesis (Spring 2022)

David Meyer

Bachelor’s Thesis (Spring 2022)

Anton Raël

Master’s Thesis (Spring 2022)

Teaching

Backoffice TA (Fall 2024)

Head TA (Fall 2023)

Head TA (Fall 2023)

Head TA (Fall 2022)

Head TA (Fall 2022)

Regular TA (Spring 2022)

Regular TA (Fall 2021)

Other Activities

Student Council

Student organization (2016 - 2018)

  • Managed and represented student council as Vice President
  • Led and re-organized commission that decides over 500 000€ university budget
  • Helped obtain funding and machinery for the Formula Student team
  • Kept close contact with university administration
  • Held several other positions in the engineering faculty, local senate, student parliament, and student union
  • Was awarded the KOMMUNITY award (given to one student per faculty every year for exceptional extracurricular commitment; with cash prize)