I am a research scientist at Luma AI working on post-training of large multi-modal models. My research interests include various areas of deep learning and especially generative modeling. I always find it most fascinating and satisfying if I’m working on projects that interact with the environment in some way, i.e., where I can see/hear/feel its results. In recent years, I focused mostly on image and video generation and editing tasks.
Previously, I was an associate research scientist in the Facial VFX group at DisneyResearch|Studios led by Dr. Derek Bradley. From 2021 to 2024, I completed my PhD at the Computer Graphics Laboratory at ETH Zurich in cooperation with DisneyResearch|Studios, where I was supervised by Prof. Dr. Markus Gross and Dr. Romann M. Weber.
I earned my master’s in Robotics, Cognition, Intelligence at the Technical University of Munich, where I developed a strong interest in deep learning in Prof. Dr. Laura Leal-Taixé’s Dynamic Vision and Learning Group. Before that, I completed my bachelor’s in IT-Automotive at DHBW Stuttgart in collaboration with Robert Bosch GmbH.

I started a new position as Research Scientist at Luma AI.
I started a new position as Associate Research Scientist at DisneyResearch|Studios.
I successfully defended my PhD.

ACM SIGGRAPH (Conference Track), 2025
We propose motion-textual inversion, a general method to transfer the semantic motion of a given reference motion video to given target images. We thereby optimize a motion representation composed of a set of text/image embedding tokens using a frozen, pre-trained image-to-video diffusion model. Our method generalizes across various domains and supports multiple types of motions, including full-body, face, camera, and even hand-crafted motions.
Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Jacek Naruniec (DisneyResearch|Studios), Christopher Schroers (DisneyResearch|Studios), Markus Gross (DisneyResearch|Studios / ETH Zurich), Romann M. Weber (DisneyResearch|Studios)
Project Page — Paper — Supplementary Material

International Conference on Learning Representations (ICLR), 2025
We show that applying classifier-free guidance (CFG) does not require any specific training procedure (e.g., inserting a null condition during training), and CFG can be extended to a more general method that is applicable to any diffusion model, including unconditional ones.
Seyedmorteza Sadat (DisneyResearch|Studios / ETH Zurich), Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Otmar Hilliges (ETH Zurich), Romann M. Weber (DisneyResearch|Studios)

IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2023
We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e. black-box setting). Our method, the identity denoising diffusion probabilistic model (ID3PM), leverages the stochastic nature of the denoising diffusion process to produce high-quality, identity-preserving face images with various backgrounds, lighting, poses, and expressions.
Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Anton Raël (ETH Zurich), Graziana Mignone (DisneyResearch|Studios), Jacek Naruniec (DisneyResearch|Studios), Christopher Schroers (DisneyResearch|Studios), Markus Gross (DisneyResearch|Studios / ETH Zurich), Romann M. Weber (DisneyResearch|Studios)
Project Page — Paper — Supplementary Material

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, 2023
The terms high-resolution and high-quality are not equivalent, and high-resolution does not always imply high-quality. In this paper, we motivate and precisely define the concept of effective resolution and propose a novel self-supervised learning scheme to train a neural network for effective resolution estimation. We demonstrate that our method outperforms state-of-the-art image quality assessment methods in estimating the sharpness of real and generated human faces, despite using only unlabeled data during training.
Manuel Kansy (DisneyResearch|Studios / ETH Zurich), Julian Balletshofer (DisneyResearch|Studios), Jacek Naruniec (DisneyResearch|Studios), Christopher Schroers (DisneyResearch|Studios), Graziana Mignone (DisneyResearch|Studios), Markus Gross (DisneyResearch|Studios / ETH Zurich), Romann M. Weber (DisneyResearch|Studios)
Project Page — Video — Paper — Supplementary Material
Seyedmorteza Sadat, Manuel Kansy, Romann Weber
Seyedmorteza Sadat, Manuel Kansy, Romann Weber
Manuel Kansy, Jacek Naruniec, Christopher Schroers, Romann Weber
Jacek Naruniec, Andrea Bionda, Manuel Kansy, Christopher Schroers, Romann Weber
Manuel Kansy, Joel Neuner-Jehle, Jacek Naruniec, Romann Weber
Jacek Naruniec, Manuel Kansy, Graziana Mignone, Christopher Schroers, Romann Weber
Manuel Kansy, Anton Raël, Jacek Naruniec, Christopher Schroers, Romann Weber
Intern (06/2025 - 08/2025)
Intern (05/2025 - 11/2025)
Master’s Thesis (03/2025 - 09/2025)
PhD Student Co-Advisor (02/2025 - 12/2025)
PhD Student Co-Advisor (02/2025 - 12/2025)
Master’s Thesis (09/2024 - 03/2025)
Master’s Thesis + Intern (10/2023 - 08/2024)
Master’s Thesis (09/2023 - 03/2024)
Semester Thesis (02/2023 - 06/2023)
Bachelor’s Thesis (02/2023 - 08/2023)
Master’s Thesis (02/2023 - 08/2023)
Master’s Thesis (10/2022 - 04/2023)
Bachelor’s Thesis (09/2022 - 03/2023)
Intern (08/2022 - 11/2022)
Bachelor’s Thesis (02/2022 - 08/2022)
Bachelor’s Thesis (01/2022 - 07/2022)
Master’s Thesis + Intern (01/2022 - 12/2022)
Backoffice TA (Fall 2024)
Head TA (Fall 2023)
Head TA (Fall 2023)
Head TA (Fall 2022)
Head TA (Fall 2022)
Regular TA (Spring 2022)
Regular TA (Fall 2021)

Master’s Thesis (2020 - 2021)
Used technologies: Python, PyTorch, Ubuntu

Course: Master Practical (2019 - 2020)
Used technologies: Python, PyTorch, Ubuntu

Course: Reinforcement Learning for Robotics (2019 - 2020)
Used technologies: MATLAB, Simulink, MacOS

Course: Advanced Deep Learning for Computer Vision (2019)
Used technologies: Python, PyTorch, Ubuntu

(2015 - 2018)
Used technologies: Simulink, MATLAB, C++, C#, Unity, Windows, MacOS, iOS, Embedded Linux, Vehicle Deployment
Student organization (2016 - 2018)