Person:
Pérez Molina, Clara María

ORCID
0000-0001-8260-4155
Last name
Pérez Molina
First name
Clara María

Search results

Showing 1 - 1 of 1
  • Publication
    RGB-D-Fusion: Image Conditioned Depth Diffusion of Humanoid Subjects
    (IEEE Xplore, 2023-09-04) Kirch, Sascha; Olyunina, Valeria; Ondřej, Jan; Pagés, Rafael; Martín Gutiérrez, Sergio; Pérez Molina, Clara María; https://orcid.org/0000-0002-5578-7555; https://orcid.org/0009-0000-9766-5057; https://orcid.org/0000-0002-5409-1521; https://orcid.org/0000-0002-5691-9580
    We present RGB-D-Fusion, a multi-modal conditional denoising diffusion probabilistic model to generate high-resolution depth maps from low-resolution monocular RGB images of humanoid subjects. Accurately representing the human body in 3D is a very active research field given its wide variety of applications. Most 3D reconstruction algorithms rely on depth maps, either coming from low-resolution consumer-level depth sensors, or from monocular depth estimation from standard images. While many modern frameworks use VAEs or GANs for monocular depth estimation, we leverage recent advances in the field of denoising diffusion probabilistic models. We implement a multi-stage conditional diffusion model that first generates a low-resolution depth map conditioned on an image and then upsamples the depth map conditioned on a low-resolution RGB-D image. We further introduce a novel augmentation technique, depth noise augmentation, to increase the robustness of our super-resolution model. Lastly, we show how our method performs on a wide variety of humans with different body types, clothing and poses.