-
Vision, Language and Reading group at the Computer Vision Center (CVC), in Barcelona, Spain. The position is initially for 3 years and linked to the “European Large Open Multi-Modal Foundation Models
-
international references in the field. Since 2017, the group has driven advances in end-to-end (sensorimotor) autonomous driving, developing the CIL and CIL++ models that learn directly from human driving
-
“Leveraging VIsual Foundation Models for Enhancement and Editing (ViFEE)” CALL FPI94-CVC The CVC offers one pre-doctoral fellowship linked to the project “Leveraging VIsual Foundation Models for Enhancement and
-
on developing new methods for color-specific conditional generation in multi-modal generative models. The candidate will work on exploiting the intrinsic knowledge about color within text-to-image (T2I) models