Research
Guiding Text-to-Image Diffusion Model Towards Grounded Generation
Ziyi Li*, Qinye Zhou*, Xiaoyun Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie
International Conference on Computer Vision (ICCV), 2023
We propose to augment a pre-trained text-to-image diffusion model with open-vocabulary object grounding, i.e., simultaneously generating images and segmentation masks for the visual entities described in the text prompt.
A Simple Plugin for Transforming Images to Arbitrary Scales
Qinye Zhou*, Ziyi Li*, Weidi Xie, Xiaoyun Zhang, Yanfeng Wang, Ya Zhang
British Machine Vision Conference (BMVC), 2022
We propose a general plugin that can be inserted into existing super-resolution models to extend them to Arbitrary Resolution Image Scaling, hence the name ARIS.