
Multimodal Model: Dataset / Training Strategy

Unified Multimodal Model: Understanding + Generation

Text-to-Image Diffusion: Structural / Semantic Control

Text-to-Image Diffusion: Structural / Semantic Control

Text-to-Image Diffusion: Personalization & Concept Control

Text-to-Image Diffusion: Personalization & Concept Control

Text-to-Image Diffusion: Structural / Semantic Control

Diffusion

Vision-Language Alignment

Computer Vision: Vision Transformer (ViT)

Image-to-Image Diffusion: Style Transfer

Semantic Image Segmentation

Text-to-Image Diffusion: Structural / Semantic Control

Survey: Domain Generalization

Instruction-based Image Editing

Domain Generalization Semantic Segmentation: CLOUDS