
Multimodal Model: Dataset/Training Strategy

Unified Multimodal Model: Understanding + Generation

Text-to-Image Diffusion: Structural / Semantic Control

Text-to-Image Diffusion: Structural / Semantic Control

Text-to-Image Diffusion: Personalization & Concept Control

Text-to-Image Diffusion: Personalization & Concept Control

Text-to-Image Diffusion: Structural / Semantic Control

Diffusion

Vision-Language Alignment

Computer Vision: Vision Transformer (ViT)

Image-to-Image Diffusion: Style Transfer

Semantic Image Segmentation

Text-to-Image Diffusion: Structural / Semantic Control