XF-Blog
ProjectMachine LearningdevelopmentAbout
MACHINE LEARNING PAPER NOTE
[Paper Note] Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
[Paper Note] Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Existing Problem

While MaskGIT-like methods offer fast inference, their overall performance isn’t great.

Innovation

Architecture

Feature Compression Layers

Micro-Conditioning Details

Masking Strategy

Training

Four Training Stages

Results

Related Works