Stack GAN
This paper's model architecture has many components, so I thought it would be good to layout the specifics of the architecture before implementing it.
Model Architecture

Stage-I GAN
Stage-2 GAN
Stage-I GAN
Input: Text embedding of the text description
Conditioning Augmentation (CA)
Purpose: Create vector that captures the meaning of with variations.
Process: → FC layer → → → sampled from this Gaussian distribution
Output:
Last updated
Was this helpful?