Hierarchical Text-Conditional Image Generation with CLIP Latents [pdf] | Heykuki News