The Farooks

Conversation

Fahim Farook

"Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis. (arXiv:2212.05032v2 [cs.CV] UPDATED)" — Improving the compositional skills of text-to-image models; specifically, obtainining more accurate attribute binding and better image compositions by incorporating linguistic structures with the diffusion guidance process based on the controllable properties of manipulating cross-attention layers in diffusion-based models.

Paper: http://arxiv.org/abs/2212.05032
Code: https://github.com/weixi-feng/structured-diffusion-guidance

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Three challenging phenomena in …

Fahim Farook

Terms of Service