Conversation

Fahim Farook

"Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation. (arXiv:2206.07771v2 [cs.CV] UPDATED)" — Synthesis of multiple types of content such as dance-to-music or text-to-image using a new diffusion mechanism, at fewer steps.

Paper: http://arxiv.org/abs/2206.07771
Code: https://github.com/l-yezhu/cdcd

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Examples of the input (left col…
0
1
0