Conversation

Fahim Farook

"Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models. (arXiv:2302.03900v1 [cs.CV])" — A neural pipeline for generating a coherent storybook from the plain text of a story by leveraging a combination of a pre-trained Large Language Model and a text-guided Latent Diffusion Model to generate coherent images.

Paper: http://arxiv.org/abs/2302.03900

#AI #CV #NewPaper #DeepLearning #MachineLearning

<<Find this useful? Please boost so that others can benefit too 🙂>>
Zero-shot generation example of…
0
1
0