"Zero-shot Generation of Coherent Storybook from Plain Text Story using Diffusion Models. (arXiv:2302.03900v1 [cs.CV])" — A neural pipeline for generating a coherent storybook from the plain text of a story by leveraging a combination of a pre-trained Large Language Model and a text-guided Latent Diffusion Model to generate coherent images.