Vision Datasets

Image-to-text is similar to image captioning

Image-to-text or text-to-image

LAION-5B: A NEW ERA OF OPEN LARGE-SCALE MULTI-MODAL DATASETS

Imagen