Images uploaded to the internet are being scraped at scale for ingestion into AI datasets. But the captions associated with the images were written fo

Image Synthesis Has an SEO Problem

submited by
Style Pass
2023-01-25 10:00:08

Images uploaded to the internet are being scraped at scale for ingestion into AI datasets. But the captions associated with the images were written for SEO purposes, and not for the benefit of machine learning systems. Badly-captioned images can, therefore, negatively influence the accuracy and usability of multimodal systems that are trained on them. Here we take a look at the problems, and some of the possible solutions.

Leave a Comment