The AI is “trained” on millions of existing images, descriptions, and captions available across the web. This is often how it “learns” what factors seem like and what they are identified as. Crowd of individuals rejoice national day of Sweden by using a flag. Swedish people celebrating a soccer group. https://pinterest.com/mirellanydickrdq4