Fooling OpenAI’s CLIP Vision system

OpenAI have developed a computer vision system called CLIP to recognize objects, images, and text, but this leaves it vulnerable to be easily confused by  text labels. 

“CLIP’s multimodal neurons generalize across the literal and the iconic, which may be a double-edged sword. Through a series of carefully-constructed experiments, we demonstrate that we can exploit this reductive behavior to fool the model into making absurd classifications. We have observed that the excitations of the neurons in CLIP are often controllable by its response to images of text, providing a simple vector of attacking the model.”

AI Memes by imgFlip

This AI meme generator autocompletes popular meme templates using Machine Learning. The Neural Network model was trained on memes created by Imgflip users. The generator has an easy-to-use interface (pictured top) where you can choose a template and add optional keywords to influence the generated text. Here is a result I got using the ‘Distracted Boyfriend’ meme, along with some of the most popular results from the AI Memes stream.