Transform images based on text instructions
Try on clothes on a person image
Generate a talking face video from an image and audio