Transcribe audio from microphone, files, or YouTube
Generate images from text prompts and reference images