Nvidia AI Image Personalization Method Fits on a Floppy Disk and Takes 4 Minutes to Train - Decrypt

August 5, 2023

(Summary via Kagi)

Nvidia researchers have created a new text-to-image AI model called Perfusion that is significantly smaller and faster to train compared to existing tools. The 100KB model only takes 4 minutes to train, yet it can outperform larger models in terms of personalizing concepts. The key innovation is a “Key-Locking” technique that ties new concepts to general categories, allowing the model to flexibly portray personalized concepts while maintaining their core identity. The small size of Perfusion allows it to easily update only the parts that need to change when fine-tuning, whereas larger models have to retrain the entire model. Nvidia’s focus on efficient AI models like Perfusion could give the company an edge over competitors pouring billions into generative AI research.