Inlay

//

Post

Check out our new work: MIRO No more post-training alignment! We integrate human alignment right from the start, during pretraining! Results: ✨ 19x faster convergence ⚡ ✨ 370x less compute 💻 🔗 Explore the project: nicolas-dufour.github.io/miro/

7mo

Train once, align many rewards. MIRO achieves 19× faster convergence and 370× less compute than FLUX while reaching GenEval score of 75. Controllable trade-offs at inference time.

nicolas-dufour.github.io

MIRO: Multi-Reward Conditioning for Efficient Text-to-Image Generation

Lucas Degeorge

We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control. - 19x faster convergence ⚡ - 370x less FLOPS than FLUX-dev 📉

7mo

Nicolas Dufour