//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
Check out our new work: MIRO No more post-training alignment! We integrate human alignment right from the start, during pretraining! Results: ✨ 19x faster convergence ⚡ ✨ 370x less compute 💻 🔗 Explore the project: nicolas-dufour.github.io/miro/
7mo
Train once, align many rewards. MIRO achieves 19× faster convergence and 370× less compute than FLUX while reaching GenEval score of 75. Controllable trade-offs at inference time.
nicolas-dufour.github.io
MIRO: Multi-Reward Conditioning for Efficient Text-to-Image Generation
Lucas Degeorge
We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control. - 19x faster convergence ⚡ - 370x less FLOPS than FLUX-dev 📉
7mo
Nicolas Dufour