Inlay

Profile

Check out our new work: MIRO No more post-training alignment! We integrate human alignment right from the start, during pretraining! Results: ✨ 19x faster convergence ⚡ ✨ 370x less compute 💻 🔗 Explore the project: nicolas-dufour.github.io/miro/

With @nicolasdufour.bsky.social @arrijitghosh.bsky.social @vickykalogeiton.bsky.social @davidpicard.eurosky.social 🌐Site nicolas-dufour.github.io/miro 📄Paper arxiv.org/abs/2510.25897 🛠️Git github.com/nicolas-dufour/MIRO 🤗HF huggingface.co/nicolas-dufour/miro 🎨Demo huggingface.co/spaces/nicol...

🎉 Our work MIRO is accepted to #ICML2026 @icmlconf.bsky.social We integrate human preferences directly during pretraining with multi-reward conditioning. ⚡MIRO is 19x faster than baselines and 370x cheaper at inference! 🤗 Try out the models: huggingface.co/spaces/nicol... See you in Seoul 🇰🇷 !

Very proud of our recent work, kudos to the team! Read @davidpicard.bsky.social’s excellent post for more details or the paper arxiv.org/pdf/2502.21318

7mo

24d

8mo

We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control. - 19x faster convergence ⚡ - 370x less FLOPS than FLUX-dev 📉

Thrilled to share that MIRO is accepted to ICML 2026 @icmlconf.bsky.social ! 🎉 By training on the reward scores, we can simply condition the model on high rewards at inference time to guarantee top-tier, aligned outputs. We’ve updated our paper with some additional results!

7mo

👀 arxiv.org/abs/2510.25897 Thread with all details coming soon!

Lucas Degeorge

25d

Multi-reward conditioned text-to-image diffusion (ICML 2026)

huggingface.co

MIRO - a Hugging Face Space by nicolas-dufour

7mo

arxiv.org

Vicky Kalogeiton

The default paradigm of post-training text-to-image generators includes post-hoc selection of generated images, and subsequent training with one reward model to align the generator to the reward, typi...

arxiv.org

MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

nicolas-dufour.github.io

Nicolas Dufour

Train once, align many rewards. MIRO achieves 19× faster convergence and 370× less compute than FLUX while reaching GenEval score of 75. Controllable trade-offs at inference time.

MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

Nicolas Dufour

25d

👏 Folks! If you are curious about the Generative Modeling via Drifting paper, but you find it difficult to understand → I wrote a different interpretation of it. It's called: "An Expectation-Maximization interpretation of Generative Modeling via Drifting" davidpicard.github.io/pdf/An_Expec...

🚨 arxiv.org/abs/2604.06129 PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer This paper is the result of doing a lab-wide hackathon on an idea I've had for some time. Probably the paper with the highest number of authors I've ever done. It's a CVPR Findings 26. Thread 🧵👇

MIRO: Multi-Reward Conditioning for Efficient Text-to-Image Generation

Train once, align many rewards. MIRO achieves 19× faster convergence and 370× less compute than FLUX while reaching GenEval score of 75. Controllable trade-offs at inference time.

nicolas-dufour.github.io

Final note: I'm (we're) tempted to organize a challenge on that topic as a workshop at a CV conf. ImageNet is the only source of images allowed and then you compete to get the bold numbers. Do you think there would be people in for that? Do you think it would make for a nice competition?

29d

2mo

David Picard

8mo

7mo

Nicolas Dufour

arxiv.org

MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

This paper introduces the Polynomial Mixer (PoM), a novel token mixing mechanism with linear complexity that serves as a drop-in replacement for self-attention. PoM aggregates input tokens into a comp...

arxiv.org

PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer