Check out our new work: MIRO. No more post-training alignment! We integrate human alignment right from the start, during pretraining! Results: ✨ 19x faster convergence ⚡ ✨ 370x less compute 💻 🔗 Explore the project: nicolas-dufour.github.io/miro/
With @nicolasdufour.bsky.social @arrijitghosh.bsky.social @vickykalogeiton.bsky.social @davidpicard.eurosky.social 🌐 Site nicolas-dufour.github.io/miro 📄 Paper arxiv.org/abs/2510.25897 🛠️ Git github.com/nicolas-dufour/MIRO 🤗 HF huggingface.co/nicolas-dufour/miro 🎨 Demo huggingface.co/spaces/nicol...
🎉 Our work MIRO is accepted to #ICML2026 @icmlconf.bsky.social. We integrate human preferences directly during pretraining with multi-reward conditioning. ⚡ MIRO is 19x faster than baselines and 370x cheaper at inference! 🤗 Try out the models: huggingface.co/spaces/nicol... See you in Seoul 🇰🇷!
Very proud of our recent work, kudos to the team! Read @davidpicard.bsky.social’s excellent post for more details or the paper arxiv.org/pdf/2502.21318
We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control. - 19x faster convergence ⚡ - 370x less FLOPS than FLUX-dev 📉
Thrilled to share that MIRO is accepted to ICML 2026 @icmlconf.bsky.social! 🎉 By training on the reward scores, we can simply condition the model on high rewards at inference time to guarantee top-tier, aligned outputs. We’ve updated our paper with some additional results!
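A rough sketch of the conditioning idea described in the post above, not the authors' implementation. The reward names, the [0, 1] score range, and every function name here are hypothetical, chosen only for illustration: during training the model sees the scores actually measured for each sample, and at inference the caller simply asks for high scores.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# Hypothetical reward names; the real set of rewards is not given in the posts.
REWARD_NAMES = ["aesthetic", "text_alignment"]


@dataclass
class Example:
    caption: str
    rewards: Dict[str, float]  # assumed: per-reward scores normalised to [0, 1]


def conditioning_vector(rewards: Dict[str, float]) -> List[float]:
    """Order the reward scores into a fixed-length vector for the model."""
    return [float(rewards[name]) for name in REWARD_NAMES]


def training_condition(example: Example) -> Tuple[str, List[float]]:
    # Training: condition on the scores measured for this sample,
    # whether they are high or low.
    return example.caption, conditioning_vector(example.rewards)


def inference_condition(
    caption: str,
    target: float = 1.0,
    overrides: Optional[Dict[str, float]] = None,
) -> Tuple[str, List[float]]:
    # Inference: request high rewards everywhere; overrides let the caller
    # trade one reward off against another.
    rewards = {name: target for name in REWARD_NAMES}
    rewards.update(overrides or {})
    return caption, conditioning_vector(rewards)
```

For example, `inference_condition("a cat", overrides={"aesthetic": 0.5})` would request maximal text alignment with a lower aesthetic target, which is one way to picture the controllable trade-offs mentioned in these posts.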
👀 arxiv.org/abs/2510.25897 Thread with all details coming soon!
Lucas Degeorge
Multi-reward conditioned text-to-image diffusion (ICML 2026)
huggingface.co
MIRO - a Hugging Face Space by nicolas-dufour
arxiv.org
Vicky Kalogeiton
The default paradigm of post-training text-to-image generators includes post-hoc selection of generated images, and subsequent training with one reward model to align the generator to the reward, typi...
arxiv.org
MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
nicolas-dufour.github.io
Nicolas Dufour
Train once, align many rewards. MIRO achieves 19× faster convergence and 370× less compute than FLUX while reaching GenEval score of 75. Controllable trade-offs at inference time.
Folks! If you are curious about the Generative Modeling via Drifting paper, but you find it difficult to understand → I wrote a different interpretation of it. It's called: "An Expectation-Maximization interpretation of Generative Modeling via Drifting" davidpicard.github.io/pdf/An_Expec...
🚨 arxiv.org/abs/2604.06129 PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer. This paper is the result of doing a lab-wide hackathon on an idea I've had for some time. Probably the paper with the highest number of authors I've ever done. It's a CVPR Findings 26. Thread 🧵👇
MIRO: Multi-Reward Conditioning for Efficient Text-to-Image Generation
Final note: I'm (we're) tempted to organize a challenge on that topic as a workshop at a CV conf. ImageNet is the only source of images allowed and then you compete to get the bold numbers. Do you think there would be people in for that? Do you think it would make for a nice competition?
David Picard
This paper introduces the Polynomial Mixer (PoM), a novel token mixing mechanism with linear complexity that serves as a drop-in replacement for self-attention. PoM aggregates input tokens into a comp...
arxiv.org
PoM: A Linear-Time Replacement for Attention with the Polynomial Mixer