Inlay

//

Profile

4/ 🚄 We introduce NO data Map-based self-play for Autonomous Driving (NOMAD), which adapts a driving policy to a new city using only the city's lane-level map + meta info. The policy is trained via KL-regularized self-play in a target-city simulator.

8/ :heart: Thanks to my amazing collaborators @saeedrmd.bsky.social , @daphne-cornelisse.bsky.social @bidiptas13.bsky.social @alexdgoldie.bsky.social @jfoerst.bsky.social @shimon8282.bsky.social. Thanks to all colleagues for the helpful discussions. If you’re into AVs / RL, we’d love your thoughts!

6/ 🔬 We also analyze: • role of behavioral priors • necessity of target-city maps • comparison to methods that do use target-city demos • generalization across cities • sensitivity to KL strength • evaluation under non-self-play agents • effect of map mirroring

7/ 🔮 NOMAD substantially narrows cross-city generalization gaps, supporting scalable deployment of autonomous driving systems across diverse environments and highlighting the promise of self-play MARL for improving safety and robustness. Paper: arxiv.org/abs/2602.15891