1/ 🚗 🌏 What if an autonomous vehicle could move to a new city without collecting a single human demonstration in that city?
I am so excited to introduce our new work: Learning to Drive in New Cities Without Human Demonstrations.
2/ 🤔 Autonomous vehicles now outperform humans within specific operating regions, but their deployment to new cities remains
costly and slow.
Key bottleneck: Collecting human demonstrations from new cities.
8/ :heart: Thanks to my amazing collaborators @saeedrmd.bsky.social , @daphne-cornelisse.bsky.social @bidiptas13.bsky.social @alexdgoldie.bsky.social @jfoerst.bsky.social @shimon8282.bsky.social. Thanks to all colleagues for the helpful discussions.
If you’re into AVs / RL, we’d love your thoughts!
6/ 🔬 We also analyze:
• role of behavioral priors
• necessity of target-city maps
• comparison to methods that do use target-city demos
• generalization across cities
• sensitivity to KL strength
• evaluation under non-self-play agents
• effect of map mirroring