Inlay

3/ 💡 However, the lane-level map and traffic meta-information are prevalent and inexpensive. Our solution: Adapting driving policies to new cities by map-based self-play multi-agent reinforcement learning. ❌ Target-city human trajectories. ❌ Expensive data collection loop.