3/ š” However, the lane-level map and traffic meta-information are prevalent and inexpensive.
Our solution: Adapting driving policies to new cities by map-based self-play multi-agent reinforcement learning.
ā Target-city human trajectories. ā Expensive data collection loop.