6/ 🔬 We also analyze:
• role of behavioral priors
• necessity of target-city maps
• comparison to methods that do use target-city demos
• generalization across cities
• sensitivity to KL strength
• evaluation under non-self-play agents
• effect of map mirroring