Inlay

6/ 🔬 We also analyze: • role of behavioral priors • necessity of target-city maps • comparison to methods that do use target-city demos • generalization across cities • sensitivity to KL strength • evaluation under non-self-play agents • effect of map mirroring