Both leaders and followers integrate their partner's position into their decision-making, exerting asymmetric yet reciprocal influence on each other. Forward multi-agent RL (MARL) shows that simulated agents can learn the task and also develop stable leader–follower dynamics, suggesting that social roles can self-organize. (2/5)