Xrd model now uses RL. This isn't a finished training run, and no online RL has been applied yet since I'm still working on the mod setup for that, but I'm very happy with the progress! Here's the model playing against itself. I think it's a massive step up from the last footage I posted.
[Video: the model playing against itself]
Pangaea
I really thought I was done pretraining models, but I guess results will have to wait a little bit longer.