as a comparison, this is what the model was doing a couple months ago
bsky.app/profile/topt...
Pangaea
new model, this time it's trained to predict further ahead in the future, hoping that if i tweak this right it will learn to plan ahead better and maybe execute real combos, there's already some improvement to sols short strings but it doesn't seem to be enough yet