(7/n) A standard DNN update rule is Markovian: each update uses only the current trial.
So we add an RNN learning rule (non-Markovian):
updates can depend on multi-trial history, capturing richer learning dynamics.
(You could also swap in Transformers/Mamba; the RNN is just the minimal upgrade.)
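
To make the Markovian vs. non-Markovian contrast concrete, here is a minimal NumPy sketch, not the actual model from this thread. Everything in it is an assumption for illustration: the trial features (stimulus, action, reward), the layer sizes, the random untrained weights, and the names `dnn_rule`, `rnn_rule`, and `run_rnn`.

```python
import numpy as np

rng = np.random.default_rng(0)
N_IN, N_HID, N_OUT = 3, 8, 4  # hypothetical sizes: trial features, hidden units, weights to update

# Random, untrained parameters; a real learning rule would be fit to behavior.
W_in = rng.normal(0.0, 0.5, (N_HID, N_IN))
W_rec = rng.normal(0.0, 0.5, (N_HID, N_HID))
W_out = rng.normal(0.0, 0.5, (N_OUT, N_HID))

def dnn_rule(x_t):
    """Markovian rule: the update is a function of the current trial only."""
    h = np.tanh(W_in @ x_t)
    return W_out @ h

def rnn_rule(x_t, h_prev):
    """Non-Markovian rule: the hidden state h carries earlier trials forward."""
    h = np.tanh(W_in @ x_t + W_rec @ h_prev)
    return W_out @ h, h

def run_rnn(trials):
    """Feed a trial sequence through the RNN rule; return the last update."""
    h = np.zeros(N_HID)
    for x_t in trials:
        delta_w, h = rnn_rule(x_t, h)
    return delta_w

# Two histories that differ in earlier trials but end on the same trial.
# Each trial is an assumed (stimulus, action, reward) feature vector.
last = np.array([1.0, 0.0, 1.0])
history_a = [np.array([0.0, 1.0, 0.0]), np.array([0.0, 1.0, 0.0]), last]
history_b = [np.array([1.0, 0.0, 1.0]), np.array([1.0, 1.0, 1.0]), last]

dnn_a, dnn_b = dnn_rule(history_a[-1]), dnn_rule(history_b[-1])
rnn_a, rnn_b = run_rnn(history_a), run_rnn(history_b)

print("DNN updates match:", np.allclose(dnn_a, dnn_b))
print("RNN updates match:", np.allclose(rnn_a, rnn_b))
```

The DNN rule cannot tell the two histories apart because it only ever sees the final trial. The RNN rule's update on that same trial will generally differ, since its hidden state summarizes what came before.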