We taught a quantum computer to learn from its own mistakes-- without stopping.
Our RL framework repurposes QEC detection events as learning signals to stabilize the system.
Result: Improved Logical Error Rates for both Surface and Color codes on Willow!
see arxiv: arxiv.org/abs/2511.08493