Inlay

Are all rewards useful? Yes! Our new "leave-one-out" ablation shows that removing even a single reward drops performance. Even though these rewards are quite entangled, each one still provides unique, useful bits of information that the model needs to succeed.