//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
This suggests a shift in how we train agents: Instead of external critics or verifiers, 👉 Let agents learn by tracking their own uncertainty reduction. A step toward agents that reason about what they don’t know. (7/8)