Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
Post
👀 Check out @marcelhussing.bsky.social and @liv-daliberti.bsky.social's amazing new work on behavioral consistency! ❓The challenge? Retraining an RL agent might give you a completely different policy than before! This makes everything harder, as we never quite know whether we simply got unlucky 🤔
19d
19d
🚨 New Preprint Alert: Behavior-Consistent Deep Reinforcement Learning 🚨 TLDR: We introduce an approach that achieves behavioral similarity across independent algorithm executions in continuous state-action space deep RL.
Claas Voelcker
Marcel Hussing