//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
šŸ¤” How to extract knowledge from LLMs to train better RL agents? šŸ“š Our new paper (w. Q. Zheng, @mikaelhenaff.bsky.social, A. Zhang, A. Grover) studies LLM-driven feedback for NetHack! Paper: arxiv.org/abs/2410.23022 Code: github.com/facebookrese...
Dec 19, 2024
Brandon Amos