š¤ How to extract knowledge from LLMs to train better RL agents?
š Our new paper (w. Q. Zheng, @mikaelhenaff.bsky.social, A. Zhang, A. Grover) studies LLM-driven feedback for NetHack!
Paper: arxiv.org/abs/2410.23022
Code: github.com/facebookrese...