//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
Introducing Hero’s Journey, meticulously designed to test inductive generalization in a fun text game. ⚖️Verdict: all LLMs we tested trail far behind humans when induction involves generalization across procedures!
1d
Jessy Li
Spotting the rule from past experience is one thing; acting on it correctly is another. To find out, we introduce HERO's JOURNEY🦸‍♀️ to test for the LLMs’ inductive reasoning ability in multi-step setups. We found models show signs of rule induction, but scratch the surface.😮
1d
Asher Zheng