Inlay

//

by @danabra.mov

by @danabra.mov

by @jimpick.com

+ new component

Post

Takeaway: reasoning LLMs are getting better and better on math and code—deterministic reasoning tasks. But we should also evaluate them on open-ended, inherently uncertain everyday reasoning! (9/10)