//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
LLM performance? šŸ“‰ Non-thinking models under 30% (with CoT), most thinking models under 60%. šŸ“‰ Models perform up to 17% worse on creative vs. factual questions. Crucially, models *can* retrieve the relevant facts — they just fail to form the creative connection between them.
2mo
Mete