LLMs can retrieve knowledge — but can they connect it in *creative* ways to solve problems?
Introducing CresOWLve 🦉, a new benchmark that evaluates creative problem-solving over real-world knowledge, using puzzles that require multiple creative thinking strategies.👇