ChatGPT performed worst, repeatedly opening the empty mailbox at the beginning of the game.
If find these results especially astounding because Zork is a legendary game and the web is full of walkthroughs. Even if LLMs aren't prompted explicitly with "Play Zork", they should recognize it.