I feel like this is up there with “My test suit silently prompt injects in an attempt to screw users who develop with LLMs” in terms of not being great.
Scoiattolo
the part where they say they're going to silently sabotage us based on completely opaque criteria