Inlay

Good: The UK Gov has an AI Security Institute (AISI) and a shiny website which publishes evaluations of frontier models: www.aisi.gov.uk/blog/our-eva... Bad: Said website features rookie accessibility issues.

We conducted cyber evaluations of Anthropic’s Claude Mythos Preview and found continued improvement in capture-the-flag (CTF) challenges and significant improvement on multi-step cyber-attack…