//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
Good: The UK Gov has an AI Security Institute (AISI) and a shiny website which publishes evaluations of frontier models: www.aisi.gov.uk/blog/our-eva... Bad: Said website features rookie accessibility issues.
We conducted cyber evaluations of Anthropic’s Claude Mythos Preview and found continued improvement in capture-the-flag (CTF) challenges and significant improvement on multi-step cyber-attack…
www.aisi.gov.uk
Our evaluation of Claude Mythos Preview’s cyber capabilities | AISI Work
1mo
Dan