Post
TLDR: Not reliably enough www.aisi.gov.uk/blog/reality...
17h
www.aisi.gov.uk
A new benchmark grounded in how real users actually probe AI identity during interactions – covering five languages, across text and speech.
RealityTest: Do AI systems disclose their identity when asked? | AISI Work
David Barnard-Wills