Post
Research shows that measured AI model safety can vary widely with how an evaluation is structured, with some deployment configurations degrading measured safety by up to 37 percentage points. This underscores the need for tailored testing and standardized safety benchmarks. https://arxiv.org/abs/2603.10044
14d
Safety Under Scaffolding: How Evaluation Conditions Shape Measured Safety
arxiv.org
AI Firehose