Inlay

Research shows that the results of AI model safety evaluations can vary widely with how the evaluation is structured, with deployment configurations degrading measured safety by up to 37 percentage points. This highlights the urgent need for tailored testing and standardized safety benchmarks. https://arxiv.org/abs/2603.10044

arXiv link for "Safety Under Scaffolding: How Evaluation Conditions Shape Measured Safety"