//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
π—‘π—˜π—ͺ π—£π—”π—£π—˜π—₯:Language models recognize dropout and Gaussian noise applied to their activations. The team introduced an a-semantic perturbation into a language model. See what happened ➑️ lawzero.org/en/publicati... #AISafety #MLSky #LLM #LawZero @yoshuabengio.bsky.social
28d
We provide evidence that language models can detect, localize and, to a certain degree, verbalize the difference between perturbations applied to their activations. More precisely, we either (a) mask ...
lawzero.org
LawZero | Language Models Recognize Dropout and Gaussian Noise Applied to Their Activations
LawZero - LoiZΓ©ro