//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
You can literally watch repression & displacement consolidate over fine-tuning: here's next-token probs across checkpoints of OLMo-3-7B-Think-SFT. Explicit words are repressed almost instantly but safer (displaced) alternatives emerge much later. It learns what not to say before what to say instead.
2mo
Ryan Heuser
Submitting this abstract to "Accelerationism Revisited", a symposium in Dublin. Mapping psychoanalytic topology in LLM base models → instruction-tuned → safety-tuned models. They progressively "displace" (in Freudian sense) censored content into adjacent semantics, even across hidden model layers.
3mo
Ryan Heuser