Do instructions affect how LMs process and produce language?
☝️Not the way you think!
😲LMs barely change task information when processing a task sample. Instead, instructions shape how this information is accessed and expressed when producing output tokens.
#interpretability #nlproc (1/🧵)