Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain.Personal account.
Chris Olah
Loading...
Political violence is bad. It usually begets more political violence.
Celebrating political violence is bad. It usually encourages more political violence, against various targets.
Campus shootings are bad. They make everyone on campus less safe.
It's bad that what I wrote here is controversial.
Applications for Anthropic AI Safety Fellows are due Aug 17!
US: job-boards.greenhouse.io/anthropic/jo...
UK: job-boards.greenhouse.io/anthropic/jo...
CA: job-boards.greenhouse.io/anthropic/jo...
It's a great opportunity to get mentorship and funding to work on safety for ~2 months.
The interpretability team will be mentoring more fellows this cycle, so if you're interested in interpretability, it might be worth applying!
Some of our fellows last cycle did this: arxiv.org/pdf/2507.21509