Inlay

Can steering remove LLM shortcuts without breaking legitimate LLM capabilities? In our @eaclmeeting.bsky.social paper, we show that conceptual bias is separable from concept detection; this means inference-time debiasing is possible with minimal capability loss.