My article "What does it mean to understand AI?" is out now in Harvard Data Science Review! I discuss mechanistic interpretability, representation engineering, world models, and Potemkin understanding from philosophical perspectives.
hdsr.mitpress.mit.edu/pub/w1tfg5lx...