“what you end up learning by getting really good at predicting the next word is a separate question. And it's an empirical question that is hard to answer with just a priori speculation about what is or isn't possible to learn in that way.”
Episode of Many Minds: podcasts.apple.com/us/podcast/m...