Btw here’s a good deep dive into how LLMs actually work (and how they don’t, aka no thinking or reasoning going on): www.0xkato.xyz/how-llms-act...
A from-the-ground-up walkthrough of how modern LLMs work, from tokens to transformer blocks to the next-token loop
www.0xkato.xyz
Thomas Fuchs