6/
Paper, dataset, and models here: arxiv.org/abs/2606.19468
huggingface.co/collections/...
github.com/johnsont4/na...
The narrative composition of web-scale LLM pretraining corpora remains largely unexplored even though narrative is a fundamental mode of human communication. We present the first fine-grained study of...