1/ Excited to report we have a new paper out
@nature.com today! The bottom line: training data for LLMs does not just fall from the sky - it is created in the context of existing social political institutions - and that has consequences for LLM output.
nature.com/articles/s41...
Government-controlled media influences the output of large language models via their training data, and models queried in the languages of countries with lower media freedom show a stronger ...