New preprint: "Decoding Alignment: A Critical Survey of LLM Development Initiatives through Value-setting and Data-centric Lens" 🔍 Beyond a survey, this is a presentation of a series of concerns on what we could frame as "corporate" alignment #llms #ai arxiv.org/abs/2508.16982
AI Alignment, primarily in the form of Reinforcement Learning from Human Feedback (RLHF), has been a cornerstone of the post-training phase in developing Large Language Models (LLMs). It has also been...