Chatbots don't *want* anything and don't *recognize* anything.
LLMs can't take responsibility for their mistakes. When a human journalist puts their name on AI-written text, they take on that responsibility.
Increasingly I see inaccurate and badly written news stories authored by AI, many of which have actual humans listed as authors or editors.
📚 How good are language models at utilising contexts in RAG scenarios?
We release 🧙🏽‍♀️ DRUID to facilitate studies of context usage in real-world scenarios.
arxiv.org/abs/2412.17031
w/ @saravera.bsky.social, H.Yu, @rnv.bsky.social, C.Lioma, M.Maistro, @apepa.bsky.social and @iaugenstein.bsky.social ⭐️
What if LLMs knew when to stop? 🚧
HALT finetuning teaches LLMs to only generate content they’re confident is correct.
🔍 Insight: Post-training must be adjusted to the model’s capabilities.
⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝
🧵
Excited to announce the COLM 2025 keynote speakers: Shirley Ho, Nicholas Carlini, @lukezettlemoyer.bsky.social, and Tom Griffiths!
See you in October in Montreal!
Carl T. Bergstrom
NEW: Luke Zettlemoyer (@lukezettlemoyer.bsky.social) of the University of Washington and Meta AI walks through different approaches to building multimodal foundation models.
Watch the video: youtu.be/vTI4cziw84Q
#NeuroAI2025 #AI #ML #LLMs #NeuroAI
My keynote talk, “Dungeons and DQNs: The Serious Quest for Open Ended Role Playing Game Playing Agents,” is now online.
youtu.be/EiurL9eyUNc
In which I might or might not have said “I’m working to take the ‘ick’ out of ‘agentic’”
#AI hallucinations are a problem; #UWAllen Ph.D. student @akariasai.bsky.social may have the answer. She was named a @techreviewjp.bsky.social Innovator Under 35 for her work to make #LLMs more transparent and useful—without making stuff up. #IU35 #AIforGood
news.cs.washington.edu/2025/01/07/w...