Chatbots don't *want* anything and don't *recognize* anything.

LLMs can't take responsibility for their mistakes. When a human journalist puts their name on AI-written text, they take on that responsibility. Increasingly I see inaccurate and badly written news stories authored by AI, many of which have actual humans listed as authors or editors.

Feb 23, 2025

Mar 6, 2025

Jan 2, 2025

📚 How good are language models at utilising contexts in RAG scenarios? We release 🧙🏽‍♀️DRUID to facilitate studies of context usage in real-world scenarios. arxiv.org/abs/2412.17031 w/ @saravera.bsky.social, H. Yu, @rnv.bsky.social, C. Lioma, M. Maistro, @apepa.bsky.social and @iaugenstein.bsky.social ⭐️

What if LLMs knew when to stop? 🚧 HALT finetuning teaches LLMs to only generate content they’re confident is correct. 🔍 Insight: Post-training must be adjusted to the model’s capabilities. ⚖️ Tunable trade-off: Higher correctness 🔒 vs. More completeness 📝 🧵

Jun 6, 2025

Excited to announce the COLM 2025 keynote speakers: Shirley Ho, Nicholas Carlini, @lukezettlemoyer.bsky.social, and Tom Griffiths! See you in October in Montreal!

Carl T. Bergstrom

Jan 9, 2025

NEW: Luke Zettlemoyer (@lukezettlemoyer.bsky.social) of the University of Washington and Meta AI walks through different approaches to building multimodal foundation models. Watch the video: youtu.be/vTI4cziw84Q #NeuroAI2025 #AI #ML #LLMs #NeuroAI

My Keynote Talk entitled “Dungeons and DQNs: The Serious Quest for Open Ended Role Playing Game Playing Agents” is now online. youtu.be/EiurL9eyUNc In which I might or might not have said “I’m working to take the ‘ick’ out of ‘agentic’”

Mar 10, 2025

kicking off 2025 with our OLMo 2 tech report while payin homage to the sequelest of sequels 🫡 🚗 2 OLMo 2 Furious 🔥 is everythin we learned since OLMo 1, with deep dives into: 🚖 stable pretrain recipe 🚔 lr anneal 🤝 data curricula 🤝 soups 🚘 tulu post-train recipe 🚜 compute infra setup 👇🧵

Jun 10, 2025

Jan 3, 2025

Lovisa Hagström

Tim Franzmeyer

Conference on Language Modeling

Jan 8, 2025

#AI hallucinations are a problem; #UWAllen Ph.D. student @akariasai.bsky.social may have the answer. She was named a @techreviewjp.bsky.social Innovator Under 35 for her work to make #LLMs more transparent and useful—without making stuff up. #IU35 #AIforGood news.cs.washington.edu/2025/01/07/w...