Will AI become a confirmation bias machine?
AI can be a powerful tool for truth-seeking. Yet, people might prefer to use AI to confirm their pre-existing beliefs, and features of AI systems (eg sycophancy) may make AI effective at justifying what people want to believe.
osf.io/preprints/ps...
Our work on how gen ai is impacting the labor conditions and power of professional visual artists got a great writeup from Brian Merchant! www.bloodinthemachine.com/p/the-ai-inf...
Excited to share that our paper has been accepted to 𝗜𝗖𝗠𝗟 𝟮𝟬𝟮𝟲! 🎉
Multi-agent is everywhere today. But put frontier LLMs in a room where each holds a different piece of the puzzle, and they fail 70% of the time.
Here's why:
📄 Paper: arxiv.org/abs/2505.11556
Carnegie Mellon University authors from 12 different @cmu.edu depts contributed to 76 #CHI2026 papers:
🛠️ New tools & systems
🧠 New frameworks & taxonomies
☑️ New ways to audit tech
🆕 New advancements in Accessibility, Health, Design & so much more... Details here: hcii.cmu.edu/news/cmu-chi...
Excited to be heading to Barcelona for #CHI2026 to host our workshop PoliSim: LLM Agent Simulation for Policy!
This year, we’ve seen incredible interest from researchers across HCI, NLP, CSS, and Policy. We accepted 25 outstanding papers, with 5 selected as Best Paper nominees.
An alarming new study reveals the dire impact AI is having on artists' livelihoods. It does offer some hope, too.
Multi-agent systems built on large language models (LLMs) are expected to enhance decision-making by pooling distributed information, yet systematically evaluating this capability has remained challen...
#oopsie
Anthropic secretly installs spyware when you install Claude Desktop
www.thatprivacyguy.com/blog/anthrop...
Willie Agnew
Yuxuan Li
Yuxuan Li
Anthropic's Claude Desktop silently installs a Native Messaging bridge into seven Chromium browsers, including browsers Anthropic's own documentation says it does not support, and browsers the user ha...
In 2019 it took 6 months for GPT-2 to go from too-dangerous to release. Claude Mythos/Fable ran that cycle in 2 months.
The AI PR singularity is amongst us. In 2029, there will be zero latency between a company saying a model is too dangerous and it being released.
CMU Human-Computer Interaction Institute
When was the last time you read a paper end-to-end for non-reviewing purposes? Seems like we write more than ever and read less than ever. (I'm guilty too)
GenAI tools are kind of like a ddos attack on our attention.
Far too much being produced to handle and justify consuming.
The overleaf git project for one of my #uist2026 submissions was assigned a uuid starting with "67". I believe this lets me mine the latest block on the gen alpha blockchain using the "Proof of Relevance" protocol. Who should I speak with about this