Inlay

Profile

QUDs going multimodal! With MQUD, we can train models to generate scientific questions❓that are inquisitive and insightful enough to be answered in the scientific paper! Huge thanks to the many paper authors who contributed to our data. Check out @yatingwu.bsky.social’s work:

New work on LLM safety in medicine 💊! Drug names have fixed morphological structure, so is that exploited by LLMs? Kaijie's paper reveals over-generalization that may pose safety risks if not handled carefully 👇

Introducing Hero’s Journey, meticulously designed to test inductive generalization in a fun text game. ⚖️Verdict: all LLMs we tested trail far behind humans when induction involves generalization across procedures!

We are thrilled to present a detailed report describing the system built for the AAAI-26 AI review pilot, the survey results, and a new benchmark that was created to assess the capabilities of the system. Read the full article: arxiv.org/pdf/2604.13940

CosmicAI personnel contributed to the AAAI-26 AI review pilot, which generated automatic AI reviews of all research papers submitted to the conference’s main track. The AI reviews complemented human reviews. @mattlease.bsky.social @jessyjli.bsky.social @sebajoe.bsky.social Joydeep Biswas

1mo

20h

I had a fantastic time at the 2026 Harrington Symposium this week at UT Austin. It was wonderful to be able to dig into more science of AI with brilliant researchers across many specialties and viewpoints! Many things to think about! harrington.utexas.edu/faculty-fell...

1mo

New profession just dropped: pharma-morphologist #linguistics #NLP #morphology

I'm on a new committee reviewing ARR's Responsible NLP Checklist and looking to potentially make changes. I'd love to hear others' thoughts on what is working well or needs revised, especially given it might be fresh in memory from the recent ARR cycle. 1/

1mo