//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...
PhD student at University of Montreal // Mila Β·Β·Β· mechanistic understanding of LLMs + Human-AI collaboration for science Β·Β·Β· http://mirandrom.github.io
Andrei Mircea





Loading...
Thanks to my collaborators and mentors @katelobacheva.bsky.social, Irina Rish, Supriyo Chakraborty, and Nima Chitsazan. Also Ashwinee Panda for coining "zero-sum learning", which is honestly a pretty great name.
TL;DR We find two new phenomena (loss deceleration + zero-sum learning) and show quantifiably how scaling improves LLMs by mitigating these. What’s cool is that these could potentially be mitigated independent of scaling (Step 2). Exactly how to do this remains an open question.
11mo
Mechanistic understanding of systematic failures in language models is something more research should strive for IMO. This is really interesting work in that vein by @ziling-cheng.bsky.social, highly recommend you check it out.
Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts follow a systematic failure mode β€” revealing how LLMs generalize using abstract classes + context cues, albeit unreliably. πŸ“Ž Paper: arxiv.org/abs/2505.22630 1/n
All of our code and artefacts are also open, which hopefully will help. Code: github.com/mirandrom/zsl Checkpoints: huggingface.co/mirandrom/zs... Wandb logs: wandb.ai/amr-amr/zsl/...
Step 1: Understand how scaling improves LLMs. Step 2: Directly target underlying mechanism. Step 3: Improve LLMs independent of scale. Profit. In our ACL 2025 paper we look at Step 1 in terms of training dynamics. Project: mirandrom.github.io/zsl Paper: arxiv.org/pdf/2506.05447
11mo
Jun 10, 2025
Jun 6, 2025
11mo
11mo
Andrei Mircea
Andrei Mircea
Andrei Mircea
Andrei Mircea
Andrei Mircea
Ziling Cheng