//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...
Machine Learning ELLIS PhD at Johannes Kepler University Linz and University of Oxford
Lukas Aichberger





Loading...
We unlocked the working memory of LLMs 💥 Reasoning in Memory (RiM) replaces autoregressive "thinking out loud" with fixed memory blocks that form a task-specific workspace for latent reasoning. The key idea is simple: reasoning should happen inside the LLM, not in its output!
Hot take: I think we just demonstrated the first AI agent computer worm 🤔 When an agent sees a trigger image it's instructed to execute malicious code and then share the image on social media to trigger other users' agents This is a chance to talk about agent security 👇
15d
Mar 20, 2025
Lukas Aichberger
Yarin
4/ Results RiM consistently improves over direct-answer SFT, outperforms Coconut, and becomes competitive with CoT-style reasoning across model families and scales on math reasoning tasks. Crucially, it preserves direct-answer inference speed 🚀
15d
2/ Intuition Established reasoning methods make LLMs externalize their thoughts. Chain-of-thought generates text token by token. Coconut replaces text with continuous thoughts, but still generates them step by step. RiM instead moves reasoning into the LLM's working memory 🧠
5/ Takeaway LLMs do not always need to externalize their thoughts. They can learn to reason in working memory instead, decoupling intermediate computation from autoregressive generation 💡 Full paper: arxiv.org/abs/2605.30343 Huge thanks to @hochreitersepp.bsky.social for the guidance!
3/ Method Simply adding memory blocks to the context is not enough. The hard part is making the LLM actually use them. RiM solves this with an efficient two-stage training curriculum that teaches the model to route reasoning through the memory-block representations 🧩
15d
15d
15d
Lukas Aichberger
To improve the reasoning capabilities of large language models, test-time compute is typically scaled by generating intermediate tokens before the final answer. However, this couples reasoning to auto...
arxiv.org
Unlocking the Working Memory of Large Language Models for Latent Reasoning
Lukas Aichberger
Lukas Aichberger
Lukas Aichberger
Video
⚠️ Beware: Your AI assistant could be hijacked just by encountering a malicious image online! Our latest research exposes critical security risks in AI assistants. An attacker can hijack them by simply posting an image on social media and waiting for it to be captured. [1/6] 🧵
Mar 18, 2025
Lukas Aichberger
Video