🚨 Preprint on LLMs in external environments:
Zhongqi (Nick) Yue, a great post-doc in my lab, has led the development of EARL, a new reinforcement learning framework that lets LLMs interact with external environments, greatly improving over text-only interaction in reasoning tasks.
The key is to decouple environment interaction from language generation while maintaining the reasoning capabilities of pre-trained models.
Project page: expa-rl.github.io
Preprint: arxiv.org/abs/2510.07581
PS. Nick is on the job market!
This week has been an absolute joy for me as the leader of the Healthy AI Lab! Two of my students, Anton Matsson (3rd from right) and Lena Stempfle (2nd from right), defended their theses and became the first PhD graduates under my supervision 🎉 You will both be sorely missed!
I'm hiring a post-doc for the Healthy AI Lab 📣. Come join us to work on machine learning methods to improve decision-making, e.g., in health applications. If you can't stop thinking about research problems until you make progress, you may be the right fit!
www.chalmers.se/en/about-cha...
Last Friday, Newton Mwai Kinyanjui defended his PhD thesis "Leveraging Structural Priors and Historical Data for Practical Treatment Personalization with Multi-Armed Bandits". It's been a pleasure having you in the lab, Newton! Looking forward to seeing what the next chapter brings!
Thank you Branislav Kveton, Sandeep Juneja, Slawomir Nowaczyk and Yevgeny Seldin for serving as Newton's grading committee and opponent!
Read Newton's PhD thesis here: research.chalmers.se/publication/...
We are delighted to announce the #EurIPS 2025 Workshops 🎉: eurips.cc/workshops/
We received 52 proposals, which were single-blind reviewed by more than 35 expert reviewers, leading to 18 accepted workshops (acceptance rate 34.6%).
Fredrik Johansson
Great first day of the 3rd annual CHAIR Structured Learning Workshop @ Chalmers! 🥳
Event page & agenda: ui.ungpd.com/Events/60bfc...
1st day featuring:
@betapata.bsky.social
@janstuehmer.bsky.social
@arnauddoucet.bsky.social
@frejohk.bsky.social
Personalizing treatments for patients often requires sequentially trying different options from a set of available therapies until the most effective one is identified for the patient’s characteristic...
Machine learning offers great promise for developing new treatment policies from observational clinical data. However, a key challenge in this offline setting is reliably assessing the performance of ...