Inlay

Profile

Probabilistic machine learning, Bayesian computation, and computer vision PhD in Computer Science 📍Helsinki, Finland https://pipme.github.io/

Chengkun Li

All approximations are wrong, but some are useful --> Stacking can make them more useful 😄

I’ll be at the AABI symposium tomorrow. Looking forward to seeing everyone there!

😀 Enter the amortized Bayesian workflow from here 👉 pipme.github.io/amortized-Ba... Huge thanks to my great collaborators: @avehtari.bsky.social, @paulbuerkner.com, @stefanradev.bsky.social, @lacerbi.bsky.social, @marvin-schmitt.com!

5mo

1/ Introducing ACE (Amortized Conditioning Engine)! Our new AISTATS 2025 paper presents a transformer framework that unifies tasks from image completion to BayesOpt & simulator-based inference under *one* probabilistic conditioning approach. It's Bayes all the way down!

If you want to start your own research group in AI & machine learning, with access to top resources for research incl. @lumi-supercomputer.eu, generous starting package & professorship affiliation with a university in the world’s happiest country, apply by March 9: www.ellisinstitute.fi/PI-recruit

Apr 28, 2025

1mo

1/ If you are at ICLR / AABI / AISTATS, check out work from our lab and collaborators on *inference everywhere anytime all at once*! Go talk to my incredible PhD students @huangdaolang.bsky.social & @chengkunli.bsky.social + amazing collaborator Severi Rissanen. @univhelsinkics.bsky.social FCAI

Do you like to train neural networks to solve all your nasty probabilistic inference and sequential design problems? Do you love letter salads such as NPs, PFNs, NPE, SBI, BED? Then no place is better than the Amortized ProbML workshop we are organizing at #ELLIS UnConference.

1/ Just saw this paper using our PyVBMC (acerbilab.github.io/pyvbmc/) in structural engineering. Nice to see sample-efficient Bayesian inference for expensive computational models used in the wild! (Although we feel a bit more pressure to triple-check that our implementation has no bugs...)

Mar 6, 2025

Feb 14, 2025

Apr 27, 2025

8mo

Apr 7, 2025

Multi-Head Latent Attention vs Group Query Attention: We break down why MLA is a more expressive memory compression technique AND why naive implementations can backfire. Check it out!

Chengkun Li

Adaptive Bayesian workflow combining amortized inference with PSIS and MCMC diagnostics.

pipme.github.io

Amortized Bayesian Workflow

Mar 12, 2025

ELLIS Institute Finland

Luigi Acerbi

Paul Chang

1/ Wouldn't it be nice if you could perform Bayesian inference *efficiently* but also *reliably*? Amortized inference offers the former, while MCMC is often presented as the "gold standard" for accuracy and reliability. Enter the Amortized Bayesian Workflow...

1mo

Luigi Acerbi

me: now I've read the basic lit on approximate bayesian computation via bayesflow and I can start trying to distil it on time for the last lecture, to awe the students. @chengkunli.bsky.social : hold my mcmc (really really cool!)

1mo

Riccardo Fusaroli

Now recruiting new PIs in artificial intelligence and machine learning

www.ellisinstitute.fi

Principal Investigator positions at ELLIS Institute Finland | ELLIS Institute Finland

1/ Excited to share our new work published in Transactions on Machine Learning Research (TMLR), Stacking Variational Bayesian Monte Carlo (S-VBMC)!

1/10🔥 New paper alert in #AABI2025 Proceedings! Normalizing Flow Regression (NFR) — an offline Bayesian inference method. What if you could get a full posterior using *only* the evaluations you *already* have, maybe from optimization runs?

⚡️Multi-Head Latent Attention is one of the key innovations that enabled @deepseek_ai's V3 and the subsequent R1 model. ⏭️ Join us as we continue our series into efficient AI inference, covering both theoretical insights and practical implementation: 🔗 datacrunch.io/blog/deepsee...

5mo

Apr 22, 2025

1mo

Mar 12, 2025

Luigi Acerbi

Adaptive Bayesian workflow combining amortized inference with PSIS and MCMC diagnostics.

pipme.github.io

Amortized Bayesian Workflow

Chengkun Li

1mo

Multi-Head Latent Attention (MLA) improves upon Group Query Attention (GQA), enabling long-context reasoning models and wider adoption across open-source LLMs.

datacrunch.io

DeepSeek + SGLang: Multi-Head Latent Attention

Luigi Acerbi