In a system subject to unobserved control, can you infer both the underlying dynamics and the control objective? 🤔
A year ago, I was presenting our work at IEEE CDC on solving this problem for stochastic LQR.
arxiv.org/abs/2502.15014
Short 🧵 on the results, and how I think about them a year later.