May 2, 2025
Late to the party (since I just took some time to spend with our two little ones) but luckily good science is timeless ;)
wth did we not go to an open-source and not-for-profit alternative? en.wikipedia.org/wiki/Bluesky
PQN puts Q-learning back on the map and now comes with a blog post + Colab demo! Also, congrats to the team for the spotlight at #ICLR2025
Jakob Foerster
Nov 23, 2024
Mar 20, 2025
Introducing 🥚EGGROLL 🥚(Evolution Guided General Optimization via Low-rank Learning)! 🚀 Scaling backprop-free Evolution Strategies (ES) for billion-parameter models at large population sizes ⚡100x Training Throughput 🎯Fast Convergence 🔢Pure Int8 Pretraining of RNN LLMs
Apply here and list me as the _first_ supervisor: ox.ac.uk/admissions/g... More information at foersterlab.com. Thanks a lot and happy applying!
🚨 PSA 🚨 Deadline to apply for your dream PhD in ML @FLAIR_Ox is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more..
My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...
6mo
Nov 29, 2024
Nov 29, 2024
Mar 12, 2025
Second #runconference @neuripsconf.bsky.social #NeurIPS2024 ! @jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io Same deal for tomorrow: 7am at goo.gl/maps/8Z8eMrd... Join us!
Jakob Foerster
Jakob Foerster
Dec 11, 2024
That's the first time that I see a video by chess.com cited in an accepted ICLR paper, in particular on handshakes vs. fist bumps during a chess competition ... By Oxford, @jfoerst.bsky.social Paper: openreview.net/forum?id=wFg... Video: www.youtube.com/watch?v=6fS7... @danielrensch.chess.com
my.corehr.com
Job Details
Jakob Foerster
Jakob Foerster
Jakob Foerster
Feb 9, 2025
@jfoerst.bsky.social's take on how the community sees the ARC Challenge and how we evaluate models and use benchmarks nowadays is 👌. #more_science_less_hype (please). PS: Amazing discussion and good brain food, as usual with MLST.
Feb 18, 2025
Pablo Samuel Castro
Christian Wolf
About the course: The DPhil in Engineering Science will offer you the opportunity to develop in-depth knowledge, understanding and expertise in your chosen field of engineering research. To support
ox.ac.uk
DPhil in Engineering Science | University of Oxford
Amine El Ouassouli
PQN blog 3/3 👉take a look at Matteo's 5-minute blog covering PQN’s key features, plus a Colab demo with JAX & PyTorch implementations mttga.github.io/posts/pqn/ 🔎 For a deeper dive into the theory: blog.foersterlab.com/fixing-td-pa... blog.foersterlab.com/fixing-td-pa... See you in Singapore! 🇸🇬
PQN Blog 1/3: TD methods are the bread and butter of RL, yet can have convergence issues when used in practice. This has always annoyed me. Find out below why TD is so unstable and how we can understand this instability better using the TD Jacobian. @flair-ox.bsky.social @jfoerst.bsky.social
Mar 20, 2025
Mar 19, 2025
YouTube video by Machine Learning Street Talk
www.youtube.com
ImageNet Moment for Reinforcement Learning?
Mattie Fellows
Mattie Fellows
First #runconference @neuripsconf.bsky.social #NeurIPS2024 was great! Will share tomorrow's deets later today, join us! @zacharylipton.bsky.social @adamjelley.bsky.social @random-steve.bsky.social
Dec 10, 2024
Pablo Samuel Castro
Fixing TD Pt I: Why is Temporal Difference Learning so Unstable?
blog.foersterlab.com
mttga.github.io
A modern implementation of Deep Q-Network without target networks and replay buffers.
Simplifying Deep Temporal Difference Learning
If you're already in Vancouver and like to run, join us tomorrow (Tuesday) at 7:30am in front of the Fairmont Waterfront for the first installment of #runconference @neuripsconf.bsky.social #NeurIPS2024 ! 🏃🏾🤖 cc @zacharylipton.bsky.social maps.app.goo.gl/rjrYsuYdPEv4...
Dec 9, 2024
Pablo Samuel Castro
Find local businesses, view maps and get driving directions in Google Maps.
maps.app.goo.gl
Google Maps