The AdamW videos compress to 1/6th the size of the Muon videos. Something AdamW does makes its crease visualisation compress well, while Muon's does not. This is the weirdest observation ever.
Confession time: I use agentic coding all day, every day. It makes me much more productive.
But I am also terrified of skill atrophy. I feel like I need to break out pen & paper to force myself to "weight-lift" mentally so I don't forget how to think.
How do y'all handle this?
With age, the male body produces less testosterone but more advice.
The weirdest observation: I generated movies visualizing the polytope boundaries for ReLU networks using Muon and AdamW.
Same experiment, same data, same random seed. The difference is the "crease pattern" that the optimizers produce.
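For readers who want to poke at this themselves, here is a rough sketch of what a "crease pattern" is and how one might measure its compressibility. It is not the code behind the movies: the network is a single random (untrained) hidden ReLU layer on 2D inputs, no optimizer is involved, and the names `make_network`, `crease_image` and `compressed_size` are made up for illustration. A cell counts as a crease when its ReLU activation pattern differs from that of its right or lower neighbour, and zlib stands in for a video codec as a crude compressibility proxy.

```python
import random
import zlib


def make_network(n_hidden, seed):
    """Random 2 -> n_hidden ReLU layer. Placeholder weights, NOT trained."""
    rng = random.Random(seed)
    weights = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n_hidden)]
    biases = [rng.gauss(0, 0.5) for _ in range(n_hidden)]
    return weights, biases


def activation_pattern(net, x, y):
    """Tuple of booleans: which hidden units are active at input (x, y)."""
    weights, biases = net
    return tuple(wx * x + wy * y + b > 0 for (wx, wy), b in zip(weights, biases))


def crease_image(net, size=64, extent=2.0):
    """size x size grid of 0/1 over [-extent, extent]^2.

    A cell is 1 when its activation pattern differs from the pattern of
    its right or lower neighbour, i.e. a polytope boundary passes nearby.
    """
    coords = [-extent + 2 * extent * i / (size - 1) for i in range(size)]
    patterns = [[activation_pattern(net, x, y) for x in coords] for y in coords]
    image = []
    for r in range(size):
        row = []
        for c in range(size):
            here = patterns[r][c]
            right = patterns[r][min(c + 1, size - 1)]
            below = patterns[min(r + 1, size - 1)][c]
            row.append(int(here != right or here != below))
        image.append(row)
    return image


def compressed_size(image):
    """Bytes needed by zlib for the flattened image (crude proxy only)."""
    raw = bytes(v for row in image for v in row)
    return len(zlib.compress(raw, 9))


if __name__ == "__main__":
    net = make_network(n_hidden=16, seed=0)
    print(compressed_size(crease_image(net)))
```

With a single hidden layer the creases are straight lines; deeper networks bend them, and comparing optimizers would mean running this on trained weights frame by frame.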
www.faz.net/premium/digi...
I wrote a FAZ guest article.
Vulnerabilities in computers were tolerated for a long time, because exploiting them was technically complex and expensive. AI is now changing that. In doing so, it forces us to deal with legacy burdens faster.
www.faz.net
The most insightful take on Mythos I've seen so far. Everyone should read this but especially those who are currently thinking through the possible regulatory responses.
rewatching @halvarflake.bsky.social's fuzzing24 keynote and thinkin real hard
www.youtube.com/watch?v=Jd1h...
mxrt
consistently see +/-20% swings on AWS in our benchmarks
@halvarflake.bsky.social this sounds like your prediction:
that machine learning will be more useful for offense than for defense.
Video by @hankgreen.bsky.social
(Haven’t heard the whole video yet)