dramatization of Mythos finally meeting Dario Amodei
i am either cooking or completely out of my mind
so what is the overlap between "my use case is not ML, math, physics, biology, or cybersecurity" and "I would pay exorbitant per-token rates for a better model"?
on an enterprise level, who wants to use a model that has a silent active-sabotage feature and is trained to never talk about security?
not about anything in particular but some people on here are a bit too gullible w/r/t new AI papers
no, this novel training method didn't make a 3B model better at all tasks than Claude, that paper didn't find a 10x efficiency gain, that new VRAM-saver is slow or degrades performance, etc.
wwdc stands for "what would desus chew"
I guess they're trying not to push their luck with already-tepid non-SF municipalities but I wonder how long it'll be until cities start making infrastructure explicitly more legible to AVs via short-range comms, including IR legibility in standards for signage, etc.