'Bluesky has overtaken its flailing rival X in hosting posts related to new academic research, indicating the platform is fast becoming the go-to place for scholars to share their work.'
Margot Finn
bsky.app/profile/aelo...
Data indicates more scholars turning to alternative social media site to post about their work after Elon Musk’s Twitter takeover
I missed this one when it came out, but I can tell that it is one of the most useful pieces of research I’ve read in a while.
“GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models”
arxiv.org/html/2410.05...
Needed an update
The more I read and listen to current debates in the field, the more I’m convinced that we have a model evaluation crisis.
13 minutes of wisdom.
“No authorities in science”.
Amen to that.
100%
I never understood people going to concerts to spend their time there attending through the tiny screens of their phones.
We really need better brain-power allocation. The current algorithm is kind of turning crazy.
@jfoerst.bsky.social's take on how the community sees the ARC Challenge, and on how we evaluate models and use benchmarks nowadays, is 👌.
#more_science_less_hype (please).
PS: Amazing discussion and good brain food, as usual with MLST.