## Can LLMs Score Medical Diagnoses and Clinical Reasoning as well as Expert Panels? ##
It appears so... The "LLM Jury" scores correlate highly with the original scores of a panel of human experts. A second human panel ("Re-score panel") correlates less strongly with those original scores.
P: arxiv.org/abs/2604.14892
youtu.be/0qn2z9mD4Tc?...
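To make "correlates highly" concrete, here is a minimal sketch of how agreement between two sets of scores could be measured with a Pearson correlation. The score lists are hypothetical toy numbers, not data from the paper, and the paper may well use a different agreement metric.

```python
from math import sqrt
from typing import Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equally long score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


# HYPOTHETICAL scores for five cases (illustration only, not from the paper).
expert_panel = [1, 2, 3, 4, 5]    # original human expert scores
llm_jury = [1, 2, 3, 4, 6]        # scores from an LLM jury
rescore_panel = [2, 1, 4, 3, 5]   # scores from a second human panel

r_jury = pearson(expert_panel, llm_jury)
r_rescore = pearson(expert_panel, rescore_panel)
print(f"LLM jury vs. experts:      r = {r_jury:.3f}")
print(f"Re-score panel vs. experts: r = {r_rescore:.3f}")
```

In this toy setup the jury tracks the original scores more closely than the second panel does, which is the pattern the post describes.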
Your AI model fails on new data?
It will likely be due to the domain gap! Here is what you can do for biomedical AI models:
P: arxiv.org/abs/2604.20824
Symbol-equivariant Recurrent Reasoning Models (SE-RRM)
SE-RRM advances HRM and TRM -- guaranteed identical solutions for problems with permuted colors (ARC AGI) or digits (Sudoku).
Coolest part: extrapolation to larger problem sizes!!!
P: arxiv.org/abs/2603.02193
C: github.com/ml-jku/SE-RRM
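What "identical solutions for permuted symbols" means can be sketched as a property check: solving a relabeled puzzle should give the relabeled solution. The snippet below is only an illustration on a toy one-row puzzle. The solvers and helper names are hypothetical and have nothing to do with the SE-RRM code, which (per the post) guarantees this property rather than testing for it.

```python
from typing import Callable, Dict, List, Optional

Row = List[Optional[int]]  # None marks the single empty cell


def fill_missing(row: Row, symbols: List[int]) -> List[int]:
    """Toy solver: fill the empty cell with the one symbol not yet in the row."""
    missing = [s for s in symbols if s not in row]
    return [missing[0] if cell is None else cell for cell in row]


def fill_with_one(row: Row, symbols: List[int]) -> List[int]:
    """Toy counterexample: always writes the symbol 1, ignoring the row."""
    return [1 if cell is None else cell for cell in row]


def permute(row: Row, perm: Dict[int, int]) -> Row:
    """Relabel every symbol according to perm; empty cells stay empty."""
    return [None if cell is None else perm[cell] for cell in row]


def is_symbol_equivariant(
    solver: Callable[[Row, List[int]], List[int]],
    row: Row,
    symbols: List[int],
    perm: Dict[int, int],
) -> bool:
    """Check solver(perm(row)) == perm(solver(row)) for one row and one permutation."""
    solve_then_permute = permute(solver(row, symbols), perm)
    permute_then_solve = solver(permute(row, perm), symbols)
    return solve_then_permute == permute_then_solve


symbols = [1, 2, 3, 4]
puzzle: Row = [1, None, 3, 4]
relabel = {1: 2, 2: 3, 3: 4, 4: 1}  # cyclic relabeling of the digits

print(is_symbol_equivariant(fill_missing, puzzle, symbols, relabel))
print(is_symbol_equivariant(fill_with_one, puzzle, symbols, relabel))
```

The first solver only reasons about which symbol is absent, so relabeling commutes with solving. The second hard-codes a specific digit, which is exactly the kind of symbol dependence an equivariant architecture rules out.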
### NEW ENTRY IN THE AI IN DRUG DISCOVERY LEADERBOARD ###
CheMeleon takes the lead as the **best pre-trained model** and gets the Bronze medal (rank 3) overall.
It's an MPNN pre-trained to predict chemical descriptors.
P: arxiv.org/abs/2506.15792
C: github.com/JacksonBurns...
# AI in Drug discovery just BROKE THROUGH a wall #
A newer AI model, ConGLUDe, is as fast as DrugCLIP but much more accurate.
Instead of just 40K structure-based data points, ConGLUDe is trained on 100M data points from ligand-based data.
P: arxiv.org/abs/2601.09693
A NEW ENTRY TO THE TOX21 LEADERBOARD:
GROVER from TENCENT AI LAB obtains rank 5 on the Tox21 leaderboard. Best pre-trained model and clearly outperforms CHEMPROP.
Original implementation: github.com/tencent-aila...
Tox21 leaderboard on Hugging Face: huggingface.co/spaces/ml-jk...
xLSTM for Real-Time DNS Tunnel Detection: arxiv.org/abs/2512.09565
DNS-HyXNet = xLSTM for DNS tunnels.
DNS-HyXNet has 99.99% accuracy, with F1-scores exceeding 99.96%, and per-sample detection latency of just 0.041 ms, confirming its scalability and real-time readiness. wow!
Sepp Hochreiter, Head of LIT AI Lab, comments on the EurIPS conference in this FAZ guest article:
He highlights Europe’s growing visibility in AI research and the role of ELLIS in strengthening the community. 🌍
www.faz.net/pro/digitalw...
Users organize their lives with #OpenClaw. To do so, however, the AI bot needs extensive access to personal data. And what is the story behind #Moltbook, where these bots supposedly talk to each other? @gklambauer.bsky.social, Professor of AI at JKU Linz, puts the hype into context.
YouTube video by AI Podcast Series. Byte Goose AI.
The EurIPS conference took place for the first time in Copenhagen. It is not a counter-event to the famous American NeurIPS, but a stage for first-class European AI research. Now ...