History of CentOS: How a biochemist's Linux hobby project became the enterprise world's default operating system
Read how we designed and deployed Multipath Reliable Connection (MRC), a multi-company step towards Ultra Ethernet, at 800G in Microsoft's datacenters.
Demonstrated with a 75k GPU pretraining job running stably through multiple faults.
OpenAI blog: buff.ly/X0KL4Xy
Paper: buff.ly/I3OcXWn
Tomorrow (Thursday, May 21), Brad Chamberlain and Jade Abraham will be giving an overview, update, and demo of Chapel at the Northwest C++ Users’ Group at 7pm PT. Attend in person in Bellevue, WA, or online via Microsoft Teams.
nwcpp.org/May-2026.html
At the Salishan HPC conference last week, I gave a talk called "AI doesn't need massive supercomputers after all!"
A couple people asked me for the slides (as crappy as they were), so here they are in hastily written blog format.
blog.glennklockwood.com/2026/05/ai-d...
#AI #HPC
#hpc This Friday's exploit: ssh-keysign-pwm github.com/0xdeadbeefne...
Wow. Another HPC acceleration by AI paper. This is the way
When a community came together after Red Hat said Windows was 'probably the right product'
Well, that's a wrap: Spring 2026 Big Data Management (aka scalable tools) at Lehigh University is done and grades are submitted. 29 students now know about Spark, Hadoop, Hive, Kafka, ... and even a bit of the Linux command line 😉
#HPC must-read:
"FP8 is All You Need (Part 1): Debunking Hardware FP64 as the HPC Holy Grail" from Satoshi Matsuoka
arxiv.org/pdf/2606.06510
OpenAI introduces MRC (Multipath Reliable Connection), a new supercomputer networking protocol released via OCP to improve resilience and performance in large-scale AI training clusters.