๐ Launching Every Eval Ever: Toward a Common Language for AI Eval Reporting ๐
A shared schema + crowdsourced repository so we can finally compare evals across frameworks and stop rerunning everything from scratch ๐ง
A tale of broken AI evals ๐งต๐
evalevalai.com/projects/eve...