What 50,000 Runs of a 5-Line Eval Taught Us
buff.ly/LZOCgDr
#vscode #evals #visualstudiocode #ai #devtools
How AI coding models calibrate effort, token cost, and tool use on even the simplest task, and what that means for model selection and cost.
buff.ly
Alvin Ashcraft