🚨Preprint time!
Most of us are familiar with coding agents, e.g., GPT-Codex, Claude-Code. They mostly do well with tasks in Python and other familiar languages. However, wrt industry and widespread applications, performance on unfamiliar languages needs to be studied deeply. Here's our attempt!