Can the new DiffusionGemma model help fix broken OCR?
In theory, denoising tokens in parallel could work better for OCR correction since context is seen upfront?
Pointed it at 19th-century newspaper OCR. It corrected better than the autoregressive baseline — at ~8x the speed.