Multi-Token Prediction for MLX, #MTPLX tested on the M3 Max. 2.24× promised, +40% measured on my M3 Max (same quality).
www.rotecodefraktion.de/en/blog/mlx-...
#MLX #AppleSilicon #AI
MTPLX promises noticeably faster local LLMs on the Mac via Qwen 3.6's built-in MTP heads. Measured on an M3 Max, with real numbers instead of marketing claims.