Driven by these findings, we develop variants of local-SSL (CLAPP++). They reach the performance of BP baselines on CIFAR10, STL-10, Tiny-ImageNet, while also setting new SOTA of local learning rules on these dataset and ImageNet. Bonus: 40-60% less GPU VRAM and shorter wall clock time than BP.