Post
In deep linear networks with orthonormal feedforward weights, we prove that local-SSL and BP have identical gradient updates. If orthonormality is broken by shrinking layer widths, we prove that direct feedback from the last layer makes local-SSL gradients more similar to BP gradients.
1d
Zihan Wu
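
A toy numerical check of the first claim in the post, under assumptions the post does not state: the local-SSL objective is taken here to be a simple alignment loss, 0.5 * ||h_l(x1) - h_l(x2)||^2, between two views at every layer, and BP uses that same loss at the last layer only. The paper's actual loss may differ, and the direct-feedback mechanism is not implemented. All function names are hypothetical.

```python
import numpy as np

def random_orthonormal(n_out, n_in, rng):
    # Matrix of shape (n_out, n_in) with orthonormal rows (needs n_out <= n_in).
    # When n_out == n_in the result is a full orthogonal matrix.
    q, _ = np.linalg.qr(rng.standard_normal((n_in, n_out)))
    return q.T

def forward(weights, x):
    # Deep linear network: h_0 = x, h_l = W_l h_{l-1}.
    hs = [x]
    for w in weights:
        hs.append(w @ hs[-1])
    return hs

def bp_and_local_grads(weights, x1, x2):
    # Assumed loss: 0.5 * ||h(x1) - h(x2)||^2 (alignment of two views).
    h1, h2 = forward(weights, x1), forward(weights, x2)
    diffs = [a - b for a, b in zip(h1, h2)]
    bp, local = [], []
    for l, _ in enumerate(weights):
        # Local rule: the loss is applied at this layer's own output.
        local.append(np.outer(diffs[l + 1], diffs[l]))
        # BP: the last-layer error is propagated back through W^T.
        err = diffs[-1]
        for w in reversed(weights[l + 1:]):
            err = w.T @ err
        bp.append(np.outer(err, diffs[l]))
    return bp, local

def cosine(a, b):
    return float(np.sum(a * b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def layerwise_cosines(widths, seed=0):
    # Cosine similarity between BP and local gradients, one value per layer.
    rng = np.random.default_rng(seed)
    weights = [random_orthonormal(widths[i + 1], widths[i], rng)
               for i in range(len(widths) - 1)]
    x1 = rng.standard_normal(widths[0])
    x2 = rng.standard_normal(widths[0])
    bp, local = bp_and_local_grads(weights, x1, x2)
    return [cosine(g_bp, g_loc) for g_bp, g_loc in zip(bp, local)]

square_cos = layerwise_cosines([6, 6, 6, 6])   # square orthogonal weights
shrink_cos = layerwise_cosines([8, 6, 4, 2])   # shrinking layer widths

print("equal widths:    ", np.round(square_cos, 4))
print("shrinking widths:", np.round(shrink_cos, 4))
```

With equal widths every downstream product Q of orthogonal matrices satisfies Q^T Q = I, so the backpropagated error equals the local error at each layer. With shrinking widths Q^T Q is only a projection, so the earlier layers' gradients generally diverge from BP; that gap is what the post says direct feedback from the last layer reduces.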