//
sign in
Post
by @danabra.mov
PostEmbed
by @danabra.mov
Record
by @jimpick.com
Record
by @atsui.org
+ new component
Post
Homeostatic dimensional degeneracy during development strikes again!
11h
Arseny Khakhalin
NEW PAPER. Why do larger networks train better? "Because they contain more candidate *sub*networks that can learn the task" → lottery tickets This popular explanation uses an appealing but misleading metaphor🧵 We propose an intuitive alternative grounded in theory: escape dimensions
21h
Flavio Martinelli