This representation parameterization is useful because it (1) tends to generalize better and (2) is more interpretable. But this is just data-efficiency and aesthetics. These properties cannot provide *evidence* against behaviorism. And they provide weak (if any) evidence for the representation.