40
Learning with auditory labels
Alex Kaganov replaced the class labels by binarized cepstral
spectrograms of many different male speakers saying digits.
The auditory pathway then had multiple layers, just like the visual
pathway. The auditory and visual inputs shared the top level layer.
After learning, he showed it a visually ambiguous digit and then
reconstructed the visual input from the representation that the top-
level associative memory had settled on after 10 iterations.
“six”
“five”
reconstruction                 original  visual input              reconstruction