 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
• |
Filter the data
linearly and then applying a non-
|
|
|
linear
“squashing” function.
|
|
|
• |
The aim is to
maximize the information that the
|
|
|
outputs convey
about the input.
|
|
|
|
– |
Since
the outputs are a deterministic function
|
|
of
the inputs, information is maximized by
|
|
|
maximizing
the entropy of the output
|
|
|
distribution.
|
|
|
|
• |
This
involves maximizing the individual entropies
|
|
|
of the
outputs and minimizing the mutual
|
|
|
information
between outputs.
|
|