Another view of mixtures of experts
One way to combine the outputs of the experts
is to take a weighted average, using the gating
network to decide how much weight to place on
each expert.
But there is another way to combine the experts.
How many times does the earth rotate around
its axis each year?
What will be the exchange rate of the
Canadian dollar the day after the Quebec
referendum?