The mixture of experts architecture
Combined predictor:
Simple error function for training:
(There is a better error function)
[Figure: Expert 1, Expert 2, Expert 3 and a softmax gating network, each receiving the input]
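A minimal sketch of the architecture, for illustration only. The slide's equations did not survive in this text, so the forms used here are assumptions: the combined predictor is taken to be the gate-weighted sum of the expert outputs, and the simple error is taken to be the gate-weighted squared error of each expert against the target. The linear experts, the linear gating scores, and all names (`softmax`, `linear`, `mixture_forward`, `simple_error`) are hypothetical choices, not from the slide.

```python
import math

def softmax(scores):
    # Subtract the max score for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def linear(weights, bias, x):
    # Hypothetical model form: a scalar linear function of the input vector.
    return sum(w * xi for w, xi in zip(weights, x)) + bias

def mixture_forward(x, experts, gates):
    # Every expert and the gating network see the same input.
    expert_outputs = [linear(w, b, x) for w, b in experts]
    gate_probs = softmax([linear(w, b, x) for w, b in gates])
    # Assumed combined predictor: sum_i p_i * y_i.
    combined = sum(p * y for p, y in zip(gate_probs, expert_outputs))
    return combined, gate_probs, expert_outputs

def simple_error(target, gate_probs, expert_outputs):
    # Assumed simple error: sum_i p_i * (target - y_i)^2.
    return sum(p * (target - y) ** 2
               for p, y in zip(gate_probs, expert_outputs))

# Three experts on a 1-D input; zero gating weights give uniform gate probabilities.
experts = [([1.0], 0.0), ([2.0], 0.0), ([-1.0], 1.0)]
gates = [([0.0], 0.0), ([0.0], 0.0), ([0.0], 0.0)]

combined, gate_probs, expert_outputs = mixture_forward([1.0], experts, gates)
err = simple_error(1.0, gate_probs, expert_outputs)
```

Under this assumed error, each expert is compared with the target separately and weighted by its gate probability, so the gating network is pushed to raise the probability of whichever expert has the lowest error on a given case.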