The mixture of experts architecture
Combined predictor:
Simple error function for training:
(There is a better error function)
[Figure: Expert 1, Expert 2, Expert 3 and a softmax gating network, drawn above a shared input]
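The formulas for the combined predictor and the simple error function did not survive on this slide, so the sketch below is an assumption rather than a reconstruction. It uses one common formulation: the softmax gate turns one logit per expert into mixing proportions, the combined prediction is the gate-weighted sum of the expert outputs, and the simple error is the gate-weighted sum of each expert's squared error. The names softmax, combined_prediction and simple_error, and the scalar outputs, are hypothetical choices made for illustration.

```python
import math


def softmax(logits):
    """Turn gating logits into mixing proportions that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def combined_prediction(expert_outputs, gate_logits):
    """Assumed form: gate-weighted sum of the expert outputs."""
    gates = softmax(gate_logits)
    return sum(g * y for g, y in zip(gates, expert_outputs))


def simple_error(expert_outputs, gate_logits, target):
    """Assumed form: gate-weighted sum of each expert's squared error.

    Each expert is compared with the target on its own, so an expert
    is not pushed to cancel out the mistakes of the other experts.
    """
    gates = softmax(gate_logits)
    return sum(g * (target - y) ** 2 for g, y in zip(gates, expert_outputs))


if __name__ == "__main__":
    outputs = [1.0, 2.0, 3.0]   # scalar outputs of Expert 1..3 for one input
    logits = [0.0, 0.0, 0.0]    # equal logits give equal gate weights
    print(combined_prediction(outputs, logits))
    print(simple_error(outputs, logits, target=2.0))
```

In a real model the expert outputs and the gating logits would both be computed from the same input, as in the figure. Here they are passed in directly to keep the sketch short.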