Last updated 3 years ago
Was this helpful?
Unavoidable unless weight = 1, but can be improved with Xavier initialization term: sqrt(1./layers_dims[l-1])
weight = 1
sqrt(1./layers_dims[l-1])