Unavoidable unless weight = 1, but can be improved with Xavier initialization term: sqrt(1./layers_dims[l-1])
weight = 1
sqrt(1./layers_dims[l-1])
Last updated 4 years ago