Weight increase with epochs. Similar to L₂ regularization, stop when weight (validation - train) exceed a pre-determined amount to prevent overfitting.
L₂ regularization
Downside: stop optimizing cost function (eliminate the difference between dev and test)
Alternative is L₂ regularization
Last updated 4 years ago