Gradient Checking

limā”Īµā†’0f(x+Īµ)āˆ’f(xāˆ’Īµ)2Īµ(1)\lim_{\varepsilon\to0} {\frac{f(x+\varepsilon)-f(x-\varepsilon)}{2\varepsilon}} \tag1
limā”Īµā†’0f(x+Īµ)āˆ’f(x)Īµ(2)\lim_{\varepsilon\to0} {\frac{f(x+\varepsilon)-f(x)}{\varepsilon}} \tag2

Take all params and concatenate into vector Īø

  1. Gradient might be correct when W,b is 0, incorrect when W.b is larger, run grad checks at both sites

Last updated