Question: Minimize
F(x,p)
F(x,p)=ā«0Tāf(x,p,t)dt(1) Subject to
g(x0ā,p)=x0āāp=0(2) -> If given x0ā, we can compute p at t=0 through g(x0ā,p), then substitue xtā and p in h(xtā,xĖtā,p,t) and can compute xĖtā
Why is āpāxā difficult to calculate? Because fn(x,p) is unknown, need to solve ALL possible x and p using ODE
Apply Lagrangian function L(x,Ī»)=f(x)āĪ»g(x) and combine (1) (2) (3) in one loss function
Loss=ā«0Tā[f(x,p,t)+Ī»Th(xtā,xĖtā,p,t)]dt+uTg(x0ā,p)(4) Substitute (2) (3)
Loss=ā«0Tā[f(x,p,t)+Ī»T0]dt+uT0=ā«0Tāf(x,p,t)dt=F(x,p)(5) Why does integral equals loss? Why is this loss function? Minimize loss?
āpāLā=āpāFā(6) Why calculate āpāLāduring backprop? To use Newton's Method to approximate f(x) at a given point
āpāLā=āpāFā=ā«0Tā[āxāfāā
āpāxā+āpāfā+Ī»T(āxāhāā
āpāxā+āxĖāhāā
āpāxĖā+āpāhā)]dt+uT(āx0āāgāā
āpāx0āā+āpāgā) To avoid compute āpāxĖā , we apply integration by parts ā«udv=uvāā«vdu
āpāLā=āpāLā