R/optimizer.R
adagrad.Rd
AdaGrad SGD optimization
adagrad(stepsize = 0.05, epsilon = 1e-08)
stepsize for SGD
epsilon for numerical stability
a list of control variables for optimization (used in control_opt function)
control_opt
The update rule for AdaGrad is: vt=vt−1+g2t xt+1=xt−stepsize∗gt√vt+ϵ