Gradient Descent Optimization With AMSGrad From Scratch