PyTorch Implementation of Optimizers from Scratch
Batch gradient descent (BGD) computes the gradient of the cost function J(θ) w.r.t. the parameters θ over the entire training dataset. We then update the parameters in the opposite direction of the gradient, with the learning rate η determining how large an update we perform. Update rule:

θ = θ − η · ∇_θ J(θ)
Batch gradient descent will converge to the global minimum for convex error surfaces and to a local minimum for non-convex surfaces.
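As a minimal sketch of this update, the snippet below implements a batch gradient descent loop on plain PyTorch tensors, assuming an illustrative least-squares cost on synthetic linear-regression data; the data, learning rate, and epoch count are arbitrary choices for the example, not part of the repository's code.

```python
import torch

# Illustrative toy data: y = 2x + 1 plus noise (assumed for this sketch).
torch.manual_seed(0)
X = torch.randn(100, 1)
y = 2.0 * X + 1.0 + 0.1 * torch.randn(100, 1)

# Parameters θ = (w, b) and learning rate η.
w = torch.zeros(1, 1, requires_grad=True)
b = torch.zeros(1, requires_grad=True)
eta = 0.1

for epoch in range(100):
    # Cost J(θ) is computed over the ENTIRE training set (this is what makes it "batch").
    y_hat = X @ w + b
    loss = torch.mean((y_hat - y) ** 2)

    # Gradient of the cost w.r.t. the parameters.
    loss.backward()

    # Update rule: θ = θ − η · ∇_θ J(θ)
    with torch.no_grad():
        w -= eta * w.grad
        b -= eta * b.grad
        w.grad.zero_()
        b.grad.zero_()
```

One full pass over the dataset is needed for every single parameter update, which is why BGD can be slow on large datasets even though each update follows the exact gradient.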
All implementations are based on the paper 'An overview of gradient descent optimization algorithms' by Sebastian Ruder.
@article{ruder2016overview,
  title={An overview of gradient descent optimization algorithms},
  author={Ruder, Sebastian},
  journal={arXiv preprint arXiv:1609.04747},
  year={2016},
  url={https://arxiv.org/abs/1609.04747}
}