- Stochastic gradient descent: https://en.wikipedia.org/wiki/Stochastic_gradient_descent
- TensorFlow.js optimizers: https://js.tensorflow.org/api/latest/#Training-Optimizers
- "On the importance of initialization and momentum in deep learning" by Ilya Sutskever, James Martens, George Dahl, and Geoffrey Hinton: http://www.cs.toronto.edu/~fritz/absps/momentum.pdf
- Bias-variance trade-off: https://en.wikipedia.org/wiki/Bias%E2%80%93variance_tradeoff