Session 8 - Reinforcement Learning

Reinforcement Learning

Christian Cabrera Jojoa

Senior Research Associate and Affiliated Lecturer

Department of Computer Science and Technology

University of Cambridge

chc79@cam.ac.uk

Last Time

ML Definition

The Data Science Process

Data Science Process

Machine Learning Pipeline

Data Assess Pipeline

The Perceptron

Perceptron Overview - Andreas Maier, CC BY 4.0, via Wikimedia Commons

Neural Networks

Neural networks extend the perceptron by introducing multiple layers of interconnected neurons (i.e., multi-layer perceptron), enabling the learning of complex, non-linear relationships in data.


Neural Networks

Neural Network
Neural Network Architecture.

Neural Networks:


  • Fix the number of basis functions in advance
  • Basis functions are adaptive and their parameters can be updated during training

Training is costly but inference is cheap.


Neural Networks

Neural Network
Neural Network Architecture.

Key Components:

  • Input Layer: Receives the input features
  • Hidden Layers: Process information through weighted connections
  • Output Layer: Produces the final prediction
  • Activation Functions: Introduce non-linearity
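To make these components concrete, here is a minimal sketch (using PyTorch as an illustrative choice; the layer sizes are assumptions, not from the slides) of a network with an input layer, one hidden layer, a non-linear activation, and an output layer:

import torch
import torch.nn as nn

# A small feedforward network: 4 input features, one hidden layer of 16
# units with a non-linear activation, and 3 output units.
model = nn.Sequential(
    nn.Linear(4, 16),   # input layer -> hidden layer (weighted connections)
    nn.ReLU(),          # activation function: introduces non-linearity
    nn.Linear(16, 3),   # hidden layer -> output layer (final prediction)
)

x = torch.randn(8, 4)   # a batch of 8 example inputs
y = model(x)            # forward pass
print(y.shape)          # torch.Size([8, 3])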

Neural Networks

Feedforward Neural Network:

The overall network function combines these stages. For sigmoidal output-unit activation functions, it takes the form:

$$y_k(\mathbf{x}, \mathbf{w}) = \sigma\left(\sum_{j=1}^{M} w_{kj}^{(2)}\, h\left(\sum_{i=1}^{D} w_{ji}^{(1)} x_i + w_{j0}^{(1)}\right) + w_{k0}^{(2)}\right)$$

The bias parameters can be absorbed into the weights by introducing an extra input x₀ = 1:

$$y_k(\mathbf{x}, \mathbf{w}) = \sigma\left(\sum_{j=0}^{M} w_{kj}^{(2)}\, h\left(\sum_{i=0}^{D} w_{ji}^{(1)} x_i\right)\right)$$

The activation functions h(·) and σ(·) are continuous functions, so the neural network is differentiable with respect to the parameters w.

Two-layer Neural Network - (Bishop, 2006).

Neural Networks

Sigmoid Function:

$$\sigma(x) = \frac{1}{1 + e^{-x}}$$

Range: (0, 1)

Derivative: σ′(x) = σ(x)(1 − σ(x))

Logistic Curve - Qef, Public domain, via Wikimedia Commons.

Hyperbolic Tangent:

$$\tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}}$$

Range: (−1, 1)

Derivative: tanh′(x) = 1 − tanh²(x)

Hyperbolic Tangent - Geek3, CC BY-SA 3.0, via Wikimedia Commons.

ReLU (Rectified Linear Unit):

$$\mathrm{ReLU}(x) = \max(0, x)$$

Range: [0, ∞)

Derivative: 0 for x < 0 and 1 for x > 0 (undefined at x = 0)

Ramp Function - Qef, Public domain, via Wikimedia Commons.
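As a quick illustration of these definitions and their derivatives, here is a minimal NumPy sketch (the function names are mine, not from the slides):

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_deriv(x):
    s = sigmoid(x)
    return s * (1.0 - s)          # σ'(x) = σ(x)(1 − σ(x))

def tanh_deriv(x):
    return 1.0 - np.tanh(x) ** 2  # tanh'(x) = 1 − tanh²(x)

def relu(x):
    return np.maximum(0.0, x)

def relu_deriv(x):
    return (x > 0).astype(float)  # 0 for x < 0, 1 for x > 0

x = np.linspace(-3, 3, 7)
print(sigmoid(x), np.tanh(x), relu(x), sep="\n")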

Neural Networks

Training Process:

Given a training set of N example input-output pairs:

$$(\mathbf{x}_1, \mathbf{y}_1), (\mathbf{x}_2, \mathbf{y}_2), \ldots, (\mathbf{x}_N, \mathbf{y}_N)$$

Each pair was generated by an unknown function f:

$$\mathbf{y} = f(\mathbf{x})$$

We want to find a hypothesis h, with parameters w, that minimises the error function, e.g. the sum-of-squares error:

$$E(\mathbf{w}) = \frac{1}{2} \sum_{n=1}^{N} \left\| h(\mathbf{x}_n; \mathbf{w}) - \mathbf{y}_n \right\|^2$$

Two-layer Neural Network - (Bishop, 2006).

Neural Networks

Gradient Descent Algorithm
Gradient Descent Algorithm - Jacopo Bertolotti, CC0, via Wikimedia Commons.

We choose any starting point, compute an estimate of the gradient, and move a small amount in the steepest downhill direction, repeating until we converge on a point in weight space with (locally) minimal loss.


Gradient Descent Algorithm:
Initialise w
repeat
    for each w[i] in w
        Compute gradient: g = ∇Loss(w[i])
        Update weight:   w[i] = w[i] - α * g
until convergence

The size of each step is given by the parameter α, which regulates the behaviour of the gradient descent algorithm. It is a hyperparameter of the model we are training, usually called the learning rate.
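A minimal, self-contained Python sketch of this update loop; the quadratic loss below is just an illustrative stand-in:

import numpy as np

def loss(w):
    # Illustrative convex loss with minimum at w = (2, -1)
    return (w[0] - 2.0) ** 2 + (w[1] + 1.0) ** 2

def grad(w):
    # Analytic gradient of the loss above
    return np.array([2.0 * (w[0] - 2.0), 2.0 * (w[1] + 1.0)])

alpha = 0.1                   # learning rate
w = np.array([5.0, 5.0])      # any starting point
for step in range(200):
    g = grad(w)               # estimate of the gradient
    w = w - alpha * g         # move a small amount downhill
    if np.linalg.norm(g) < 1e-6:
        break                 # convergence

print(w)  # ≈ [ 2. -1.]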


Neural Networks

Two-layer Neural Network - (Bishop, 2006).

Error Backpropagation Algorithm:

1. Forward Pass: Compute all activations and outputs for an input vector.

2. Error Evaluation: Evaluate the error for all the output units using:

$$\delta_k = y_k - t_k$$

where y_k is the network output and t_k the corresponding target.

3. Backward Pass: Backpropagate the errors to each hidden unit in the network using:

$$\delta_j = h'(a_j) \sum_{k} w_{kj}\, \delta_k$$

4. Derivatives Evaluation: Evaluate the derivatives of the error with respect to each parameter using:

$$\frac{\partial E_n}{\partial w_{ji}} = \delta_j z_i$$
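A compact NumPy sketch of these four steps for a two-layer network with tanh hidden units and sigmoidal outputs paired with cross-entropy error (so that δ_k = y_k − t_k); biases are omitted and the sizes are illustrative:

import numpy as np

rng = np.random.default_rng(0)
D, M, K = 3, 5, 2                        # input, hidden, output sizes
W1 = rng.normal(scale=0.5, size=(M, D))  # first-layer weights
W2 = rng.normal(scale=0.5, size=(K, M))  # second-layer weights

x = rng.normal(size=D)                   # one input vector
t = np.array([1.0, 0.0])                 # its target

# 1. Forward pass: compute all activations and outputs
a1 = W1 @ x                              # hidden pre-activations
z = np.tanh(a1)                          # hidden activations h(a)
a2 = W2 @ z                              # output pre-activations
y = 1.0 / (1.0 + np.exp(-a2))            # sigmoidal outputs

# 2. Error evaluation at the outputs
delta_k = y - t                          # δ_k = y_k − t_k

# 3. Backward pass to the hidden units
delta_j = (1.0 - z ** 2) * (W2.T @ delta_k)   # δ_j = h'(a_j) Σ_k w_kj δ_k

# 4. Derivatives with respect to each parameter
dW2 = np.outer(delta_k, z)               # ∂E/∂w_kj = δ_k z_j
dW1 = np.outer(delta_j, x)               # ∂E/∂w_ji = δ_j x_i

# One gradient-descent update with learning rate α
alpha = 0.1
W1 -= alpha * dW1
W2 -= alpha * dW2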


Neural Networks

Two-layer Neural Network
Two-layer Neural Network - (Bishop, 2006).

Gradient Descent Update Rule:

$$\mathbf{w}^{(\tau+1)} = \mathbf{w}^{(\tau)} - \alpha\, \nabla E\!\left(\mathbf{w}^{(\tau)}\right)$$

where α is the learning rate and τ indexes the iteration.

Deep Learning

Deep Neural Network
Deep Neural Network with multiple hidden layers - QuantuMechaniX8, CC0, via Wikimedia Commons

Hyperparameter Tuning

Hyperparameter tuning is the process of optimising the parameters that govern the training of a machine learning model, such as the learning rate, number of layers, batch size, and number of epochs, to improve its performance and accuracy.


Hyperparameter Tuning

Deep Neural Network
Deep Neural Network with multiple hidden layers - QuantuMechaniX8, CC0, via Wikimedia Commons

There is no single recipe for choosing the "right" hyperparameter values, but experts tend to follow similar steps:

  1. Become one with the data
  2. Set up the end-to-end training/evaluation skeleton
  3. Start with a simple model: for most tasks, a fully-connected neural network with one hidden layer
  4. Implement a model complex enough to overfit, then regularise
  5. Tune the hyperparameters (see the sketch below)
  6. Continue training

See more in Karpathy's recipe and Tobin's lecture.
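A minimal sketch of step 5 using random search over a few hyperparameters; the search space values are illustrative, and `train_and_evaluate` stands in for the end-to-end skeleton from step 2:

import random

# Hypothetical search space; the names and values are illustrative only.
search_space = {
    "learning_rate": [1e-4, 3e-4, 1e-3, 3e-3, 1e-2],
    "num_layers":    [1, 2, 3],
    "batch_size":    [32, 64, 128],
    "num_epochs":    [10, 20, 50],
}

def random_search(train_and_evaluate, n_trials=20, seed=0):
    # `train_and_evaluate(config)` is the training/evaluation skeleton:
    # it trains a model with the given hyperparameters and returns a
    # validation score (higher is better).
    rng = random.Random(seed)
    best_config, best_score = None, float("-inf")
    for _ in range(n_trials):
        config = {name: rng.choice(values) for name, values in search_space.items()}
        score = train_and_evaluate(config)
        if score > best_score:
            best_config, best_score = config, score
    return best_config, best_score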


Hyperparameter Tuning

In the process, you should combine the methods discussed previously, such as preprocessing, feature engineering, input normalisation, and parameter initialisation.

Data Assess Pipeline

Reinforcement Learning


Reinforcement Learning

Reinforcement learning is a computational approach to learning from interaction with an environment to achieve a goal through trial and error. An agent learns to make decisions by receiving rewards or penalties (i.e., reinforcements) for its actions.


Reinforcement Learning

Search Agent Uninformed
Agent in an uninformed search problem.

The agent aims to maximise the rewards from its actions. Rewards can be immediate or sparse.

Providing a reward signal is easier than providing labelled examples (i.e., supervised learning).

RL Framework
Reinforcement Learning Framework - Megajuice, CC0, via Wikimedia Commons.

Reinforcement Learning

RL Framework
Reinforcement Learning Framework - Megajuice, CC0, via Wikimedia Commons.

The RL framework is composed of:


  • Agent: The learner and decision maker
  • Environment: The world in which the agent operates
  • State: Current situation of the environment
  • Action: What the agent can do
  • Reward: Feedback from the environment

The environment is stochastic, meaning that the outcomes of actions taken by the agent in each state are not deterministic.


Reinforcement Learning

MDP Process
Markov Decision Process - waldoalvarez, CC BY-SA 4.0 , via Wikimedia Commons.

Markov Decision Process (MDP):

A mathematical framework for modelling sequential decision problems in fully observable, stochastic environments. The outcomes are partly random and partly under the control of a decision maker.


Reinforcement Learning

RL Framework
Reinforcement Learning Framework - Megajuice, CC0, via Wikimedia Commons.

An MDP is a 4-tuple:

$$(S, A, P, R)$$

Where:

S is a set of states, with initial state s₀
A(s) is the set of actions available in each state s
P(s′ | s, a) is a transition model that gives the probability of reaching state s′ if the agent is in state s and performs action a
R(s, a, s′) is the reward function that gives the reward for every transition from s to s′ through action a
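As a concrete illustration, a small MDP can be written down directly as Python dictionaries; the states, actions, and numbers below are made up for the example:

# Toy 2-state MDP (all values are illustrative only)
states = ["s0", "s1"]
actions = {"s0": ["stay", "go"], "s1": ["stay", "go"]}

# Transition model P[s][a] = {s': probability of reaching s'}
P = {
    "s0": {"stay": {"s0": 0.9, "s1": 0.1}, "go": {"s0": 0.2, "s1": 0.8}},
    "s1": {"stay": {"s0": 0.1, "s1": 0.9}, "go": {"s0": 0.7, "s1": 0.3}},
}

# Reward function R[s][a][s'] = reward for the transition s --a--> s'
R = {
    "s0": {"stay": {"s0": 0.0, "s1": 1.0}, "go": {"s0": 0.0, "s1": 2.0}},
    "s1": {"stay": {"s0": -1.0, "s1": 0.5}, "go": {"s0": 0.0, "s1": 0.0}},
}

gamma = 0.9  # discount factor (introduced on a later slide)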


Reinforcement Learning


The transition model describes the outcome of each action in each state. Since the outcome is stochastic, we write:

$$P(s' \mid s, a)$$

the probability of reaching state s′ if action a is taken in state s.

Transitions are Markovian: the probability of reaching s′ from s depends only on s and the action a, not on the history of earlier states.

Uncertainty once again brings MDPs closer to reality when compared against deterministic approaches.

MDP Process
Markov Decision Process - waldoalvarez, CC BY-SA 4.0 , via Wikimedia Commons.

Reinforcement Learning


From every transition the agent receives a reward:

$$r = R(s, a, s')$$

The agent wants to maximise the sum of the received rewards (i.e., the utility function):

$$U([s_0, a_0, s_1, a_1, s_2, \ldots]) = R(s_0, a_0, s_1) + R(s_1, a_1, s_2) + R(s_2, a_2, s_3) + \cdots$$

The utility function depends on a sequence of states and actions, called the environment history.

MDP Process
Markov Decision Process - waldoalvarez, CC BY-SA 4.0 , via Wikimedia Commons.

Reinforcement Learning


The solution to this problem is called a policy, which specifies what the agent should do in any state it might reach. A policy is a mapping from states to actions that tells the agent what to do in each state:

$$\pi : S \rightarrow A$$

Deterministic Policy:

$$a = \pi(s)$$

Stochastic Policy:

$$a \sim \pi(a \mid s)$$

where π(a | s) is the probability of taking action a in state s.

MDP Process
Markov Decision Process - waldoalvarez, CC BY-SA 4.0 , via Wikimedia Commons.

Reinforcement Learning


The quality of a policy in a given state is measured by the expected utility of the possible environment histories generated by that policy. We can compute the utility of state sequences using additive (discounted) rewards as follows:

$$U([s_0, a_0, s_1, \ldots]) = R(s_0, a_0, s_1) + \gamma R(s_1, a_1, s_2) + \gamma^2 R(s_2, a_2, s_3) + \cdots$$

where γ ∈ [0, 1] is the discount factor that determines the importance of future rewards.

The expected utility of executing the policy π, starting in state s, is given by:

$$U^{\pi}(s) = E\!\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t, a_t, s_{t+1})\right]$$

The expectation E is with respect to the probability distribution over state sequences determined by s and π.
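A tiny sketch of the discounted sum for a concrete (made-up) reward sequence:

# Discounted return: r0 + γ·r1 + γ²·r2 + ...
def discounted_return(rewards, gamma=0.9):
    return sum(gamma ** t * r for t, r in enumerate(rewards))

print(discounted_return([1.0, 0.0, 0.0, 5.0], gamma=0.9))  # 1.0 + 0.9³·5.0 = 4.645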


Reinforcement Learning


We can compare policies at a given state using their expected utilities:

$$U^{\pi}(s) = E\!\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t, a_t, s_{t+1})\right]$$

where the expectation is with respect to the probability distribution over state sequences determined by s and π.

The goal is to select the policy that maximises the expected reward:

$$\pi^{*}_{s} = \operatorname*{argmax}_{\pi} U^{\pi}(s)$$

The policy π*ₛ recommends an action for every state in the sequence starting in state s.


Reinforcement Learning


The utility function allows the agent to select actions using the principle of maximum expected utility. The agent chooses the action that maximises the reward for the next step plus the expected discounted utility of the subsequent state:

$$\pi^{*}(s) = \operatorname*{argmax}_{a} \sum_{s'} P(s' \mid s, a)\,\big[R(s, a, s') + \gamma\, U(s')\big]$$

The utility of a state is the expected reward for the next transition plus the discounted utility of the next state, assuming that the agent chooses the optimal action. The utility of a state s is given by:

$$U(s) = \max_{a} \sum_{s'} P(s' \mid s, a)\,\big[R(s, a, s') + \gamma\, U(s')\big]$$

This is called the Bellman Equation, after Richard Bellman (1957).

Reinforcement Learning


Another important quantity is the action-utility function or Q-function, which is the expected utility of taking a given action in a given state:

$$Q(s, a) = \sum_{s'} P(s' \mid s, a)\,\big[R(s, a, s') + \gamma\, U(s')\big]$$

Q(s, a) relates to the utility function as follows:

$$U(s) = \max_{a} Q(s, a)$$

Then we have the Bellman equation for the Q-function:

$$Q(s, a) = \sum_{s'} P(s' \mid s, a)\,\big[R(s, a, s') + \gamma \max_{a'} Q(s', a')\big]$$

The Q-function tells us how good it is to take action a in state s.

The optimal policy can be extracted from Q as follows:

$$\pi^{*}(s) = \operatorname*{argmax}_{a} Q(s, a)$$


Reinforcement Learning

RL Framework
Reinforcement Learning Framework - Megajuice, CC0, via Wikimedia Commons.

Model-Based RL Agent:

  • Knows the transition model and reward function
  • Can simulate outcomes before taking actions
  • Examples: Value Iteration, Policy Iteration

Model-Free RL Agent:

  • Does not know the transition model or reward function
  • Cannot simulate outcomes
  • Examples: Q-Learning, DQN


Reinforcement Learning



Policy Iteration is a model-based reinforcement learning algorithm that alternates between policy evaluation and policy improvement to find the optimal policy.

Finds the optimal policy through iterative refinement:

  • Requires a model of the environment (transition and reward functions)
  • Guaranteed to converge to the optimal policy
  • Works with finite state and action spaces
Stochastic Matrix
Stochastic Matrix.

Reinforcement Learning



Policy Iteration consists of two main phases that alternate until convergence:


1. Policy Evaluation: Compute the value function for the current policy π:

$$V^{\pi}(s) = \sum_{s'} P(s' \mid s, \pi(s))\,\big[R(s, \pi(s), s') + \gamma\, V^{\pi}(s')\big]$$

2. Policy Improvement: Update the policy to be greedy with respect to the current value function:

$$\pi(s) \leftarrow \operatorname*{argmax}_{a} \sum_{s'} P(s' \mid s, a)\,\big[R(s, a, s') + \gamma\, V^{\pi}(s')\big]$$


Reinforcement Learning

Policy Iteration Algorithm:

The algorithm iteratively improves the policy by alternating between evaluation and improvement steps until convergence to the optimal policy.



The optimal policy maximises the expected cumulative reward by selecting actions that lead to the highest Q-values.

Policy iteration guarantees convergence to the optimal policy through the principle of policy improvement.

MDP Process
Markov Decision Process - waldoalvarez, CC BY-SA 4.0, via Wikimedia Commons.

Reinforcement Learning

Policy Iteration Algorithm:
Initialise policy π randomly
Repeat until convergence:
    // Policy Evaluation
    Repeat until convergence:
        For each state s:
            V(s) = Σ P(s'|s,π(s)) [R(s,π(s),s') + γV(s')]
    // Policy Improvement
    policy_stable = true
    For each state s:
        old_action = π(s)
        π(s) = argmaxₐ Σ P(s'|s,a) [R(s,a,s') + γV(s')]
        If old_action ≠ π(s):
            policy_stable = false
    If policy_stable:
        break

Convergence Properties:

  • Monotonic Improvement: Each iteration improves the policy
  • Finite Convergence: Guaranteed to converge in finite steps
  • Optimal Policy: Converges to the optimal policy π*
  • Bellman Optimality: Final policy satisfies Bellman optimality equations
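A minimal Python sketch of this algorithm, reusing the dictionary representation of an MDP sketched earlier (the arguments states, actions, P, R, gamma are assumed to be given in that form):

def policy_iteration(states, actions, P, R, gamma=0.9, eval_tol=1e-8):
    # Start from an arbitrary deterministic policy and zero values
    policy = {s: actions[s][0] for s in states}
    V = {s: 0.0 for s in states}

    def backup(s, a):
        # One-step lookahead: Σ_s' P(s'|s,a) [R(s,a,s') + γ V(s')]
        return sum(p * (R[s][a][s2] + gamma * V[s2]) for s2, p in P[s][a].items())

    while True:
        # Policy Evaluation: iterate the expectation backup until convergence
        while True:
            delta = 0.0
            for s in states:
                v_new = backup(s, policy[s])
                delta = max(delta, abs(v_new - V[s]))
                V[s] = v_new
            if delta < eval_tol:
                break
        # Policy Improvement: act greedily with respect to V
        policy_stable = True
        for s in states:
            best = max(actions[s], key=lambda a: backup(s, a))
            if best != policy[s]:
                policy[s] = best
                policy_stable = False
        if policy_stable:
            return policy, V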

Reinforcement Learning


Policy Iteration Limitations:

  • Model Dependency: Requires complete knowledge of the environment's transition and reward functions.
  • Computational Cost: Policy evaluation can be expensive for large state spaces, requiring iterative computation.
  • Discrete Spaces: Designed for finite state and action spaces, not suitable for continuous environments.
  • Memory Requirements: Needs to store value functions and policies for all states.
Stochastic Matrix
Stochastic Matrix.

Reinforcement Learning

Q-learning is a model-free reinforcement learning algorithm that learns the optimal action-value function directly from experience by interacting with the environment.

Finds the optimal policy by learning the Q-function:

  • Does not require a model of the environment (transition or reward function)
  • Can be used in stochastic and unknown environments
Q-Matrix
Transition Matrix (Q).

Reinforcement Learning



At each step, the agent updates its estimate of the Q-function for a state and action using the observed reward and the maximum estimated value of the next state. The update rule is defined as follows:

$$Q(s, a) \leftarrow Q(s, a) + \alpha\,\big[R(s, a, s') + \gamma \max_{a'} Q(s', a') - Q(s, a)\big]$$

where:
α is the learning rate
γ is the discount factor
R(s, a, s′) is the observed reward
s′ is the next state after taking action a in s


Reinforcement Learning

Q-Learning Algorithm:
Initialise Q(s, a) arbitrarily for all s, a
Repeat (for each episode):
    Initialise state s
    Repeat (for each step of episode):
        a ← π(s)                  // e.g., ε-greedy with respect to Q
        Take action a, observe next state s'
        r ← R(s, a, s')
        Q(s, a) ← update(Q, s, a, s', r)
        s ← s'
    until s is terminal

The agent must balance exploring new actions to discover their value and exploiting known actions to maximise reward.

ε-Greedy Policy: with probability ε choose a random action from A(s); otherwise (with probability 1 − ε) choose argmaxₐ Q(s, a).

Q-Matrix
Transition Matrix (Q).
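A minimal tabular Q-learning sketch with an ε-greedy policy; it assumes an environment object with reset() and step(action) methods (a Gym-style interface, which is not part of the slides):

import random
from collections import defaultdict

def q_learning(env, actions, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1):
    # Tabular Q-learning. Assumes env.reset() returns a state and
    # env.step(a) returns (next_state, reward, done).
    Q = defaultdict(float)  # Q[(s, a)], implicitly initialised to 0

    def epsilon_greedy(s):
        if random.random() < epsilon:
            return random.choice(actions)              # explore
        return max(actions, key=lambda a: Q[(s, a)])   # exploit

    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            a = epsilon_greedy(s)
            s_next, r, done = env.step(a)
            # Q-learning update: Q(s,a) ← Q(s,a) + α [r + γ max_a' Q(s',a') − Q(s,a)]
            best_next = 0.0 if done else max(Q[(s_next, a2)] for a2 in actions)
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s_next
    return Q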

Reinforcement Learning


Q-Learning Limitations:

  • Scalability: Q-learning struggles with large state spaces as it requires a Q-value for every state-action pair, leading to high memory usage.
  • Generalisation: Q-learning does not generalise well to unseen states since it relies on a discrete Q-table.
  • Continuous State Spaces: Q-learning is not suitable for environments with continuous state spaces as it requires discretisation, which can lead to loss of information.
  • Sample Inefficiency: Q-learning can be sample-inefficient, requiring many interactions with the environment to learn an optimal policy.
Q-Matrix
Transition Matrix (Q).

Reinforcement Learning

Deep Q-Networks (DQN) extend Q-learning to environments with large or continuous state spaces, replacing the Q-table with a neural network that approximates the Q-function:

$$Q(s, a; \mathbf{w}) \approx Q^{*}(s, a)$$

A main network approximates Q(s, a; w), where w are its trainable parameters. The input of the network is the state and the output is a Q-value for each possible action.

DQNs store past experiences in a replay buffer. During training, a minibatch of experiences is sampled from the buffer, and a separate target network is used to compute the target Q-values during updates.

Deep Neural Network
Deep Neural Network with multiple hidden layers - QuantuMechaniX8, CC0, via Wikimedia Commons

Reinforcement Learning

DQN Algorithm:
Initialise replay buffer D
Initialise Q-network with random weights w
Initialise target Q-network with weights w⁻ = w
Repeat (for each episode):
    Initialise state s
    Repeat (for each step of episode):
        a ← π(s)                  // e.g., ε-greedy with respect to Q(s, ·; w)
        Take action a, observe next state s'
        r ← R(s, a, s')
        Store (s, a, r, s') in D
        Sample a mini_batch of (s, a, r, s') from D:
            y ← r + γ maxₐ' Q(s', a'; w⁻)   for each sampled experience
            w ← gradient_descent((y - Q(s, a; w))² over the mini_batch)
        Every C steps, update w⁻ ← w
        s ← s'
    until s is terminal

ε-Greedy Policy: with probability ε choose a random action; otherwise (with probability 1 − ε) choose argmaxₐ Q(s, a; w).
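A compact PyTorch sketch of the core DQN pieces (main and target networks, replay buffer, ε-greedy action selection, and one minibatch update); the network sizes, hyperparameters, and transition format are illustrative assumptions, not part of the slides:

import random
from collections import deque

import numpy as np
import torch
import torch.nn as nn

class QNetwork(nn.Module):
    # Maps a state vector to one Q-value per action.
    def __init__(self, state_dim, n_actions, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, x):
        return self.net(x)

state_dim, n_actions = 4, 2                   # illustrative sizes
gamma, epsilon, batch_size = 0.99, 0.1, 32    # illustrative hyperparameters

q_net = QNetwork(state_dim, n_actions)        # main network (weights w)
target_net = QNetwork(state_dim, n_actions)   # target network (weights w⁻)
target_net.load_state_dict(q_net.state_dict())
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)
replay_buffer = deque(maxlen=10_000)          # stores (s, a, r, s', done) tuples

def epsilon_greedy(state):
    if random.random() < epsilon:
        return random.randrange(n_actions)                        # explore
    with torch.no_grad():
        q = q_net(torch.as_tensor(state, dtype=torch.float32))
        return int(torch.argmax(q).item())                        # exploit

def dqn_update():
    # One gradient step on a sampled minibatch, using the target network.
    if len(replay_buffer) < batch_size:
        return
    batch = random.sample(replay_buffer, batch_size)
    s, a, r, s2, done = map(np.asarray, zip(*batch))
    s = torch.as_tensor(s, dtype=torch.float32)
    a = torch.as_tensor(a, dtype=torch.int64).unsqueeze(1)
    r = torch.as_tensor(r, dtype=torch.float32)
    s2 = torch.as_tensor(s2, dtype=torch.float32)
    done = torch.as_tensor(done, dtype=torch.float32)

    q_sa = q_net(s).gather(1, a).squeeze(1)                       # Q(s, a; w)
    with torch.no_grad():                                         # y = r + γ max_a' Q(s', a'; w⁻)
        y = r + gamma * (1.0 - done) * target_net(s2).max(dim=1).values
    loss = nn.functional.mse_loss(q_sa, y)                        # (y − Q(s, a; w))²

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    # Every C steps in the outer loop: target_net.load_state_dict(q_net.state_dict())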


Reinforcement Learning

Policy Iteration:

  • Representation: value function
  • Model-based, with guaranteed convergence for finite and discrete problems

Markov Decision Process - waldoalvarez, CC BY-SA 4.0, via Wikimedia Commons.

Q-Learning:

  • Representation: Q-function
  • Model-free and simple, for small and discrete problems

Transition Matrix (Q).

DQN:

  • Representation: neural network
  • Model-free and complex, for large and continuous problems

Deep Neural Network with multiple hidden layers - QuantuMechaniX8, CC0, via Wikimedia Commons


Reinforcement Learning

AlphaGo Zero
Self-play reinforcement learning in AlphaGo Zero - Silver et al., 2017.

AlphaGo Zero combines key RL concepts:

  • Self-play environment: Agent plays against itself (no human data needed)
  • Policy network: Learns π(s) → probability distribution over actions
  • Value network: Learns V(s) → probability of winning from state s
  • Monte Carlo Tree Search: Uses policy/value to guide search
  • Temporal difference learning: Updates based on game outcomes
  • Experience replay: Stores and learns from self-play games
AI History - Machine Learning Age (2001 - present)

Timeline of AI milestones, 1940s-2020s: from the artificial neuron (McCulloch & Pitts, 1943), the Dartmouth Workshop (1956), and the expert-systems era, through the two AI winters, to deep learning (Hinton, 2006), AlexNet (2012), AlphaGo (2016), the Transformer (2017), and today's large language models (ChatGPT, 2022; LLaMA, 2023; Claude 2, 2023; Gemini 1.5, 2024; DeepSeek R1, 2025).


Conclusions

Overview

  • Hyperparameter Tuning
  • Reinforcement Learning
  • Markov Decision Process
  • Model-based RL
  • Model-free RL

Next Time

  • Attention Architecture
  • Transformers
  • Large Language Models (LLMs)
  • LLM APIs
  • Prompt Engineering
Many Thanks!

chc79@cam.ac.uk