Linear regression can be interpreted from a probabilistic perspective, where we model the uncertainty in our predictions using probability distributions.
Instead of assuming a deterministic relationship, we model the regression problem with the likelihood function:

p(y | x) = 𝒩(y; f(x), σ²)

The functional relationship between input x and target y is then:

y = f(x) + ε, with ε ∼ 𝒩(0, σ²)

Where ε is i.i.d. Gaussian observation noise with variance σ².

We want to find the function f that generated the data.

For linear regression, we have:

p(y | x, θ) = 𝒩(y; xᵀθ, σ²), i.e. y = xᵀθ + ε

Where θ is the vector of model parameters.

So, given a training set of N input-output pairs, with inputs X = {x₁, …, x_N} and targets Y = {y₁, …, y_N}:

The likelihood factorises according to:

p(Y | X, θ) = ∏ₙ p(yₙ | xₙ, θ) = ∏ₙ 𝒩(yₙ; xₙᵀθ, σ²)

The likelihood function tells us how likely the observed data is given the specific parameters.

We estimate θ by maximum likelihood: θ_ML = argmax_θ p(Y | X, θ).
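The factorised likelihood can be sketched numerically. The following is a minimal illustration (with hypothetical synthetic data, not from the course material) showing that parameters close to the generating ones score a higher log-likelihood than parameters far from them:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 1-D data: y = 2x + 1 plus Gaussian noise (sigma = 0.5).
X = rng.uniform(-1, 1, size=(20, 1))
y = 2.0 * X[:, 0] + 1.0 + rng.normal(0, 0.5, size=20)

def log_likelihood(theta, sigma, X, y):
    """Log of the factorised likelihood prod_n N(y_n; x_n^T theta, sigma^2)."""
    Xb = np.column_stack([X, np.ones(len(X))])   # absorb the bias term
    mean = Xb @ theta
    n = len(y)
    return (-0.5 * n * np.log(2 * np.pi * sigma**2)
            - 0.5 * np.sum((y - mean) ** 2) / sigma**2)

# Parameters close to the generating ones score higher than distant ones.
good = log_likelihood(np.array([2.0, 1.0]), 0.5, X, y)
bad = log_likelihood(np.array([-1.0, 0.0]), 0.5, X, y)
print(good > bad)  # True
```

Maximising this quantity over theta is exactly the maximum-likelihood estimation described above.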
Linear basis function models extend linear regression by applying non-linear transformations to the input features while keeping the model linear in the parameters.
The model becomes:

y = wᵀφ(x) = Σⱼ wⱼ φⱼ(x)

Where the φⱼ are fixed non-linear basis functions and w is the parameter vector. Common choices:

Polynomial basis: φⱼ(x) = xʲ
Gaussian basis: φⱼ(x) = exp(−(x − μⱼ)² / (2s²))
Sigmoid basis: φⱼ(x) = σ((x − μⱼ) / s), with σ the logistic sigmoid

If we apply a probabilistic interpretation, we need to maximise the likelihood of:

p(Y | X, w) = ∏ₙ 𝒩(yₙ; wᵀφ(xₙ), σ²)

After a similar process (see Chapter 3 in Bishop, 2006), the loss function becomes the sum-of-squares error:

E(w) = ½ Σₙ (yₙ − wᵀφ(xₙ))²

And the normal equation solution:

w_ML = (ΦᵀΦ)⁻¹ Φᵀ y

The design matrix Φ has entries Φₙⱼ = φⱼ(xₙ): one row per training example, one column per basis function.

By using basis functions we can model non-linear relationships in the inputs while keeping the model linear in the parameters, so the closed-form solution still applies.
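A short sketch of basis-function regression, assuming hypothetical noisy sine data and Gaussian basis functions; the normal-equation solution is computed with a least-squares solver, which is the numerically stable equivalent of inverting ΦᵀΦ:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical noisy sine data.
x = rng.uniform(0, 1, size=50)
y = np.sin(2 * np.pi * x) + rng.normal(0, 0.1, size=50)

def gaussian_design_matrix(x, centres, s=0.1):
    """Design matrix Phi with Phi[n, j] = phi_j(x_n), plus a bias column."""
    phi = np.exp(-((x[:, None] - centres[None, :]) ** 2) / (2 * s**2))
    return np.column_stack([np.ones_like(x), phi])

centres = np.linspace(0, 1, 9)
Phi = gaussian_design_matrix(x, centres)

# Normal-equation solution w = (Phi^T Phi)^{-1} Phi^T y via least squares.
w, *_ = np.linalg.lstsq(Phi, y, rcond=None)

# The model is linear in w yet fits the nonlinear target closely.
y_hat = Phi @ w
print(np.mean((y - y_hat) ** 2))  # small residual, on the order of the noise
```

The fit is nonlinear in x but still solved in closed form, which is the point of the basis-function construction.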
The perceptron is one of the earliest and simplest artificial neural network models for binary classification.
The perceptron was first simulated by Frank Rosenblatt in 1957 on an IBM 704 machine, as a model of biological neural networks. The underlying artificial neuron model had been proposed in 1943 by McCulloch & Pitts.
In 1962 Rosenblatt published "Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms".
The perceptron machine, named the Mark I, was special-purpose hardware that implemented perceptron supervised learning for image recognition.
The perceptron is a linear classifier model (i.e., a linear discriminant), with hypothesis space defined by all the functions of the form:

y(x) = f(wᵀφ(x))

The function f is the step function:

f(a) = +1 if a ≥ 0, −1 otherwise

We want to find the parameter vector w that classifies the training data correctly.

We need to derive the perceptron criterion. We know we are seeking a parameter vector w such that patterns in class +1 satisfy wᵀφ(xₙ) > 0.

And for features in class −1: wᵀφ(xₙ) < 0.

Using targets yₙ ∈ {−1, +1}, both conditions combine into yₙ wᵀφ(xₙ) > 0 for every correctly classified example.

The loss function is the perceptron criterion, summed over the set M of misclassified examples:

E_P(w) = −Σ_{n∈M} wᵀφ(xₙ) yₙ

The update rule for a misclassified input is:

w ← w + α φ(xₙ) yₙ
The perceptron learning algorithm is essentially stochastic gradient descent applied to the perceptron criterion.
Initialise weights w randomly
repeat
    for each training example (x, y)
        Compute prediction: y_pred = f(w·Φ(x))
        if y_pred ≠ y then
            Update weights: w = w + αΦ(x)y
until no misclassifications or max iterations
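The pseudocode above can be sketched in Python as follows, assuming a hypothetical linearly separable toy dataset and the identity-plus-bias feature map φ(x) = [1, x]:

```python
import numpy as np

def perceptron_train(X, y, alpha=1.0, max_iter=100):
    """Perceptron learning on targets y in {-1, +1}; phi(x) = [1, x]."""
    Phi = np.column_stack([np.ones(len(X)), X])  # bias feature
    w = np.zeros(Phi.shape[1])
    for _ in range(max_iter):
        mistakes = 0
        for phi_n, y_n in zip(Phi, y):
            if y_n * (w @ phi_n) <= 0:       # misclassified (or on boundary)
                w += alpha * phi_n * y_n     # update: w <- w + alpha*phi*y
                mistakes += 1
        if mistakes == 0:                    # converged: all points correct
            break
    return w

# Hypothetical linearly separable toy data.
X = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 3.0], [1.0, 3.0]])
y = np.array([-1, -1, 1, 1])
w = perceptron_train(X, y)
preds = np.sign(np.column_stack([np.ones(len(X)), X]) @ w)
print(preds)  # [-1. -1.  1.  1.], matching y
```

On linearly separable data the loop terminates with zero mistakes, as the perceptron convergence theorem guarantees.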
Neural networks extend the perceptron by introducing multiple layers of interconnected neurons (i.e., multi-layer perceptron), enabling the learning of complex, non-linear relationships in data.
Previous models:
Large scale problems require that we adapt the basis functions to the data.
Neural Networks:
Training is costly but inference is cheap.
Key Components:
The Universal Approximation Theorem states that a neural network with one hidden layer can approximate any continuous function on a compact subset of ℝⁿ, given sufficient neurons in the hidden layer. Intuitively, this is possible because neural networks build complex decision boundaries by composing linear transformations with non-linear activation functions.
Feedforward Neural Network:

First, recall the general form of a linear model with nonlinear basis functions:

y(x, w) = f(Σⱼ wⱼ φⱼ(x))

where f(·) is a nonlinear activation function for classification (and the identity for regression), and the φⱼ are fixed basis functions.

In neural networks, the basis functions themselves are parameterised and learned. The network is constructed as a sequence of transformations.

1. Linear combination of inputs (first layer):

aⱼ = Σᵢ wⱼᵢ⁽¹⁾ xᵢ + wⱼ₀⁽¹⁾,  j = 1, …, M

where the superscript (1) indicates the first layer, the wⱼᵢ⁽¹⁾ are weights, the wⱼ₀⁽¹⁾ are biases, and the aⱼ are called activations.

2. Nonlinear activation (hidden layer):

zⱼ = h(aⱼ)

where h(·) is a differentiable, nonlinear activation function (e.g., sigmoid or tanh) and the zⱼ are the hidden units.

3. Linear combination of hidden activations (output layer):

aₖ = Σⱼ wₖⱼ⁽²⁾ zⱼ + wₖ₀⁽²⁾,  k = 1, …, K

where K is the number of outputs and the superscript (2) indicates the second layer.

Optionally, the output activations aₖ are transformed by an output activation function, e.g. yₖ = σ(aₖ) for binary classification.

The overall network function combines these stages. For sigmoidal output unit activation functions, it takes the form:

yₖ(x, w) = σ( Σⱼ wₖⱼ⁽²⁾ h( Σᵢ wⱼᵢ⁽¹⁾ xᵢ + wⱼ₀⁽¹⁾ ) + wₖ₀⁽²⁾ )

The bias parameters can be absorbed by defining extra inputs x₀ = 1 and z₀ = 1:

yₖ(x, w) = σ( Σⱼ₌₀ wₖⱼ⁽²⁾ h( Σᵢ₌₀ wⱼᵢ⁽¹⁾ xᵢ ) )
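The two-layer forward computation with absorbed biases can be sketched directly. The weight shapes and example inputs below are hypothetical, chosen only to make the sketch runnable:

```python
import numpy as np

def forward(x, W1, W2, h=np.tanh, sigma=lambda a: 1 / (1 + np.exp(-a))):
    """Two-layer network y = sigma(W2 h(W1 x)), biases absorbed via x0 = z0 = 1."""
    x = np.concatenate([[1.0], x])   # absorb first-layer bias: x_0 = 1
    z = h(W1 @ x)                    # hidden activations z_j = h(a_j)
    z = np.concatenate([[1.0], z])   # absorb second-layer bias: z_0 = 1
    return sigma(W2 @ z)             # output y_k = sigma(a_k)

rng = np.random.default_rng(0)
W1 = rng.normal(size=(3, 3))   # 3 hidden units, 2 inputs + 1 bias column
W2 = rng.normal(size=(1, 4))   # 1 output, 3 hidden units + 1 bias column
y = forward(np.array([0.5, -0.2]), W1, W2)
print(y)  # a single sigmoid output in (0, 1)
```

The single function composition mirrors the overall network formula: each matrix multiply is one "linear combination" stage, each elementwise function one "activation" stage.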
Activation functions introduce non-linearity into the network, enabling it to learn complex patterns and relationships.
Sigmoid Function: σ(a) = 1 / (1 + e⁻ᵃ)
Range: (0, 1)
Derivative: σ′(a) = σ(a)(1 − σ(a))

Hyperbolic Tangent: tanh(a) = (eᵃ − e⁻ᵃ) / (eᵃ + e⁻ᵃ)
Range: (−1, 1)
Derivative: tanh′(a) = 1 − tanh²(a)

ReLU (Rectified Linear Unit): ReLU(a) = max(0, a)
Range: [0, ∞)
Derivative: 1 if a > 0, 0 if a < 0
Training Process:

Given a training set of N example input-output pairs (x₁, y₁), …, (x_N, y_N):

Each pair was generated by an unknown function y = f(x).

We want to find a hypothesis h that approximates the true function f.

Training Process (regression problem):

Giving a probabilistic interpretation to the network outputs:

p(y | x, w) = 𝒩(y; y(x, w), β⁻¹)

where y(x, w) is the network output and β is the precision (inverse variance) of the Gaussian noise.

For an i.i.d. training set, the likelihood function corresponds to:

p(Y | X, w, β) = ∏ₙ 𝒩(yₙ; y(xₙ, w), β⁻¹)

As we saw in previous sessions, maximising the likelihood function is equivalent to minimising the sum-of-squares error function given by:

E(w) = ½ Σₙ (y(xₙ, w) − yₙ)²

Training Process (binary classification):

The network output is:

y = σ(a), the logistic sigmoid of the output activation

We use a single target variable t ∈ {0, 1} and interpret the output y(x, w) as the conditional probability p(t = 1 | x).

For an i.i.d. training set, the error function is the cross-entropy error:

E(w) = −Σₙ [ tₙ ln y(xₙ, w) + (1 − tₙ) ln(1 − y(xₙ, w)) ]
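The two error functions are short enough to write out directly. The predictions below are hypothetical, chosen to show that the cross-entropy error rewards confident, correct probabilities:

```python
import numpy as np

def sum_of_squares_error(y_pred, y):
    """Regression loss: E(w) = 1/2 sum_n (y(x_n, w) - y_n)^2."""
    return 0.5 * np.sum((y_pred - y) ** 2)

def cross_entropy_error(y_pred, t):
    """Binary cross-entropy: E(w) = -sum_n [t ln y + (1 - t) ln(1 - y)]."""
    return -np.sum(t * np.log(y_pred) + (1 - t) * np.log(1 - y_pred))

# Hypothetical predictions: confident-correct beats confident-wrong.
t = np.array([1.0, 0.0, 1.0])
good = cross_entropy_error(np.array([0.9, 0.1, 0.8]), t)
bad = cross_entropy_error(np.array([0.1, 0.9, 0.2]), t)
print(good < bad)  # True: better predictions give lower error
```

Both losses come from the same recipe: write down a likelihood, take its negative log, and minimise.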
We choose any starting point and then compute an estimate of the gradient. We then move a small amount in the steepest downhill direction, repeating until we converge on a point in weight space with (locally) minimal loss.
Gradient Descent Algorithm:

Initialize w
repeat
    for each w[i] in w
        Compute gradient: g[i] = ∂Loss(w)/∂w[i]
        Update weight: w[i] = w[i] - α * g[i]
until convergence
The size of the step is given by the parameter α, which regulates the behaviour of the gradient descent algorithm. It is a hyperparameter of the model we are training, usually called the learning rate.
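A minimal sketch of the loop, on a toy quadratic loss whose gradient is known in closed form (both the loss and the starting point are hypothetical):

```python
import numpy as np

def gradient_descent(grad, w0, alpha=0.1, tol=1e-8, max_iter=10_000):
    """Repeat w <- w - alpha * grad(w) until the step becomes tiny."""
    w = np.asarray(w0, dtype=float)
    for _ in range(max_iter):
        g = grad(w)
        w = w - alpha * g                      # steepest-descent step
        if np.linalg.norm(alpha * g) < tol:    # convergence: negligible step
            break
    return w

# Toy loss L(w) = (w0 - 3)^2 + (w1 + 1)^2, minimised at (3, -1).
grad = lambda w: np.array([2 * (w[0] - 3), 2 * (w[1] + 1)])
w_star = gradient_descent(grad, w0=[0.0, 0.0])
print(w_star)  # approximately [3, -1]
```

Too large an α makes the iterates overshoot and diverge; too small an α makes convergence slow, which is why the learning rate needs tuning.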
Error backpropagation is an efficient algorithm for computing gradients in neural networks using the chain rule of calculus.
Error Backpropagation Algorithm:

1. Forward Pass: Compute all activations and outputs for an input vector:

aⱼ = Σᵢ wⱼᵢ zᵢ,  zⱼ = h(aⱼ)

2. Error Evaluation: Evaluate the error for all the outputs using:

δₖ = yₖ − tₖ

3. Backward Pass: Backpropagate errors for each hidden unit in the network using:

δⱼ = h′(aⱼ) Σₖ wₖⱼ δₖ

4. Derivatives Evaluation: Evaluate the derivatives for each parameter using:

∂E/∂wⱼᵢ = δⱼ zᵢ

Gradient Descent Update Rule:

w⁽ᵗ⁺¹⁾ = w⁽ᵗ⁾ − α ∇E(w⁽ᵗ⁾)

Where α is the learning rate, t indexes the iteration, and ∇E is assembled from the derivatives computed by backpropagation.
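The four-step procedure can be sketched for a network with one tanh hidden layer and linear outputs under a sum-of-squares error; the shapes and inputs below are hypothetical, and the analytic gradient is spot-checked against a finite difference:

```python
import numpy as np

def backprop(x, t, W1, W2):
    """One forward/backward pass: tanh hidden layer, linear output."""
    # 1. Forward pass: compute all activations.
    a1 = W1 @ x                     # first-layer pre-activations a_j
    z = np.tanh(a1)                 # hidden activations z_j = h(a_j)
    y = W2 @ z                      # linear outputs y_k
    # 2. Error evaluation at the outputs.
    delta_k = y - t                 # delta_k = y_k - t_k
    # 3. Backward pass: propagate errors to the hidden units.
    delta_j = (1 - z**2) * (W2.T @ delta_k)   # h'(a_j) * sum_k w_kj delta_k
    # 4. Derivatives: dE/dw = delta times the input feeding that weight.
    return np.outer(delta_j, x), np.outer(delta_k, z)

rng = np.random.default_rng(0)
x, t = rng.normal(size=3), rng.normal(size=2)
W1, W2 = rng.normal(size=(4, 3)), rng.normal(size=(2, 4))
dW1, dW2 = backprop(x, t, W1, W2)

# Check one entry of dW1 against a finite difference of E = 1/2 ||y - t||^2.
def loss(W1):
    y = W2 @ np.tanh(W1 @ x)
    return 0.5 * np.sum((y - t) ** 2)

eps = 1e-6
W1p = W1.copy(); W1p[0, 0] += eps
W1m = W1.copy(); W1m[0, 0] -= eps
numeric = (loss(W1p) - loss(W1m)) / (2 * eps)
print(abs(dW1[0, 0] - numeric) < 1e-6)  # True: backprop matches
```

The finite-difference check is a standard debugging technique for backpropagation implementations: the two gradients should agree to several decimal places.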
Deep learning extends neural networks by using multiple hidden layers to learn hierarchical representations of data, enabling the automatic discovery of complex features.
Key Characteristics:
Vanishing and Exploding Gradients: In deep networks, gradients can become very small (vanishing) or very large (exploding) as they are propagated backwards through many layers, making the early layers train very slowly or making training unstable.
Weight Initialization:

Xavier/Glorot Initialization: w ∼ 𝒩(0, 2 / (n_in + n_out))

He Initialization (for ReLU): w ∼ 𝒩(0, 2 / n_in)

Where n_in and n_out are the number of units feeding into and out of the layer.

Input normalisation: x′ = (x − μ) / σ

Where μ and σ are the mean and standard deviation of each input feature over the training set.
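Both ideas fit in a few lines. The sketch below assumes hypothetical layer sizes and feature data, and checks that normalisation produces zero-mean, unit-variance features:

```python
import numpy as np

rng = np.random.default_rng(0)

def xavier_init(n_in, n_out):
    """Xavier/Glorot: variance 2 / (n_in + n_out), keeps signal scale stable."""
    return rng.normal(0.0, np.sqrt(2.0 / (n_in + n_out)), size=(n_out, n_in))

def he_init(n_in, n_out):
    """He: variance 2 / n_in, compensating for ReLU zeroing half the units."""
    return rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_out, n_in))

def normalise(X):
    """Standardise each input feature to zero mean and unit variance."""
    return (X - X.mean(axis=0)) / X.std(axis=0)

X = rng.uniform(5, 10, size=(100, 3))   # hypothetical un-centred features
Xn = normalise(X)
print(np.allclose(Xn.mean(axis=0), 0.0, atol=1e-10),
      np.allclose(Xn.std(axis=0), 1.0))  # True True
```

Keeping activations and gradients at a consistent scale across layers is precisely what mitigates the vanishing/exploding-gradient problem described above.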
Modern Optimisers:
Optimisers train models by efficiently navigating the loss landscape to find optimal parameters. They help in accelerating convergence, avoiding local minima, and improving generalisation.
Regularization Techniques:
Dropout: Randomly deactivate neurons during training
L2 Regularization: Add penalty to loss function
Early Stopping: Stop training when validation loss increases
Data Augmentation: Create additional training examples through transformations
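Two of these techniques are simple enough to sketch directly. The following shows an L2 penalty term and inverted dropout (the scaling convention used so that no rescaling is needed at inference); the dropout rate and array sizes are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

def l2_penalty(weights, lam=1e-3):
    """L2 regularisation: add lam/2 * ||w||^2 to the loss (weight decay)."""
    return 0.5 * lam * sum(np.sum(W**2) for W in weights)

def dropout(z, p=0.5, training=True):
    """Inverted dropout: zero each activation with prob p, rescale the rest."""
    if not training:
        return z                       # at inference, use all neurons
    mask = rng.random(z.shape) >= p    # keep each unit with probability 1 - p
    return z * mask / (1.0 - p)        # rescale so the expected value is unchanged

z = np.ones(10_000)
zd = dropout(z, p=0.5)
print(zd.mean())  # close to 1: the expectation is preserved by rescaling
```

The L2 term is added to the training loss so its gradient shrinks the weights each step; dropout instead injects noise during training, discouraging co-adaptation between neurons.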
Network Architectures:
The architecture impacts the model's performance. The selection criteria include: data type, task complexity, computational resources, and generalisation needs.