The Transformer Architecture
Description: This lecture introduces the transformer neural network architecture, which underlies modern Large Language Models (LLMs). We will begin by formalising the architecture and its training, and then introduce how to use and tailor LLMs for ML projects and daily activities.
Department: Centro de Estudios y Asesorías en Estadística (CEASE)
Institution: Universidad de Nariño
Date: July 12, 2025
Hours: 4
From: 10:00 am
To: 12:00 pm
Resources
Papers and Reports
- Vaswani, A., et al. (2017). Attention Is All You Need. Advances in Neural Information Processing Systems.
- Devlin, J., et al. (2018). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.
- Radford, A., et al. (2019). Language Models are Unsupervised Multitask Learners. OpenAI Blog.
- OpenAI. (2023). GPT-4 Technical Report.
- Paleyes. (2025). LLM Performance for Code Generation on Noisy Tasks.
- Sendyka. (2025). Prompt Variability Effects on LLM Code Generation.