Session 1 - Artificial Intelligence and Machine Learning

Artificial Intelligence and Machine Learning

Christian Cabrera Jojoa

Senior Research Associate and Affiliated Lecturer

Department of Computer Science and Technology

University of Cambridge

chc79@cam.ac.uk

Session 1 - Artificial Intelligence and Machine Learning

Course Structure

Session 1 - Artificial Intelligence and Machine Learning

Course Structure

  • Remote sessions combining theory and practice
  • Content will be published weekly
  • Recordings are an option
  • You can ask questions at any moment
  • Participation is key
  • You will have (small) homework after each session
  • Contact by email: chc79@cam.ac.uk
Session 1 - Artificial Intelligence and Machine Learning

The ML Context

Session 1 - Artificial Intelligence and Machine Learning

Artificial Intelligence (AI)

"The field of Artificial Intelligence (AI) is concerned with understanding and building intelligent entities."

Artificial Intelligence: A Modern Approach book cover
Stuart Russell and Peter Norvig, Artificial Intelligence: A Modern Approach (4th Edition)
Session 1 - Artificial Intelligence and Machine Learning

AI Dimensions

Human

Rational

Thought

Behaviour

Session 1 - Artificial Intelligence and Machine Learning

AI Dimensions

Human

Rational

Thought

Systems that think like humans, focusing on cognitive and mental models that mimic human reasoning.
• Example: Context-aware chatbots

Behaviour

Session 1 - Artificial Intelligence and Machine Learning

AI Dimensions

Human

Rational

Thought

Systems that think like humans, focusing on cognitive and mental models that mimic human reasoning.
• Example: Context-aware chatbots

Systems that think rationally, using logical principles and formal reasoning to solve problems and make decisions.
• Example: Rule-based problem solvers

Behaviour

Session 1 - Artificial Intelligence and Machine Learning

AI Dimensions

Human

Rational

Thought

Systems that think like humans, focusing on cognitive and mental models that mimic human reasoning.
• Example: Context-aware chatbots

Systems that think rationally, using logical principles and formal reasoning to solve problems and make decisions.
• Example: Rule-based problem solvers

Behaviour

Systems that act like humans, exhibiting behaviours and interactions that are natural and intuitive for human users.
• Example: Human-like robots

Session 1 - Artificial Intelligence and Machine Learning

AI Dimensions

Human

Rational

Thought

Systems that think like humans, focusing on cognitive and mental models that mimic human reasoning.
• Example: Context-aware chatbots

Systems that think rationally, using logical principles and formal reasoning to solve problems and make decisions.
• Example: Rule-based problem solvers

Behaviour

Systems that act like humans, exhibiting behaviours and interactions that are natural and intuitive for human users.
• Example: Human-like robots

Systems that act rationally, making optimal decisions based on available information and defined goals.
• Example: Optimal decision makers

Session 1 - Artificial Intelligence and Machine Learning

AI Dimensions

Human

Rational

Thought

• Cognitive modeling
• Human-like reasoning
• Natural language understanding
• Common sense reasoning

• Logic-based systems
• Theorem proving
• Knowledge representation
• Expert systems

Behaviour

• Natural language processing
• Computer vision
• Robotics
• Human-computer interaction

• Optimisation algorithms
• Game theory
• Planning systems
• Decision theory

Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Philosophy Mathematics Economics Neuroscience Psychology Computer Engineering Control Theory Linguistics Mind-body problem Logic Reasoning Ethics Probability Statistics Optimisation Graph theory Decision theory Game theory Neural networks Learning Cognition Memory Algorithms Systems Feedback Language NLP Communication
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Aristotle
Aristotle (384–322 BC)
Al-Khwarizmi
Al-Khwarizmi (780 - 850 AD)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Ramon Llull
Ramon Llull (1232 - 1316 AD)
Gottfried Leibniz
Gottfried Leibniz (1646 - 1716 AD)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Thomas Bayes
Thomas Bayes (1701 - 1761 AD)
Carl Friedrich Gauss
Carl Friedrich Gauss (1777 - 1855 AD)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Charles Babbage
Charles Babbage (1791 - 1871 AD)
Ada Lovelace
Ada Lovelace (1815 - 1852 AD)
George Boole
George Boole (1815 - 1864 AD)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

David Hilbert
David Hilbert (1862 - 1943 AD)
Bertrand Russell
Bertrand Russell (1872 - 1970 AD)
Alan Turing
Alan Turing (1912 - 1954 AD)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Aristotle
Aristotle (384–322 BC)
Al-Khwarizmi
Al-Khwarizmi (780 - 850 AD)
Ramon Llull
Ramon Llull (1232 - 1316 AD)
Gottfried Leibniz
Gottfried Leibniz (1646 - 1716 AD)
Thomas Bayes
Thomas Bayes (1701 - 1761 AD)
Carl Friedrich Gauss
Carl Friedrich Gauss (1777 - 1855 AD)
Charles Babbage
Charles Babbage (1791 - 1871 AD)
Ada Lovelace
Ada Lovelace (1815 - 1852 AD)
George Boole
George Boole (1815 - 1864 AD)
David Hilbert
David Hilbert (1862 - 1943 AD)
Bertrand Russell
Bertrand Russell (1872 - 1970 AD)
Alan Turing
Alan Turing (1912 - 1954 AD)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Enigma Machine, Bletchley Park
Enigma Machine, Bletchley Park
Statue of Alan Turing, Bletchley Park
Statue of Alan Turing, Bletchley Park (Stephen Kettle, 2007)
The Bombe (Turing & Welchman, 1939)
Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Bletchley Park
Bletchley Park: https://en.wikipedia.org/wiki/Bletchley_Park

Almost ten thousand personnel worked at Bletchley Park by 1945.

Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

Bletchley Park Codebreakers
Bletchley Park - Codebreakers: https://en.wikipedia.org/wiki/Bletchley_Park

Almost ten thousand personnel worked at Bletchley Park by 1945.

Session 1 - Artificial Intelligence and Machine Learning

AI Foundations

The Colossus
The Colossus Computer (Tommy Flowers, 1943-1944): https://en.wikipedia.org/wiki/Colossus_computer

Almost ten thousand personnel worked at Bletchley Park by 1945.

Session 1 - Artificial Intelligence and Machine Learning

AI History

Session 1 - Artificial Intelligence and Machine Learning

AI History

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030
Session 1 - Artificial Intelligence and Machine Learning

AI History - Inception and Early Approaches (1943 - 1969)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) The DENDRAL (Buchanan, 1969)
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Problem Definition

Formal Definition

Computational Representation

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Problem Definition

Formal Definition

Computational Representation

An agent must find a way to reach a goal in its environment. But, the next step is not obvious.

Shortest path problem
Shortest path problem: https://en.wikipedia.org/wiki/Shortest_path_problem
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Problem Definition

Formal Definition

Computational Representation

An agent must find a way to reach a goal in its environment. But, the next step is not obvious.

Shortest path problem
Shortest path problem: https://en.wikipedia.org/wiki/Shortest_path_problem

A search problem is defined by a set of states, an initial state, a set of goal states, and a set of actions or transitions between states.

: set of states
: set of actions
: initial state
: set of goal states

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Problem Definition

Formal Definition

Computational Representation

An agent must find a way to reach a goal in its environment. But, the next step is not obvious.

Shortest path problem
Shortest path problem: https://en.wikipedia.org/wiki/Shortest_path_problem

A search problem is defined by a set of states, an initial state, a set of goal states, and a set of actions or transitions between states.

: set of states
: set of actions
: initial state
: set of goal states

A search problem is typically modelled using data structures such as graphs, adjacency matrices, queues and stacks.

Adjacency matrix for shortest path graph
Adjacency matrix
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Search problem instance for uninformed algorithms
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Search problem instance for uninformed algorithms
Non-informed search algorithms comparison
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Search problem instance for informed algorithms
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - Searching Algorithms

Finding path search problem
Informed search algorithms comparison
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Perception

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Perception

"Within ten years a digital computer will be the world's chess champion." (Simon & Newell, 1958)


"Machines will be capable, within twenty years, of doing any work a man can do." (Simon, 1965)


"Within a generation... the problem of creating 'artificial intelligence' will substantially be solved." (Minsky, 1967)


"In from three to eight years we will have a machine with the general intelligence of an average human being." (Minsky, 1970)

You'll own slaves by 1965 (1957)
You'll own "slaves" by 1965 (1957) - https://medium.com/@theo/do-we-need-robot-rights-in-the-age-of-artificial-intelligence-690b9951bae0
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Perception

"Within ten years a digital computer will be the world's chess champion." (Simon & Newell, 1958)


"Machines will be capable, within twenty years, of doing any work a man can do." (Simon, 1965)


"Within a generation... the problem of creating 'artificial intelligence' will substantially be solved." (Minsky, 1967)


"In from three to eight years we will have a machine with the general intelligence of an average human being." (Minsky, 1970)

Machines that Think Held Evil (1960)
Machines that Think Held Evil (1960) - https://newsletter.pessimistsarchive.org/p/the-original-ai-doomer-dr-norbert
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Perception

"Within ten years a digital computer will be the world's chess champion." (Simon & Newell, 1958)


"Machines will be capable, within twenty years, of doing any work a man can do." (Simon, 1965)


"Within a generation... the problem of creating 'artificial intelligence' will substantially be solved." (Minsky, 1967)


"In from three to eight years we will have a machine with the general intelligence of an average human being." (Minsky, 1970)

Automation Concerns (1960)
Automation Concerns (1960) - https://newsletter.pessimistsarchive.org/p/the-original-ai-doomer-dr-norbert
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Perception

"Within ten years a digital computer will be the world's chess champion." (Simon & Newell, 1958)


"Machines will be capable, within twenty years, of doing any work a man can do." (Simon, 1965)


"Within a generation... the problem of creating 'artificial intelligence' will substantially be solved." (Minsky, 1967)


"In from three to eight years we will have a machine with the general intelligence of an average human being." (Minsky, 1970)

IBM Response (1960)
IBM Response (1960) - https://newsletter.pessimistsarchive.org/p/the-original-ai-doomer-dr-norbert
Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Limitations

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Limitations

Computing power, algorithms, and data were insufficient to solve real-world problems.


Combinatorial explosion


Easy tasks for humans are difficult for AI (Moravec's Paradox)

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Limitations

Maze navigation search problem

Computing power, algorithms, and data were insufficient to solve real-world problems.


Combinatorial explosion


Easy tasks for humans are difficult for AI (Moravec's Paradox)

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Limitations

Combinatorial explosion diagram

Computing power, algorithms, and data were insufficient to solve real-world problems.


Combinatorial explosion


Easy tasks for humans are difficult for AI (Moravec's Paradox)

Session 1 - Artificial Intelligence and Machine Learning

Early AI Approaches (1943-1969) - AI Limitations

Growth comparison diagram

Computing power, algorithms, and data were insufficient to solve real-world problems.


Combinatorial explosion


Easy tasks for humans are difficult for AI (Moravec's Paradox)

Session 1 - Artificial Intelligence and Machine Learning

AI History - First Winter (1974 - 1980)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 First AI Winter (1974-1980) Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) ALPAC Report (USA, 1966) The DENDRAL (Buchanan, 1969) Perceptrons Book (Minsky & Papert, 1969) Lighthill Report (UK, 1973)
Session 1 - Artificial Intelligence and Machine Learning

AI History - Expert Systems (1969 - 1986)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 First AI Winter (1974-1980) Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) ALPAC Report (USA, 1966) The DENDRAL (Buchanan, 1969) Perceptrons Book (Minsky & Papert, 1969) PROLOG (1972) MYCIN (Stanford, 1972) Lighthill Report (UK, 1973) FRAMES (1975) Hopfield net (1982) R1 (McDermott, 1982) Parallel Distributed Processing (Rumelhart & McClelland, 1986)
Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Problem Definition

Formal Definition

Computational Representation

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Problem Definition

Formal Definition

Computational Representation

Agents are limited because of the problem's complexity. They should leverage human knowledge and emulate human reasoning.

Data, Information, and Knowledge
Data, Information, and Knowledge (Per Liew, 2007)
Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Problem Definition

Formal Definition

Computational Representation

Agents are limited because of the problem's complexity. They should leverage human knowledge and emulate human reasoning.

Data, Information, and Knowledge
Data, Information, and Knowledge (Per Liew, 2007)

An expert system is defined by a knowledge base, inference engine, and user interface that work together to apply domain expertise to specific problems.

: Knowledge Base
: Inference Engine
: User Interface

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Problem Definition

Formal Definition

Computational Representation

Agents are limited because of the problem's complexity. They should leverage human knowledge and emulate human reasoning.

Data, Information, and Knowledge
Data, Information, and Knowledge (Per Liew, 2007)

An expert system is defined by a knowledge base, inference engine, and user interface that work together to apply domain expertise to specific problems.

: Knowledge Base
: Inference Engine
: User Interface

Knowledge is typically modelled using subject, object, predicate semantic triple model.

Semantic Triple Model
Basic Semantic Triple

Rules are if statements.

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

"An ontology is an explicit specification of a shared conceptualisation." (Gruber, 1993)

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Vehicles Ontology
Vehicles Ontology: ​English Wikipedia user Gwernol, CC BY-SA 3.0 , via Wikimedia Commons

"An ontology is an explicit specification of a shared conceptualisation." (Gruber, 1993)

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Spacecraft Ontology
Spacecraft Ontology: Fuzheado, CC BY-SA 4.0 , via Wikimedia Commons

"An ontology is an explicit specification of a shared conceptualisation." (Gruber, 1993)

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Ontology of Things
Ontology of Things: Niceclat, CC BY-SA 4.0 , via Wikimedia Commons

"An ontology is an explicit specification of a shared conceptualisation." (Gruber, 1993)

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Knowledge Base

Example: Medical diagnosis

The expert system embodies medical knowledge

Forward Chaining (Data-Driven)

Facts: The patient has fever, cough, headache, and muscle pain

Rules:
R1: If (fever AND cough) then 
(possible_flu)
R2: If (possible_flu AND headache) then (influenza)
R3: If (influenza AND muscle_pain) then (severe_case)
Process:
Facts: [fever, cough, headache, muscle_pain]
→ apply R1: add possible_flu
Facts: [headache, muscle_pain, possible_flu]
→ apply R2: add influenza
Facts: [muscle_pain, influenza]
→ apply R3: conclude sever_case
Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Knowledge-Based Systems

Knowledge Base

Example: Medical diagnosis

The expert system embodies medical knowledge

Backward Chaining (Goal-Driven)

Facts: The patient has fever, cough, headache, and muscle pain

Goal: Determine if the patient's case is severe

Rules:
R1: If (fever AND cough) then 
(possible_flu)
R2: If (possible_flu AND headache) then (influenza)
R3: If (influenza AND muscle_pain) then (severe_case)
Process:
→ check for severe_case via R3
→ check for muscle_pain (found)
→ check for influenza via R2
→ check for headache (found)
→ check for possible_flu via R1
→ check for fever and cough
→ conclude severe_case is true
Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Relative Success

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Relative Success

DENDRAL (1960s)

Identified unknown organic molecules using knowledge of chemistry


MYCIN (Early 1970s)

Supported bacterial infections diagnosis and treatment


XCON/R1 (1982)

eXpert CONfigurer - Automated the configuration of VAX computer systems (successful deployment)

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - Relative Success

DENDRAL (1960s)

Identified unknown organic molecules using knowledge of chemistry


MYCIN (Early 1970s)

Supported bacterial infections diagnosis and treatment


XCON/R1 (1982)

eXpert CONfigurer - Automated the configuration of VAX computer systems (successful deployment)

PUFF (1982)

Interpreted pulmonary function test results to diagnose lung disorders


PROSPECTOR (1986)

An expert system for mineral exploration


DEEP BLUE (1997)

An expert system that defeated a chess world champion

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Perception

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Perception

"In medicine, management, and the military — indeed in most of the world's work — the daily tasks are those requiring symbolic reasoning with detailed professional knowledge." (Feigenbaum, 1982)


"Commercialising Artificial Intelligence." (The New York Times, 1982)


"Gains are Slow for Artificial Intelligence Industry." (The New York Times, 1987)


"New expert systems companies were being formed at a rate of what seemed like one a week. " (Hart, 2021)


Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Perception

Artificial Intelligence (1981)
Issues with knowledge libraries (Robersts, 1981) - https://microship.com/artificial-intelligence-byte/

"In medicine, management, and the military — indeed in most of the world's work — the daily tasks are those requiring symbolic reasoning with detailed professional knowledge." (Feigenbaum, 1982)


"Commercialising Artificial Intelligence." (The New York Times, 1982)


"Gains are Slow for Artificial Intelligence Industry." (The New York Times, 1987)


"New expert systems companies were being formed at a rate of what seemed like one a week. " (Hart, 2021)


Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Perception

Intelligence is more than experts (1985)
Intelligence is More than Experts (The Sydney Morning Herald, 1985)

"In medicine, management, and the military — indeed in most of the world's work — the daily tasks are those requiring symbolic reasoning with detailed professional knowledge." (Feigenbaum, 1982)


"Commercialising Artificial Intelligence." (The New York Times, 1982)


"Gains are Slow for Artificial Intelligence Industry." (The New York Times, 1987)


"New expert systems companies were being formed at a rate of what seemed like one a week. " (Hart, 2021)


Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Perception

AI just could be a smart buy (1987)
AI just could be a smart buy (1987)

"In medicine, management, and the military — indeed in most of the world's work — the daily tasks are those requiring symbolic reasoning with detailed professional knowledge." (Feigenbaum, 1982)


"Commercialising Artificial Intelligence." (The New York Times, 1982)


"Gains are Slow for Artificial Intelligence Industry." (The New York Times, 1987)


"New expert systems companies were being formed at a rate of what seemed like one a week. " (Hart, 2021)


Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Perception

Deep Blue defeats Kasparov (1997)
Deep Blue defeats Kasparov (Los Angeles Times, 1997)

"In medicine, management, and the military — indeed in most of the world's work — the daily tasks are those requiring symbolic reasoning with detailed professional knowledge." (Feigenbaum, 1982)


"Commercialising Artificial Intelligence." (The New York Times, 1982)


"Gains are Slow for Artificial Intelligence Industry." (The New York Times, 1987)


"New expert systems companies were being formed at a rate of what seemed like one a week. " (Hart, 2021)


Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Limitations

Session 1 - Artificial Intelligence and Machine Learning

Expert Systems (1969-1986) - AI Limitations

Knowledge Complexity
The Tree of Knowledge System: Gregg Henriques, CC BY-SA 4.0 , via Wikimedia Commons

Extracting knowledge from human experts and encoding it into rules was difficult, time-consuming, and expensive


Systems could not reason beyond their pre-programmed knowledge and failed when confronted with unexpected situations


Adding more rules often led to rule interaction problems and combinatorial explosion


Updating knowledge bases as domains evolved required significant effort

Session 1 - Artificial Intelligence and Machine Learning

AI History - Second AI Winter (1987 - 1994)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 First AI Winter (1974-1980) Second AI Winter (1987-1994) Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) ALPAC Report (USA, 1966) The DENDRAL (Buchanan, 1969) Perceptrons Book (Minsky & Papert, 1969) PROLOG (1972) MYCIN (Stanford, 1972) Lighthill Report (UK, 1973) FRAMES (1975) Hopfield net (1982) R1 (McDermott, 1982) Parallel Distributed Processing (Rumelhart & McClelland, 1986)
Session 1 - Artificial Intelligence and Machine Learning

AI History - Learning from Examples (1987 - present)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 Artificial Neuron (McCulloch & Pitts, 1943) First AI Winter (1974-1980) Second AI Winter (1987-1994) Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) ALPAC Report (USA, 1966) The DENDRAL (Buchanan, 1969) Perceptrons Book (Minsky & Papert, 1969) PROLOG (1972) MYCIN (Stanford, 1972) Lighthill Report (UK, 1973) FRAMES (1975) Hopfield net (1982) R1 (McDermott, 1982) Parallel Distributed Processing (Rumelhart & McClelland, 1986) Bayesian Networks (Pearls, 1988) Reinforcement Learning (Sutton, 1988) Image Recognition (LeCun et al., 1990) Deep Blue beats Kasparov (IBM, 1997)
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Problem Definition

Formal Definition

Computational Representation

Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Problem Definition

Formal Definition

Computational Representation

Agents cannot be fully pre-programmed. Agent must learn from examples to perform a task.

Learning from data
Learning from examples: classification problem
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Problem Definition

Formal Definition

Computational Representation

Agents cannot be fully pre-programmed. Agent must learn from examples to perform a task.

Learning from data
Learning from examples: classification problem

Learning from examples requires a function that approximates patterns in data, using a learning algorithm, and evaluation criteria.

: dataset of examples or pairs
: hypothesis space of possible functions
: loss/reward function measuring success
: algorithm to search through





Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Problem Definition

Formal Definition

Computational Representation

Agents cannot be fully pre-programmed. Agent must learn from examples to perform a task.

Learning from data
Learning from examples: classification problem

Learning from examples requires a function that approximates patterns in data, using a learning algorithm, and evaluation criteria.

: dataset of examples or pairs
: hypothesis space of possible functions
: loss/reward function measuring success
: algorithm to search through




Learning algorithms use different computational representations during the learning and inference: dataframes, tuples, trees, graphs, matrices, etc.




Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Algorithms
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Algorithms
Supervised vs Non-supervised
Supervised vs Non-supervised: Balkiss.hamad, CC BY-SA 4.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Algorithms
Supervised vs Non-supervised
Supervised vs Non-supervised: Balkiss.hamad, CC BY-SA 4.0 , via Wikimedia Commons
Reinforcement Learning
Reinforcement Learning
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Models
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Models
Linear Regression Model
Linear regression model: Sewaqu, Public domain, via Wikimedia Commons, via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

Learning from Examples (1987 - present) - Data-Driven Algorithms

Models
Linear Regression Model
Linear regression model: Sewaqu, Public domain, via Wikimedia Commons, via Wikimedia Commons
Neural Network Model
Neural Network Model: Glosser.ca, CC BY-SA 3.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

AI History - Big Data (2000s)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 Artificial Neuron (McCulloch & Pitts, 1943) First AI Winter (1974-1980) Second AI Winter (1987-1994) Big Data (2000-2012) Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) ALPAC Report (USA, 1966) The DENDRAL (Buchanan, 1969) Perceptrons Book (Minsky & Papert, 1969) PROLOG (1972) MYCIN (Stanford, 1972) Lighthill Report (UK, 1973) FRAMES (1975) Hopfield net (1982) R1 (McDermott, 1982) Parallel Distributed Processing (Rumelhart & McClelland, 1986) Bayesian Networks (Pearls, 1988) Reinforcement Learning (Sutton, 1988) Image Recognition (LeCun et al., 1990) Deep Blue beats Kasparov (IBM, 1997)
Session 1 - Artificial Intelligence and Machine Learning

AI History - Big Data (2000s)

Session 1 - Artificial Intelligence and Machine Learning

AI History - Big Data (2000s)

Big Data
Big Data: Camelia.boban, CC BY-SA 3.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

AI History - Big Data (2000s)

Large and complex datasets that improve models' statistical power

  • Social Networks
  • Mobile Computing
  • Internet of Things
  • ...

Big Data
Big Data: Camelia.boban, CC BY-SA 3.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

AI History - Big Data (2000s)

Moore's Law
Session 1 - Artificial Intelligence and Machine Learning

AI History - Machine Learning Age (2001 - present)

1940 1950 1960 1970 1980 1990 2000 2010 2020 2030 First AI Winter (1974-1980) Second AI Winter (1987-1994) Big Data (2000-2012) Artificial Neuron (McCulloch & Pitts, 1943) Information Theory (Shannon, 1948) Cybernetics (Wiener, 1948) Updating Rule (Hebbian, 1949) Computing Machinery and Intelligence (Turing, 1950) SNARC (Minsky, 1951) AI Term (Dartmouth Workshop, 1956) GPS (Newell & Simon, 1957) Advice Taker (McCarthy, 1958) Back-Propagation (Kelley, 1960) Perceptrons (Rosenblatt, 1962) ELIZA (MIT, 1966) ALPAC Report (USA, 1966) The DENDRAL (Buchanan, 1969) Perceptrons Book (Minsky & Papert, 1969) PROLOG (1972) MYCIN (Stanford, 1972) Lighthill Report (UK, 1973) FRAMES (1975) Hopfield net (1982) R1 (McDermott, 1982) Parallel Distributed Processing (Rumelhart & McClelland, 1986) Bayesian Networks (Pearls, 1988) Reinforcement Learning (Sutton, 1988) Image Recognition (LeCun et al., 1990) Deep Blue beats Kasparov (IBM, 1997) Deep Learning (Hinton, 2006) Watson wins Jeopardy (2011) AlexNet (Krizhevsky, 2012) GANs (Goodfellow, 2014) AlphaGo beats Lee Sedol (DeepMind, 2016) Transformer (Vaswani, 2017) AlphaFold (DeepMind, 2018) GPT-1 (OpenAI, 2020) BERT (Google, 2019) Chinchilla (DeepMind, 2022) ChatGPT (OpenAI, 2022) LLaMA (Meta AI, 2023) Claude 2 (Anthropic, 2023) phi-3 (Microsoft, 2024) Gemini 1.5 (Google DeepMind, 2024) Qwen3 (Alibaba, 2025) R1 (DeepSeek) 2025)
Session 1 - Artificial Intelligence and Machine Learning

ML Today

Session 1 - Artificial Intelligence and Machine Learning

AI Today - Applications

Session 1 - Artificial Intelligence and Machine Learning

AI Today - Applications

Everyday Applications

  • Voice and Chat Assistants (Siri, Chat-GPT)
  • Content Recommendations (Netflix, Spotify)
  • Smart Home Devices
  • Facial Recognition
  • Language Translation
  • Fraud Detection
  • Personalised Marketing
  • Gaming
  • ...
Applications
Session 1 - Artificial Intelligence and Machine Learning

AI Today - Promises

Session 1 - Artificial Intelligence and Machine Learning

AI Today - Promises

Promises

  • Early diagnosis and personalised medicine
  • Drug discovery
  • Climate change solutions
  • Personalised learning
  • Accelerating science
  • Productivity boost, automation
  • Smart homes, cities, etc.
  • ...
Protein
Protein Folding: BQUB23-Anialet, CC BY-SA 4.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

AI Today - Risks

Session 1 - Artificial Intelligence and Machine Learning

AI Today - Risks

Risks
Designed by stories / Freepik

Risks

  • Bias and discrimination, privacy violations
  • Job displacement, digital divide
  • Data quality issues, system vulnerabilities
  • Adversarial attacks, data breaches
  • Implementation costs, market disruption
  • Energy consumption, carbon footprint
  • Regulatory compliance, liability issues
  • ...
Session 1 - Artificial Intelligence and Machine Learning

AI Today - Risks

Risks

Risks

  • Bias and discrimination, privacy violations
  • Job displacement, digital divide
  • Data quality issues, system vulnerabilities
  • Adversarial attacks, data breaches
  • Implementation costs, market disruption
  • Energy consumption, carbon footprint
  • Regulatory compliance, liability issues
  • ...
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Perception
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Perception
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Perception
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Perception
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"We should be able to do 90 percent of miles driven [autonomously] within three years." (Musk, 2013)


"From a technology standpoint, Tesla will have a car that can do full autonomy in about three years, maybe a bit sooner." (Musk, 2015)


"I feel very confident predicting that there will be autonomous robotaxis from Tesla next year." (Musk, 2019)

...

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"We should be able to do 90 percent of miles driven [autonomously] within three years." (Musk, 2013)


"From a technology standpoint, Tesla will have a car that can do full autonomy in about three years, maybe a bit sooner." (Musk, 2015)


"I feel very confident predicting that there will be autonomous robotaxis from Tesla next year." (Musk, 2019)

...

Musk
See full list of predictions: https://en.wikipedia.org/wiki/List_of_predictions_for_autonomous_Tesla_vehicles_by_Elon_Musk
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"We should be able to do 90 percent of miles driven [autonomously] within three years." (Musk, 2013)


"From a technology standpoint, Tesla will have a car that can do full autonomy in about three years, maybe a bit sooner." (Musk, 2015)


"I feel very confident predicting that there will be autonomous robotaxis from Tesla next year." (Musk, 2019)

...

"I predict that there will be millions of Teslas operating fully autonomously in the second half of next year." (Musk, 2025)

Musk
See full list of predictions: https://en.wikipedia.org/wiki/List_of_predictions_for_autonomous_Tesla_vehicles_by_Elon_Musk
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Geoffrey Hinton
NOBEL Prizes Ceremony 2024: Arthur Petron, CC BY-SA 4.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"We should stop training radiologists now. It’s just completely obvious that within five years, deep learning is going to do better than radiologists." (Hinton, 2016)

Geoffrey Hinton
NOBEL Prizes Ceremony 2024: Arthur Petron, CC BY-SA 4.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"We should stop training radiologists now. It’s just completely obvious that within five years, deep learning is going to do better than radiologists." (Hinton, 2016)

Radiology Crisis
Radiology Crisis - Independent: https://www.independent.co.uk/news/health/cancer-delays-nhs-staff-shortages-b2561385.html
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"We should stop training radiologists now. It’s just completely obvious that within five years, deep learning is going to do better than radiologists." (Hinton, 2016)

Radiology Crisis
Radiology Crisis - Independent: https://www.independent.co.uk/news/health/cancer-delays-nhs-staff-shortages-b2561385.html
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Demis Hassabis
NOBEL Prizes Ceremony 2024: Arthur Petron, CC BY-SA 4.0 , via Wikimedia Commons
Protein
Protein Folding: BQUB23-Anialet, CC BY-SA 4.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Demis Hassabis
NOBEL Prizes Ceremony 2024: Arthur Petron, CC BY-SA 4.0 , via Wikimedia Commons

"... step 1: solving intelligence, step 2: use it to solve everything else..." (Hassabis, 2025)


"Artificial General Intelligence (AGI) will emerge in the next five or 10 years." (Hassabis, 2025)

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

"In from three to eight years we will have a machine with the general intelligence of an average human being." (Minsky, 1970)


"In medicine, management, and the military — indeed in most of the world's work — the daily tasks are those requiring symbolic reasoning with detailed professional knowledge." (Feigenbaum, 1982)

"... step 1: solving intelligence, step 2: use it to solve everything else..." (Hassabis, 2025)


"Artificial General Intelligence (AGI) will emerge in the next five or 10 years." (Hassabis, 2025)

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Demis Hassabis
NOBEL Prizes Ceremony 2024: Arthur Petron, CC BY-SA 4.0 , via Wikimedia Commons
Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Demis Hassabis
NOBEL Prizes Ceremony 2024: Arthur Petron, CC BY-SA 4.0 , via Wikimedia Commons

Machine Learning Problem Requirements

1. Massive combinatorial search space


2. Clear objective function (metric) to optimise against


3. Lots of data and/or accurate and efficient simulators

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Complexity
Complexity (Designed by freepik.com)

Machine Learning Problem Requirements

1. Massive combinatorial search space


2. Clear objective function (metric) to optimise against


3. Lots of data and/or accurate and efficient simulators

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

Social Problems
Social Problems Metrics?: Jusezam, CC BY-SA 3.0 , via Wikimedia Commons

Machine Learning Problem Requirements

1. Massive combinatorial search space


2. Clear objective function (metric) to optimise against


3. Lots of data and/or accurate and efficient simulators

Session 1 - Artificial Intelligence and Machine Learning

ML Perception

AI Models Collapse
AI Models Collapse: https://www.nature.com/articles/s41586-024-07566-y

Machine Learning Problem Requirements

1. Massive combinatorial search space


2. Clear objective function (metric) to optimise against


3. Lots of data and/or accurate and efficient simulators

Session 1 - Artificial Intelligence and Machine Learning

ML Scientific Approach

Critical Thinking
Critical Thinking (Designed by freepik.com)
Session 1 - Artificial Intelligence and Machine Learning

ML Scientific Approach

Critical Thinking
Critical Thinking (Designed by freepik.com)

"It seems to me what is called for is an exquisite balance between two conflicting needs: the most skeptical scrutiny of all hypotheses that are served up to us and at the same time a great openness to new ideas. Obviously those two modes of thought are in some tension. But if you are able to exercise only one of these modes, whichever one it is, you’re in deep trouble. (The Burden of Skepticism, Sagan, 1987)

Session 1 - Artificial Intelligence and Machine Learning

ML Definition

Session 1 - Artificial Intelligence and Machine Learning

ML Definition

Session 1 - Artificial Intelligence and Machine Learning

ML Definition

Session 1 - Artificial Intelligence and Machine Learning

ML Definition

Our definition before...


Machine Learning Definition
Session 1 - Artificial Intelligence and Machine Learning

ML Definition

Session 1 - Artificial Intelligence and Machine Learning

ML Definition

Our ML projects must have a purpose...


People
Session 1 - Artificial Intelligence and Machine Learning

Conclusions

Session 1 - Artificial Intelligence and Machine Learning

Conclusions

Session 1 - Artificial Intelligence and Machine Learning

Conclusions

Overview

  • ML Context
  • AI History
  • AI Perception
  • AI Winters
  • ML Today
  • ML Applications, Promises, and Risks
  • ML Perception
  • ML Definition
Session 1 - Artificial Intelligence and Machine Learning

Conclusions

Overview

  • ML Context
  • AI History
  • AI Perception
  • AI Winters
  • ML Today
  • ML Applications, Promises, and Risks
  • ML Perception
  • ML Definition

Next Time

  • ML Adoption Process
  • ML with Purpose
  • ML and Socio-technical Systems
  • Data Orientation
  • Data Access
Session 1 - Artificial Intelligence and Machine Learning

Many Thanks!

chc79@cam.ac.uk

PDF

PDF

PDF

PDF

PDF

_script: true

This script will only execute in HTML slides

_script: true