Media Summary: Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to We don't know how AIs think or why they do what they do. Or at least, we don't know much. That fact is only becoming more ...

Mechanistic Interpretability Neel Nanda Deepmind - Detailed Analysis & Overview

Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ... How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to We don't know how AIs think or why they do what they do. Or at least, we don't know much. That fact is only becoming more ... SPONSOR MESSAGES: *** CentML offers competitive pricing for GenAI model deployment, with flexible options to suit a wide ... This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? Art by Clipped from episode 19 of AXRP: Transcript of that episode: ...

This is a talk I gave to my MATS scholars, with a stylised history of the field of Part 1 of a walkthrough of our paper, Progress Measures for Grokking via A talk I gave to my MATS 9.0 training program about reasoning model When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ... How can we look inside neural networks and figure out how they do what they do? This is likely to be very important for alignment ...

Photo Gallery

Mechanistic Interpretability - NEEL NANDA (DeepMind)
I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
We Can Monitor AI’s Thoughts… For Now | Google DeepMind's Neel Nanda
Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour
NEURAL NETWORKS ARE WEIRD! - Neel Nanda (DeepMind)
What Matters Right Now In Mechanistic Interpretability?
What is mechanistic interpretability? Neel Nanda explains.
Neel Nanda: Mechanistic Interpretability & Mathematics
Neel Nanda on Avoiding an AI Catastrophe with Mechanistic Interpretability
The Story of Mech Interp
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
Sponsored
Sponsored
View Detailed Profile
Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

Most remarkably, he ended up running

Sponsored
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to

We Can Monitor AI’s Thoughts… For Now | Google DeepMind's Neel Nanda

We Can Monitor AI’s Thoughts… For Now | Google DeepMind's Neel Nanda

We don't know how AIs think or why they do what they do. Or at least, we don't know much. That fact is only becoming more ...

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda

Sponsored
NEURAL NETWORKS ARE WEIRD! - Neel Nanda (DeepMind)

NEURAL NETWORKS ARE WEIRD! - Neel Nanda (DeepMind)

SPONSOR MESSAGES: *** CentML offers competitive pricing for GenAI model deployment, with flexible options to suit a wide ...

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...

Neel Nanda: Mechanistic Interpretability & Mathematics

Neel Nanda: Mechanistic Interpretability & Mathematics

Neel Nanda

Neel Nanda on Avoiding an AI Catastrophe with Mechanistic Interpretability

Neel Nanda on Avoiding an AI Catastrophe with Mechanistic Interpretability

Neel Nanda

The Story of Mech Interp

The Story of Mech Interp

This is a talk I gave to my MATS scholars, with a stylised history of the field of

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via

How Reasoning Models Break Mechanistic Interpretability Techniques

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0 training program about reasoning model

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ...

Concrete Open Problems in Mechanistic Interpretability: Neel Nanda at SERI MATS

Concrete Open Problems in Mechanistic Interpretability: Neel Nanda at SERI MATS

How can we look inside neural networks and figure out how they do what they do? This is likely to be very important for alignment ...

Part 2: 5. Interpretability

Part 2: 5. Interpretability

Neel Nanda

Related Video Content

MECHANISTIC Definition & Meaning - Merriam-Webster information

May 16, 2026 · The meaning of MECHANISTIC is mechanically determined.

MECHANISTIC | English meaning - Cambridge Dictionary information

MECHANISTIC definition: 1. thinking of living things as if they were machines: 2. thinking of living things as if...

MECHANISTIC definition and meaning | Collins English Dictionary information

If you describe a view or explanation of something as mechanistic, you are criticizing it because it describes a...

MECHANISTIC Definition & Meaning | Dictionary.com information

MECHANISTIC definition: of or relating to the theory of mechanism or to mechanists. See examples of mechanistic used...

What Is a Mechanistic Model? Definition and Examples information

Nov 7, 2025 · A mechanistic model is constructed from known physical, chemical, or biological laws. For instance, a...