Media Summary: This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? This is a talk I gave to my MATS scholars, with a stylised history of the field of How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to

Neel Nanda Mechanistic Interpretability Superposition - Detailed Analysis & Overview

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? This is a talk I gave to my MATS scholars, with a stylised history of the field of How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Art by Clipped from episode 19 of AXRP: Transcript of that episode: ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via A walkthrough of Anthropic's Paper, A Toy Model of

A talk I gave to my MATS 9.0 training program about reasoning model Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ... When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ... How good are we at understanding the internal computation of advanced machine learning models, and do we have a hope at ... See part 2 here: Implementing GPT-2 from Scratch Template notebook: ...

Photo Gallery

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking
What Matters Right Now In Mechanistic Interpretability?
The Story of Mech Interp
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
What is mechanistic interpretability? Neel Nanda explains.
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)
A Walkthrough of Toy Models of Superposition w/ Jess Smith
How Reasoning Models Break Mechanistic Interpretability Techniques
Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour
Mechanistic Interpretability - NEEL NANDA (DeepMind)
Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]
What Happened With Sparse Autoencoders?
Sponsored
Sponsored
View Detailed Profile
Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda–Mechanistic Interpretability, Superposition, Grokking

Neel Nanda

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

Sponsored
The Story of Mech Interp

The Story of Mech Interp

This is a talk I gave to my MATS scholars, with a stylised history of the field of

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to

What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...

Sponsored
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via

A Walkthrough of Toy Models of Superposition w/ Jess Smith

A Walkthrough of Toy Models of Superposition w/ Jess Smith

A walkthrough of Anthropic's Paper, A Toy Model of

How Reasoning Models Break Mechanistic Interpretability Techniques

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0 training program about reasoning model

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda – Mechanistic Interpretability: A Whirlwind Tour

Neel Nanda

Mechanistic Interpretability - NEEL NANDA (DeepMind)

Mechanistic Interpretability - NEEL NANDA (DeepMind)

http://80000hours.org/mlst Visit our sponsor 80000 hours - grab their free career guide and check out their podcast! Use our ...

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda

What Happened With Sparse Autoencoders?

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ...

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

Neel Nanda - Our Pivot To Pragmatic Interpretability [Alignment Workshop]

When Anthropic tested Claude Sonnet 4.5 for alignment, the model appeared perfectly behaved — but it turned out the model had ...

Neel Nanda: Mechanistic Intepretability (HAAISS 2024)

Neel Nanda: Mechanistic Intepretability (HAAISS 2024)

Neel Nanda

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

I lead a Google DeepMind team at 26. If you want to work at an AI company... | Neel Nanda (Part 2)

PART 1* — a comprehensive update on

19 - Mechanistic Interpretability with Neel Nanda

19 - Mechanistic Interpretability with Neel Nanda

How good are we at understanding the internal computation of advanced machine learning models, and do we have a hope at ...

What is a Transformer? (Transformer Walkthrough Part 1/2)

What is a Transformer? (Transformer Walkthrough Part 1/2)

See part 2 here: Implementing GPT-2 from Scratch https://neelnanda.io/transformer-tutorial-2 Template notebook: ...

Neel Nanda: Mechanistic Interpretability & Mathematics

Neel Nanda: Mechanistic Interpretability & Mathematics

Neel Nanda

Related Video Content

NEEL - YouTube information

NEEL is an R&B/Pop, Bollywood-influenced singer-songwriter and producer based out of NYC. Born and raised in New...

Alice Neel - Wikipedia information

In the 1930s, Neel gained a reputation as an artist, and established a good standing within her circle of downtown...

NEEL-TRIMARANS information

NEEL-TRIMARANS is the worldwide leader designing and building cruising trimarans. Discover the unique range of...

Alice Neel | Official Website information

Explore the life and works of Alice Neel, a pioneering American painter known for her humanist portraits and social...

NEEL - Facebook information

NEEL is a Pop/R&B and Bollywood-influenced artist based out of NYC. It started with ‘Hey Siri’ and ended with Alexa...