Media Summary: Forough Poursabzi, Researcher, Microsoft Research Presented at MLconf 2018 Abstract: Machine learning is increasingly used to ... A surprising fact about modern large language Lex Fridman Podcast full episode: Thank you for listening ❤ Check out our ...

Can Interpretability Control Model Training - Detailed Analysis & Overview

Forough Poursabzi, Researcher, Microsoft Research Presented at MLconf 2018 Abstract: Machine learning is increasingly used to ... A surprising fact about modern large language Lex Fridman Podcast full episode: Thank you for listening ❤ Check out our ... Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ... InterpretML demo from Azure ML product group. InterpretML is an open-source python package for Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ...

This meetup was recorded in Mountain View, California on January 24th 2019. Slides from the meetup ACL SIG-FinTech x TFAI Webinar Series ( Understanding and improving LLMs through mechanistic ... See Part I for an intro into Steering Vectors Code from this video: ... Art by Clipped from episode 19 of AXRP: Transcript of that episode: ...

Photo Gallery

Can Interpretability Control Model Training?
Manipulating and Measuring Model Interpretability
What is interpretability?
Interpretability: Understanding how AI models think
Yuandong Tian: Inside-out interpretability: training dynamics in multi-layer transformer
Mechanistic Interpretability explained | Chris Olah and Lex Fridman
The Dark Matter of AI [Mechanistic Interpretability]
What Matters Right Now In Mechanistic Interpretability?
Fit interpretable models & explain blackbox ML with InterpretML from Microsoft Research.
Interpretable vs Explainable Machine Learning
Scaling interpretability
Accuracy versus Interpretability / Explainability in Machine Learning
View Detailed Profile
Can Interpretability Control Model Training?

Can Interpretability Control Model Training?

A talk I gave to my MATS 9.0

Manipulating and Measuring Model Interpretability

Manipulating and Measuring Model Interpretability

Forough Poursabzi, Researcher, Microsoft Research Presented at MLconf 2018 Abstract: Machine learning is increasingly used to ...

What is interpretability?

What is interpretability?

A surprising fact about modern large language

Interpretability: Understanding how AI models think

Interpretability: Understanding how AI models think

What's happening inside an AI

Yuandong Tian: Inside-out interpretability: training dynamics in multi-layer transformer

Yuandong Tian: Inside-out interpretability: training dynamics in multi-layer transformer

You

Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Mechanistic Interpretability explained | Chris Olah and Lex Fridman

Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=ugvHCXCOmm4 Thank you for listening ❤ Check out our ...

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0

Fit interpretable models & explain blackbox ML with InterpretML from Microsoft Research.

Fit interpretable models & explain blackbox ML with InterpretML from Microsoft Research.

InterpretML demo from Azure ML product group. InterpretML is an open-source python package for

Interpretable vs Explainable Machine Learning

Interpretable vs Explainable Machine Learning

Interpretable models can

Scaling interpretability

Scaling interpretability

Science and engineering are inseparable. Our researchers reflect on the close relationship between scientific and engineering ...

Accuracy versus Interpretability / Explainability in Machine Learning

Accuracy versus Interpretability / Explainability in Machine Learning

Accuracy versus

Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!

Get hands-on with Explainable AI at Machine Learning Interpretability(MLI) Gym!

This meetup was recorded in Mountain View, California on January 24th 2019. Slides from the meetup

Manipulating and Measuring Model Interpretability

Manipulating and Measuring Model Interpretability

Manipulating and Measuring

How Reasoning Models Break Mechanistic Interpretability Techniques

How Reasoning Models Break Mechanistic Interpretability Techniques

A talk I gave to my MATS 9.0

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How

Understanding and improving LLMs through mechanistic interpretability

Understanding and improving LLMs through mechanistic interpretability

ACL SIG-FinTech x TFAI Webinar Series (https://sigfintech.github.io/) Understanding and improving LLMs through mechanistic ...

Steering vectors: tailor LLMs without training. Part II: Code (Interpretability Series)

Steering vectors: tailor LLMs without training. Part II: Code (Interpretability Series)

See Part I for an intro into Steering Vectors https://youtu.be/cp-YSyc5aW8. Code from this video: ...

AI  Interpretability vs Explainability

AI Interpretability vs Explainability

Interpretability

What is mechanistic interpretability? Neel Nanda explains.

What is mechanistic interpretability? Neel Nanda explains.

Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...