mechanistic interpretability

The study of how neural networks compute internally, pursued by reverse-engineering the specific circuits and components (such as neurons, attention heads, and weights) responsible for a model's behavior.
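One common technique in this field is ablation: zeroing out a component and observing how the output changes, which reveals whether that component is part of the circuit responsible for a behavior. The sketch below is a hypothetical toy illustration (the network, weights, and `forward` helper are invented for the example) using a tiny hand-wired network that computes XOR.

```python
def relu(x):
    return max(0.0, x)

# Hand-wired 2-2-1 network computing XOR of two binary inputs.
# Hidden unit 0 fires when either input is on; hidden unit 1 fires
# only when both are on, and the output weights subtract it off.
W1 = [[1.0, 1.0], [1.0, 1.0]]   # input -> hidden weights
b1 = [0.0, -1.0]                # hidden biases
W2 = [1.0, -2.0]                # hidden -> output weights

def forward(x, ablate=None):
    """Run the network; optionally zero out hidden unit `ablate`."""
    h = [relu(W1[j][0] * x[0] + W1[j][1] * x[1] + b1[j]) for j in range(2)]
    if ablate is not None:
        h[ablate] = 0.0         # the ablation: knock out one unit
    return W2[0] * h[0] + W2[1] * h[1]

# Baseline behavior: the network computes XOR.
for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, forward(x))        # -> 0.0, 1.0, 1.0, 0.0

# Ablating hidden unit 1 breaks only the (1, 1) case: the output
# becomes 2.0 instead of 0.0, showing that unit 1 implements the
# "suppress when both inputs are on" part of the XOR circuit.
print(forward((1, 1), ablate=1))  # -> 2.0
```

Because ablating unit 1 changes the output only on the (1, 1) input, we can attribute that specific part of the behavior to that specific unit, which is the kind of causal, component-level explanation mechanistic interpretability aims for.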