Interpretability Blog
Just another blog about interpretability, this one by James Enouen
-
~Some Brief Opinions after the NeurIPS ’25 Interpretability Workshop~ I again wanted to share some thoughts on the field of interpretability, this time under the pretense of responding to the “Mechanistic Interpretability” workshop at NeurIPS 2025. Mostly, I want to discuss the rising tide of mechanistic interpretability and ask the question: Mechanistic? As the title…
-
I love the field of interpretability, but one issue faced by everyone who tries dipping their toes into it is the question: “What is interpretability?” There never seems to be a universally agreed-upon definition. Much like in philosophy, this often leads to disagreements over definitions, fights over contexts, and arguments over objectives. Interpretability…
-
~Some Brief Thoughts after the ICML ’25 Interpretability Workshop~ I have told many people, multiple times, that I would write a blog post, yet my words never seem to end up written down on a website. I have recently been advised that if I just write down my first thoughts without any attempts to…