Category: Uncategorized

  • ~Some Brief Opinions after the NeurIPS ’25 Interpretability Workshop~ I again wanted to share some thoughts on the field of interpretability, this time under the pretext of responding to the “Mechanistic Interpretability” workshop at NeurIPS 2025. Mostly, I want to discuss the rising tide of mechanistic interpretability and ask the question: Mechanistic? As the title…

  • I love the field of interpretability, but one question faces everyone who dips their toes into it: “What is interpretability?” There never seems to be a universally agreed-upon definition. Much as in philosophy, this often leads to disagreements over definitions, fights over contexts, and arguments over objectives. Interpretability…

  • ~Some Brief Thoughts after the ICML ’25 Interpretability Workshop~ I have told many people, many times, that I will write a blog post, yet my words never end up written down on a website. I have recently been advised that if I just write down my first thoughts without any attempts to…