AI NewsMachine LearningMechanistic Interpretability
OpenAI Researchers Train Weight Sparse Transformers to Expose Interpretable Circuits
OpenAI's weight-sparse transformers achieve 1-in-1000 weight sparsity, enabling interpretable circuits for safer AI