Gemma Scope: helping the safety community shed light on the inner workings of language models
Published 31 July 2024
Authors: Language Model Interpretability team
Announcing a comprehensive, open suite of sparse autoencoders for language ...