A new research by Google DeepMind shows how sparse autoencoders (SAEs) with special JumpReLU activation can help interpret LLMs.
View Article on VentureBeat
AI,Business,AI research,AI, ML and Deep Learning,category-/Science/Computer Science,deep learning,DeepMind,Google Deepmind,large language models,LLMs,neural networks,research
LLMs