We are an AI research lab at the London School of Economics focusing on mechanistic interpretability of LLMs.
Our Research
Our recent paper on layerwise transfer learning in sparse autoencoders was accepted for publication by ACL and for presentation at BlackboxNLP (EMNLP 2024)!
We’re a dynamic group of individuals who are passionate about what we do and dedicated to delivering our best work.