tommybark / elk
This project is forked from cadenza-labs/elk.
Keeping language models honest by directly eliciting knowledge encoded in their activations. Builds on "Discovering latent knowledge in language models without supervision" (Burns et al., 2022).
License: MIT License