PaCE: Parsimonious Concept Engineering for Large Language Models

Publication
Neural Information Processing Systems (NeurIPS)