Darshan Thaker
Darshan Thaker
Home
Publications
Experience
Contact
CV
Light
Dark
Automatic
Kwan Ho Ryan Chan
Latest
SECA: Semantically Equivalent & Coherent Attacks for Eliciting LLM Hallucinations
KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs
PaCE: Parsimonious Concept Engineering for Large Language Models
Cite
×