Darshan Thaker
Darshan Thaker
Home
Publications
Experience
Contact
CV
Light
Dark
Automatic
KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs
Buyun Liang
,
Kwan Ho Ryan Chan
,
Darshan Thaker
,
Jinqi Luo
,
René Vidal
February 2025
PDF
Cite
Type
Preprint
Publication
In Submission
Cite
×