This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Andrea Palmieri 🤌
andpalmier.com
did:plc:uuu3t6jhtvpd6ljyb74ruwxt
We interact (and therefore attack) LLMs mainly using language, therefore let's start from there.
I used this dataset github.com/verazuo/jailbreak_llms of #jailbreak #prompt to create this wordcloud.
I believe it gives a sense of "what works" in these attacks!
⬇️
2024-11-25T07:08:56.098Z