A single prompt was found to universally bypass all safety guardrails against all frontier AI models, as well as get them to leak their intellectual property! I got hands on and proved it worked...
Share this post
Update #7 - Policy Puppetry Attack in Action
Share this post
A single prompt was found to universally bypass all safety guardrails against all frontier AI models, as well as get them to leak their intellectual property! I got hands on and proved it worked...