Gandalf AI Prompt Injection Game Summary
Gandalf is an interactive AI challenge by Lakera where players try to outsmart a chatbot named Gandalf into revealing a secret password that it has been instructed not to share. The game has multiple levels with increasing defenses, illustrating how prompt injection techniques can trick or fail against evolving AI safeguards. Users must craft clever inputs to bypass rules and extract hidden information, making it a hands-on way to learn about AI security and prompt engineering.
https://gandalf.lakera.ai/do-not-tell
Ps. thanks https://www.linkedin.com/in/rgcampos/
Comments
Post a Comment