Gandalf AI Prompt Injection Game Summary

Gandalf is an interactive AI challenge by Lakera where players try to outsmart a chatbot named Gandalf into revealing a secret password that it has been instructed not to share. The game has multiple levels with increasing defenses, illustrating how prompt injection techniques can trick or fail against evolving AI safeguards. Users must craft clever inputs to bypass rules and extract hidden information, making it a hands-on way to learn about AI security and prompt engineering. 

https://gandalf.lakera.ai/do-not-tell

Ps. thanks https://www.linkedin.com/in/rgcampos/

Comments

Popular posts from this blog

Prompt Engineering Demands Rigorous Evaluation

Secure Vibe Coding Guide: Best Practices for Writing Secure Code

KEVIntel: Real-Time Intelligence on Exploited Vulnerabilities