ClawGuard: AI Agent Security Scanner
ClawGuard is an open-source security scanner designed to act as a firewall for AI agents, detecting threats like prompt injection, jailbreaks, and data exfiltration in real time. It features 216 detection patterns across 13 categories, supports 15 languages, and achieves sub-10 millisecond scan times with an F1 score of 99 percent. The tool includes a first-of-its-kind MCP security scanner for identifying hidden injections in MCP server tool descriptions, along with a 10-stage preprocessing pipeline to resist common evasion techniques like leetspeak, zero-width characters, and base64 encoding. It provides confidence scoring, a benchmarking framework, CLI and SARIF output for CI/CD integration, and helps with EU AI Act compliance. The project has been used to responsibly disclose vulnerabilities in over 30 popular MCP servers and AI tools, and is released under the MIT License.
Comments
Post a Comment