ClawGuard: AI Agent Security Scanner
ClawGuard is an open-source security scanner that acts as a firewall for AI agents, detecting prompt injection, jailbreaks, and data exfiltration in real time. It features 216 detection patterns across 13 categories, supports 15 languages, and achieves sub-10 millisecond scan times with an F1 score of 99 percent using pure Python with no external dependencies or API calls. The scanner includes a 10-stage preprocessing pipeline to catch evasion techniques like leetspeak, zero-width characters, homoglyphs, and base64 encoding, along with confidence scoring for each finding. ClawGuard offers a dedicated MCP security scanner for identifying hidden prompt injections in MCP server tool descriptions, an evaluation framework with 262 test cases, CLI and SARIF output for CI/CD integration, and compliance support for EU AI Act articles. The project has been used to responsibly disclose vulnerabilities in over 30 popular MCP servers and AI tools representing more than 280,000 combined GitHub stars, and is released under the MIT License.
Comments
Post a Comment