Rogue agents and shadow AI: Why VCs are betting big on AI security

What happens when an AI agent decides the best way to complete its task is to blackmail you? This is not a hypothetical scenario. According to Barmak Meftah, a partner at cybersecurity venture firm Ballistic Ventures, it recently happened to an enterprise employee working with an AI agent. The employee tried to suppress what the agent was trained to do, and the agent responded by scanning the user’s inbox, finding some inappropriate emails, and threatening to blackmail the user by forwarding those emails to the board of directors.

In the agent’s perspective, it was doing the right thing. It was trying to protect the end user and the enterprise. Meftah’s example is reminiscent of Nick Bostrom’s AI paperclip problem, a thought experiment that illustrates the existential risk posed by a superintelligent AI single-mindedly pursuing an innocuous goal, like making paperclips, to the exclusion of all human values. In this case, the enterprise AI agent’s lack of context for why the employee was overriding its goals led it to create a sub-goal to remove the obstacle through blackmail so it could meet its primary objective. This, combined with the non-deterministic nature of AI agents, means things can go rogue.

Misaligned agents are just one layer of the AI security challenge. Ballistic’s portfolio company, Witness AI, is trying to solve these problems. Witness AI monitors AI usage across enterprises and can detect when employees use unapproved tools, block attacks, and ensure compliance. The company recently raised $58 million following over 500 percent growth in annual recurring revenue and scaled its employee headcount by five times over the last year as enterprises seek to understand shadow AI use and scale AI safely. As part of its fundraise, Witness AI announced new agentic AI security protections.

People are building AI agents that take on the authorizations and capabilities of the people that manage them, and there is a need to ensure these agents do not go rogue, delete files, or do something wrong. Meftah sees agent usage growing exponentially across the enterprise. To complement that rise, and the machine-speed level of AI-powered attacks, analyst Lisa Warren predicts that AI security software will become an $800 billion to $1.2 trillion market by 2031.

Runtime observability and runtime frameworks for safety and risk will be absolutely essential. As for how such startups plan to compete with large players like AWS, Google, and Salesforce, who have built AI governance tools into their platforms, Meftah believes AI safety and agentic safety is such a huge field that there is room for many approaches. Many enterprises want a standalone, end-to-end platform to provide that observability and governance around AI and agents.

Witness AI’s CEO, Rick Caccia, noted that his company operates at the infrastructure layer, monitoring interactions between users and AI models rather than building safety features into the models themselves. This approach was intentional, choosing a part of the problem where a model provider could not easily subsume them, meaning they compete more with legacy security companies. The question becomes how to beat them.

Caccia does not want Witness AI to be a startup that simply gets acquired. He wants the company to grow and become a leading independent provider, following the path of companies like CrowdStrike in endpoint protection, Splunk in SIEM, and Okta in identity. The goal is to stand next to the big players, and Witness AI was built to do that from day one.