The Takeaway: The real AI security problem isn’t prompts — it’s autonomous agents taking the wrong actions at scale.
- Enterprises can’t rely on human review anymore when agent actions are multiplying 100x to 1,000,000x.
- Existing security tools miss the core issue: they can see activity, but not whether an agent’s next move is actually safe.
- The winning control layer will be small, fast models that decide when a smarter agent needs to step in — not giant models reviewing everything.
Maxim Bar Kogan, cofounder and CEO of Onyx Security, is building what he calls an “AI control plane” for enterprises: agents that watch other agents. His bet started early, when AutoGPT hinted that LLMs could stop generating text and start taking actions. Back then, most buyers thought he was too early. Now, with coding agents like Claude Code and enterprise adoption accelerating, the risk has caught up to the vision.
His core argument is blunt: traditional security breaks down when software is allowed to act like a human. Identity tools assume permissions can be tightly scoped, but agents need broad access to be useful. Endpoint and API tools can log what happened, but they can’t tell whether the model was justified in doing it. As he puts it, the hard part is “understanding what another AI system is thinking, what is it planning to do.”
Onyx’s answer is not to run a giant model on every action. That would be too slow and too expensive. Instead, they train small models that are good at one job: deciding when a higher-level review is needed. It’s a blitz-chess mindset — spend almost no compute on routine moves, then slow down hard when the position gets dangerous. Kogan thinks that same logic will define AI governance as models get smarter and more autonomous.