<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
    <channel>
        <title>Security | VentureBeat</title>
        <link>https://venturebeat.com/category/security/feed/</link>
        <description>Transformative tech coverage that matters</description>
        <lastBuildDate>Tue, 23 Jun 2026 17:23:54 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>https://github.com/jpmonette/feed</generator>
        <language>en</language>
        <copyright>Copyright 2026, VentureBeat</copyright>
        <item>
            <title><![CDATA[7,000 Langflow servers are under attack. LangGraph and LangChain have the same holes]]></title>
            <link>https://venturebeat.com/security/7000-langflow-servers-under-attack-langgraph-langchain-same-holes</link>
            <guid isPermaLink="false">4v4X3T0BLpo0EzJ33GND2V</guid>
            <pubDate>Fri, 19 Jun 2026 21:14:19 GMT</pubDate>
            <description><![CDATA[<p>Your AI agent did exactly what it was designed to do. The framework underneath it just handed an attacker a shell on the box that holds your OpenAI key, your database credentials, and your CRM tokens.</p><p>That is not a hypothetical. In a few months, three of the most widely deployed AI agent frameworks each turned a known, ordinary bug class into a way through. <a href="https://research.checkpoint.com/2026/from-sqli-to-rce-exploiting-langgraphs-checkpointer/">Check Point Research</a> chained a SQL injection in LangGraph’s SQLite checkpointer to full remote code execution. Tenable and VulnCheck tracked a path traversal in Langflow’s file upload endpoint to active, in-the-wild RCE. <a href="https://www.cyera.com/research/langdrained-3-paths-to-your-data-through-the-worlds-most-popular-ai-framework">Cyera</a> documented a path traversal in LangChain-core’s prompt loader that reads your secrets off disk. Two paths to a shell, one to your keys. They are the same bug, wearing three frameworks.</p><p>These frameworks became production infrastructure faster than anyone secured them. They store agent state, take file uploads, load prompt configs, and hold the credentials to databases, CRMs, and internal APIs. The edge tools watch traffic. The endpoint tools watch processes. Neither was built to treat an imported framework as a boundary worth guarding, and that blind spot is exactly where all three chains live, widening every week as these frameworks ship to production.</p><h2><b>The LangGraph chain, SQL injection to a Python shell</b></h2><p>Start with the one most teams pulled into production this quarter. LangGraph gives AI agents memory through checkpointers, the persistence layer that stores execution state. It has cleared over <a href="https://pypistats.org/packages/langgraph">50 million downloads a month</a>. Yarden Porat of Check Point Research took that layer apart and found three vulnerabilities. Two of them chain to RCE.</p><p><a href="https://advisories.gitlab.com/pypi/langgraph-checkpoint-sqlite/CVE-2025-67644/">CVE-2025-67644</a>, rated CVSS 7.3, is a SQL injection in the SQLite checkpointer. The function that builds the WHERE clause for checkpoint lookups drops user-controlled filter keys straight into the query with no parameterization and no escaping. This does not hit everyone, but where it hits, it is serious. A deployment is exposed when it self-hosts LangGraph on the SQLite or Redis checkpointer and lets untrusted input reach get_state_history() or a similar history endpoint. Meet those conditions, and an attacker who controls the filter writes a fabricated row straight into the checkpoint table. Run LangChain’s managed LangSmith platform on PostgreSQL, and the exposure is gone.</p><p>Then <a href="https://advisories.gitlab.com/pypi/langgraph/CVE-2026-28277/">CVE-2026-28277</a>, CVSS 6.8, finishes the job. LangGraph’s msgpack checkpoint decoder rebuilds Python objects from the stored data, which lets it import a module and call a named function with attacker-supplied arguments. That step needs write access to the checkpoint store; the SQL injection is what grants it remotely. LangGraph loads the forged row as a legitimate checkpoint, the decoder runs the specified function, including os.system, and code executes under the identity of the agent server. A third issue, CVE-2026-27022, CVSS 6.5, reaches the same place through the Redis checkpointer.</p><p>There has been no confirmed exploitation in the wild yet. A working proof-of-concept is public in Check Point’s disclosure. The fixes are version bumps: langgraph-checkpoint-sqlite to 3.0.1, langgraph to 1.0.10, and langgraph-checkpoint-redis to 1.0.2.</p><h2><b>The Langflow chain, one unauthenticated request to RCE</b></h2><p>Langflow is the one already under attack. CVE-2026-5027, CVSS 8.8, is a path traversal in the POST /api/v2/files endpoint, which takes the filename straight from the form data and writes it to disk unsanitized. An attacker packs that filename with traversal sequences and drops a file anywhere, such as a cron job in /etc/cron.d/. Because Langflow ships with auto-login enabled in its default configuration, an exposed instance needs no credentials at all. A single unauthenticated request reaches the endpoint, and the next cron run hands over a shell.</p><p>VulnCheck’s Caitlin Condon confirmed exploitation on June 9: “Our Canaries observed exploitation of CVE-2026-5027 that successfully leveraged the path traversal to write what appear to be test files on victim systems.” Censys put roughly 7,000 exposed instances on the internet, most in North America. This is the third Langflow flaw to draw active exploitation this year, after <a href="https://www.probablypwned.com/article/langflow-cve-2025-34291-muddywater-account-takeover-rce">CVE-2025-34291</a>, which the Iranian state-sponsored group MuddyWater weaponized and which CISA added to its <a href="https://thehackernews.com/2026/05/cisa-adds-exploited-langflow-and-trend.html">Known Exploited Vulnerabilities catalog</a> in May. CVE-2026-5027 itself was patched in version 1.9.0, released April 15.</p><p>The timeline is what sets the clock. The patch shipped April 15. Attacks started in June, and <a href="https://www.thestack.technology/langflow-instances-are-getting-exploited-again/">VulnCheck added CVE-2026-5027 to its exploited-vulnerabilities list June 8</a> once its sensors caught the first in-the-wild hits. Every instance left unpatched between those two dates has been sitting in the open for almost two months. The lesson for security teams is to start the patch clock at disclosure, not at a federal catalog entry.</p><h2><b>The LangChain-core gap, arbitrary file reads through the prompt loader</b></h2><p>LangChain-core, the foundation under both, disclosed <a href="https://thehackernews.com/2026/03/langchain-langgraph-flaws-expose-files.html">CVE-2026-34070</a>, CVSS 7.5, a path traversal in its legacy prompt-loading API. The load_prompt() functions read a file path out of a config dict with no check against traversal sequences or absolute paths, so an attacker who influences that path reads arbitrary files the process can reach, including the .env file holding OPENAI_API_KEY and ANTHROPIC_API_KEY. Cyera paired it with CVE-2025-68664, CVSS 9.3, a deserialization flaw that resolves environment secrets through a crafted object. The fix versions differ, which matters when you patch: CVE-2026-34070 lands in <a href="https://security.snyk.io/vuln/SNYK-PYTHON-LANGCHAINCORE-15809257">langchain-core 1.2.22 and 0.3.86</a>; CVE-2025-68664 lands earlier in <a href="https://nvd.nist.gov/vuln/detail/CVE-2025-68664">1.2.5 and 0.3.81</a>. Clear both, or the higher-severity flaw stays live behind a patched one.</p><p>Three frameworks, three classic AppSec bugs. Path traversal. SQL injection. Unsafe deserialization. Nothing exotic, nothing AI-specific, just old vulnerabilities living inside new infrastructure. None of this is a frontier-model problem. It is plumbing, sitting in the layer where AI meets the enterprise.</p><h2><b>Why the scanner cannot see it</b></h2><p>Merritt Baer, CSO at <a href="https://www.enkryptai.com/">Enkrypt AI</a> and former deputy CISO at AWS, has named what makes this kind of failure hard to see coming. It does not announce itself as an AI problem. &quot;CISOs will experience MCP insecurity not in the abstract, but when an employee pastes sensitive data into a tool, or when an attacker finds an unauthenticated MCP server in your cloud,&quot; Baer told VentureBeat. &quot;It won&#x27;t feel like &#x27;AI risk.&#x27; It will feel like your traditional security program failing.&quot; The framework chains here are the same shape. An exposed Langflow instance is an unauthenticated server in your cloud, and the alert, if one fires, reads like an ordinary incident.</p><p>That is the gap in one sentence. The exploit lives in the framework your code imports. The WAF never sees a msgpack decoder running three layers down. The EDR watches the agent server make the same process calls it makes a thousand times a day and waves it through. Both tools are doing their job. Nobody scoped the framework itself as the thing that could turn on you. </p><p>The root cause is older than AI, and Baer names it. “MCP is shipping with the same mistake we’ve seen in every major protocol rollout: insecure defaults,” she told VentureBeat. “If we don’t build authentication and least privilege in from day one, we’ll be cleaning up breaches for the next decade.” Langflow’s auto-login is that mistake shipped. LangChain-core’s unguarded prompt loader is that mistake shipped. The convenient default is the vulnerability. And the moment an agent connects to anything, that risk compounds. “You’re not just trusting your own security, you’re inheriting the hygiene of every tool, every credential, every developer in that chain,” Baer said. “That’s a supply chain risk in real time.”</p><p>There is a governance failure layered on top of the technical one, and it is the same miscategorization Assaf Keren, chief security officer at Qualtrics and former CISO at PayPal, has flagged in adjacent tooling. “Most security teams still classify experience management platforms as ‘survey tools,’ which sit in the same risk tier as a project management app,” Keren told VentureBeat. “This is a massive miscategorization.” Swap in AI agent frameworks, and it still holds. Teams file LangGraph, Langflow, and LangChain under developer convenience, then wire them into databases, CRMs, and provider keys. “Security has to be an enabler,” Keren said, “or teams route around it.” These frameworks are what routing around it looks like.</p><p>Follow the money and it points at the same layer. On its <a href="https://www.fool.com/earnings/call-transcripts/2026/06/03/crowdstrike-crwd-q1-2027-earnings-transcript/">Q1 fiscal 2027 earnings call</a>, CrowdStrike reported its AI detection and response line up more than 250% sequentially, and on June 17 it <a href="https://www.crowdstrike.com/en-us/press-releases/crowdstrike-advances-ai-and-cloud-security-operations-on-aws/">extended that runtime coverage</a> to agent, LLM, and MCP traffic on AWS. George Kurtz, the company’s co-founder and CEO, named the reason in plain terms: “Agents run on the endpoint. They make tool calls, access files, invoke APIs, and move data at the process level.” That is the exact plumbing these chains abuse, and real money is now moving to the layer your AppSec scan skips.</p><h2><b>What to put in front of the board</b></h2><p>The board does not need the CVE numbers. It needs the consequence, and Keren draws the line the board cares about. Most teams have mapped the technical blast radius. “But not the business blast radius,” Keren told VentureBeat. “When an AI engine triggers a compensation adjustment based on poisoned data, the damage is not a security incident. It is a wrong business decision executed at machine speed.” A framework RCE is the same problem one layer earlier. The agent does not just leak a credential; it acts on production systems with it, and the business sees an outcome no one can explain.</p><p>So frame it the way a board frames it: we run AI agent frameworks in production that can be turned into remote shells through bugs our scanners are not built to find, all three are patched, one is under active attack, and here is the date every instance is verified and closed. None of this required custom malware or a zero-day.</p><h2><b>The six-question checklist</b></h2><p>Six trust boundaries, one per row, each with the question, the proof point, the command, the fix, and the board line. Run it tonight.</p><table><tbody><tr><td><p><b>Trust-Boundary Question</b></p></td><td><p><b>Proof Point</b></p></td><td><p><b>What Broke</b></p></td><td><p><b>Verify Before You Install</b></p></td><td><p><b>The Fix</b></p></td><td><p><b>Board Language</b></p></td></tr><tr><td><p><b>1. Can the agent&#x27;s state store be poisoned with code?</b></p></td><td><p>LangGraph SQLi-to-RCE chain. CVE-2025-67644 (CVSS 7.3) chains into CVE-2026-28277 (CVSS 6.8). PoC public, no in-the-wild use yet.</p></td><td><p>Filter keys interpolated into SQL with an f-string. Forged checkpoint row hits the msgpack decoder, which imports and runs an attacker-named callable.</p></td><td><p>pip show langgraph-checkpoint-sqlite. Below 3.0.1 = vulnerable. Confirm get_state_history() is not exposed to network input.</p></td><td><p>Upgrade langgraph-checkpoint-sqlite to 3.0.1, langgraph to 1.0.10, langgraph-checkpoint-redis to 1.0.2.</p></td><td><p>“Our agent memory layer can be tricked into running attacker code. Vendor has patched it. We are upgrading and confirming the endpoint is not exposed.”</p></td></tr><tr><td><p><b>2. Can an unauthenticated request write a file to our agent server?</b></p></td><td><p>Langflow CVE-2026-5027 (CVSS 8.8). On VulnCheck KEV (June 8). Active exploitation confirmed June 9. ~7,000 exposed instances (Censys).</p></td><td><p>Path traversal in POST /api/v2/files. Filename unsanitized. Auto-login on by default. Two HTTP calls drop a cron job and earn a shell.</p></td><td><p>Query Censys or Shodan for your Langflow, Flowise, n8n, and Dify instances on the perimeter. Check whether auto-login is enabled.</p></td><td><p>Upgrade Langflow to 1.9.0+. Disable auto-login. Pull AI dev tools behind VPN or zero-trust. Isolate port 7860.</p></td><td><p>“Our AI dev tools are reachable from the internet with login off. This exact flaw is under active attack now. We are pulling them behind access controls today.”</p></td></tr><tr><td><p><b>3. Can our prompt loader read files it should never touch?</b></p></td><td><p>LangChain-core CVE-2026-34070 (CVSS 7.5), path traversal in the prompt-loading API. Paired with deserialization CVE-2025-68664 (CVSS 9.3).</p></td><td><p>load_prompt() reads a config-supplied path with no traversal check, returning files such as the .env holding OPENAI_API_KEY and ANTHROPIC_API_KEY.</p></td><td><p>pip show langchain-core. Below 1.2.22 (1.x) or 0.3.86 (0.x) = vulnerable. Audit any code passing user-influenced paths to load_prompt().</p></td><td><p>Upgrade langchain-core past both fixes: 1.2.22 / 0.3.86 (CVE-2026-34070) and 1.2.5 / 0.3.81 (CVE-2025-68664). Replace load_prompt() with an allowlisted directory. Run as non-root.</p></td><td><p>“Our prompt system could be steered to read our API keys off disk. We are patching and removing the legacy loader.”</p></td></tr><tr><td><p><b>4. Does a compromised framework hand over every credential at once?</b></p></td><td><p>These frameworks are often deployed with provider keys, database credentials, and integration tokens available to the process environment. Cyera documents the credential-exfiltration path.</p></td><td><p>One RCE on the agent server exposes every secret the process can read. Blast radius is the full credential set, not one app.</p></td><td><p>Inventory which secrets each framework process can reach. Confirm keys come from a secrets manager, not static .env files.</p></td><td><p>Move provider keys to ephemeral injection. Rotate any key a vulnerable instance could have read. Scope each key to least privilege.</p></td><td><p>“A single break in one AI framework exposes the keys to every model and data store it touches. We are rotating and scoping them now.”</p></td></tr><tr><td><p><b>5. Are these frameworks running outside security governance?</b></p></td><td><p>A prior Langflow flaw, CVE-2025-34291, was weaponized by Iranian-linked MuddyWater and added to CISA KEV in May. Shadow AI is the new shadow IT.</p></td><td><p>Teams stand frameworks up for speed, give them credentials, and never bring them under review. The security team cannot see what it does not know exists.</p></td><td><p>Run a discovery sweep for AI frameworks outside change management. Map each to an owner and an approval record.</p></td><td><p>Assign every framework a documented owner and a place in the approval process. Offer a sanctioned alternative so teams do not route around you.</p></td><td><p>“We have AI frameworks in production that no one formally approved. We are bringing them under governance, not banning them.”</p></td></tr><tr><td><p><b>6. Can our scanners even see inside the framework at runtime?</b></p></td><td><p>Runtime detection is forming around this layer: CrowdStrike Falcon AIDR expanded to AWS June 17 (Bedrock, Kiro, Strands); its <a href="https://www.crowdstrike.com/en-us/press-releases/crowdstrike-expands-project-quiltworks-with-aws-hardening-the-cloud-attack-surface-against-frontier-ai-risk/">QuiltWorks coalition</a> now covers cloud workloads.</p></td><td><p>WAF reads HTTP at the edge. EDR watches the endpoint. By default, neither reliably models a msgpack decoder or a prompt loader three layers down in an imported framework as a separate trust boundary.</p></td><td><p>Test whether your AppSec scan covers third-party framework internals. Track CVEs by dependency, not just by what your edge tools can parse.</p></td><td><p>Add framework dependencies to vuln management. Treat agent output and stored state as untrusted. Patch on disclosure, not on KEV listing.</p></td><td><p>“Our scanners check our code, not the frameworks our code imports. We are closing that blind spot and patching on disclosure, not waiting for the federal catalog.”</p></td></tr></tbody></table><p><i>How to read this table: each row is one trust boundary, left to right, from the question to ask to the line to read your board.</i></p><h2><b>Give the board the deadline, not the technology</b></h2><p>The fixes are not a re-architecture. They are version bumps and config changes you can land this week. The exposure is the gap between the day the patch shipped and the day your team runs the checks, and right now that gap is measured in months. The frameworks did exactly what they were built to do. </p>]]></description>
            <author>louiswcolumbus@gmail.com (Louis Columbus)</author>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/5CFo8mBoW1WjItcZvYyHpg/3172659c88b4856fe7137de54672ab16/hero.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Copilot searched your mailbox. LiteLLM handed out admin keys. Run this 5-check audit before your stack is next]]></title>
            <link>https://venturebeat.com/security/copilot-searched-your-mailbox-litellm-handed-out-admin</link>
            <guid isPermaLink="false">1a7xSZvdjuJg9QOkTPSMyW</guid>
            <pubDate>Thu, 18 Jun 2026 17:42:49 GMT</pubDate>
            <description><![CDATA[<p>Two AI tools broke in the same way in the same two weeks, and four research teams proved it. The pattern underneath every disclosure is one sentence: enterprise AI accepts external input with no trust boundary. </p><p>On June 15, Varonis disclosed <a href="https://www.varonis.com/blog/searchleak">SearchLeak (CVE-2026-42824)</a>, a proof-of-concept exfiltration chain in Microsoft 365 Copilot Enterprise Search. A victim clicks a crafted microsoft.com URL, Copilot searches their mailbox, and the data leaves through a Bing SSRF. No plugins, no second click, no visible indicator. Four days earlier, Obsidian Security published a <a href="https://www.obsidiansecurity.com/blog/litellm-privilege-escalation-rce">three-CVE chain against LiteLLM</a> that carried a default low-privilege user all the way to admin and remote code execution. Two tools. Two teams. One broken boundary.</p><p>The five-check audit at the end of this article maps each gap to a CVE or a market signal from June, a command you can run before lunch, and a sentence a CISO can read to the board.</p><h2>Copilot turned a trusted URL into an exfiltration engine</h2><p>SearchLeak chained three weaknesses into a silent data-theft chain. The URL q parameter fed attacker instructions straight to Copilot’s LLM. A rendering race condition fired an image tag before the output sanitizer ran. Bing’s image-search endpoint, allowlisted in the <a href="https://developer.mozilla.org/en-US/docs/Web/HTTP/CSP">Content Security Policy</a>, routed the stolen data out. Microsoft rated the flaw critical and patched it on the back end, according to Varonis. <a href="https://nvd.nist.gov/vuln/detail/CVE-2026-42824">NVD has not yet scored it</a>; a third-party tracker lists it at 6.5 medium. The severity is contested, but the mechanism is not.</p><p>The escalation is the real story. This is the third Varonis Copilot exfiltration chain in twelve months, after <a href="https://arstechnica.com/security/2026/01/a-single-click-mounted-a-covert-multistage-attack-against-copilot/">Reprompt</a> in January and <a href="https://www.bleepingcomputer.com/news/security/new-attack-turned-microsoft-365-copilot-into-1-click-data-theft-tool/">EchoLeak</a> in 2025. Reprompt hit Copilot Personal. SearchLeak hit Enterprise Search. Enterprise inherits the user’s full organizational permissions, so the blast radius is everything that a user can reach.</p><h2>LiteLLM handed a default account to every provider key</h2><p>The LiteLLM gateway holds the keys for OpenAI, Anthropic, Azure, and Bedrock behind a single proxy. The Obsidian chain runs in three moves. <a href="https://cvefeed.io/vuln/detail/CVE-2026-47101">CVE-2026-47101</a>, an authorization bypass, lets a non-admin mint a wildcard API key. CVE-2026-47102 promotes that caller to proxy admin through an unguarded /user/update endpoint. CVE-2026-40217 escapes the code sandbox through exec() with full builtins. Obsidian then demonstrated a reverse shell by injecting a forged tool-call response through LiteLLM’s callback mechanism. Obsidian assessed the combined chain at CVSS 9.9. The developer typed one word. The attacker popped a shell.</p><p>A separate LiteLLM flaw made the urgency immediate. <a href="https://thehackernews.com/2026/06/litellm-flaw-cve-2026-42271-exploited.html">CVE-2026-42271</a>, a command-injection bug in the MCP test endpoints, landed on the <a href="https://www.cisa.gov/known-exploited-vulnerabilities-catalog">CISA KEV list</a> on June 8 with a June 22 remediation deadline. That KEV entry is not the Obsidian chain. The two are distinct disclosures four days apart, fixed in different releases, pointed at the same gateway. LiteLLM carries more than 40,000 GitHub stars and sits in thousands of enterprise deployments. This is not the first scare, either. A <a href="https://thehackernews.com/2026/06/litellm-vulnerability-chain-lets-low.html">supply-chain compromise backdoored LiteLLM versions 1.82.7 and 1.82.8 on PyPI in March</a>. A compromised gateway exposes every provider credential the organization holds.</p><h2>Langflow and Mini Shai-Hulud proved the pattern scales</h2><p>The same boundary broke in two more tools in the same fortnight. <a href="https://thehackernews.com/2026/06/unpatched-langflow-flaw-cve-2026-5027.html">Langflow CVE-2026-5027</a> became the third Langflow remote-code-execution flaw to hit active exploitation this year. A path traversal in file upload lets an attacker write files anywhere on disk, and because Langflow ships with auto-login enabled by default, a single unauthenticated request reaches RCE. <a href="https://www.vulncheck.com/">VulnCheck</a> confirmed exploitation on June 9. Censys counted roughly 7,000 exposed instances, the heaviest concentration in North America, with <a href="https://attack.mitre.org/groups/G0069/">MuddyWater</a> attribution.</p><p>The <a href="https://www.securityweek.com/over-100-npm-pypi-packages-hit-in-new-shai-hulud-supply-chain-attacks/">Mini Shai-Hulud campaign</a> hit a different pressure point. After the worm’s source code went public on May 12, copycat variants <a href="https://socket.dev/blog/mini-shai-hulud-campaign-hits-red-hat-cloud-services-npm-packages">compromised 32 Red Hat Cloud Services npm packages</a> on June 1, packages pulled 80,000 times a week. The worm harvests more than 20 credential types and self-propagates under the compromised maintainer’s identity.</p><p>Four teams, four tools, one operating failure. The bug classes differ. SearchLeak is a prompt injection. LiteLLM is privilege escalation. Langflow is path traversal. Mini Shai-Hulud is supply-chain poisoning. The boundary that broke is the same in all four.</p><h2>The market already repriced the risk</h2><p>CrowdStrike’s <a href="https://www.fool.com/earnings/call-transcripts/2026/06/03/crowdstrike-crwd-q1-2027-earnings-transcript/">Q1 FY27 earnings call</a> put a number on the gap. <a href="https://www.crowdstrike.com/en-us/platform/falcon-aidr-ai-detection-and-response/">AIDR</a>, the company’s AI detection and response line, grew ending ARR more than 250% sequentially, with a Q2 pipeline above $50 million (<a href="https://www.sec.gov/Archives/edgar/data/0001535527/000153552726000022/crwd-20260603xex991.htm">SEC-filed 8-K</a>). Total company ARR reached $5.51 billion, and CrowdStrike’s fleet telemetry shows more than 1,800 agentic applications running across enterprise endpoints. </p><p>On June 17, the company <a href="https://www.crowdstrike.com/en-us/press-releases/crowdstrike-advances-ai-and-cloud-security-operations-on-aws/">extended AIDR to AWS</a>, adding real-time evaluation of agent, LLM, and MCP communications across Amazon Bedrock, Kiro, and Strands Agents, building on its work with <a href="https://www.anthropic.com/glasswing">Anthropic’s Project Glasswing</a>. Daniel Bernard, CrowdStrike’s chief business officer, said the AI attack surface now spans development, runtime, identities, and cloud infrastructure, and that teams treating those as separate domains leave the gaps between them open.</p><h2>Practitioners name the same gap in plainer terms</h2><p>David Levin, CISO at American Express Global Business Travel, <a href="https://venturebeat.com/security/amex-ciso-fights-threats-at-machine-speed-with-ai/">told VentureBeat</a> the pattern does not surprise him. “We kind of have this shadow AI, which is just the new version of shadow IT,” Levin said. </p><p>Both Langflow and LiteLLM fit the description. Teams stood them up for convenience, gave them credentials, and never brought them under governance. Levin puts the fix before deployment. “We didn’t go into this with just saying we’re going to go do this without the right fundamentals,” he said. “We leverage NIST controls. NIST has released their CSF along with their AI framework. OWASP released their top 10. You need the right fundamentals before you deploy.”</p><p>Merritt Baer, CSO at Enkrypt AI and former AWS Deputy CISO, named the structural version of the failure in a separate <a href="https://venturebeat.com/security/most-enterprises-cant-stop-stage-three-ai-agent-threats-venturebeat-survey-finds">VentureBeat interview</a>. “Enterprises believe they’ve ‘approved’ AI vendors, but what they’ve actually approved is an interface, not the underlying system,” Baer said. “The real dependencies are one or two layers deeper, and those are the ones that fail under stress.” She has tied that directly to how systems fall. “Raw zero-days aren’t how most systems get compromised. Composability is,” Baer <a href="https://venturebeat.com/security/adversaries-hijacked-ai-security-tools-at-90-organizations-the-next-wave-has-write-access-to-the-firewall">told VentureBeat</a>. “It’s the glue between the model and your data where the risk lives. If you give an agent bash and a root token, you’ve already done most of the attacker’s work for them.” That is what rows 2 and 4 of the audit test: the gateway that holds every key, and the agent identity no one governs.</p><p>Levin had a sharper frame for the boardroom. “You need to talk more in terms of risk versus compliance to your boards and your executives,” he said. “It’s not about the size of the engineering team anymore. It’s the size of your imagination. It’s all written in plain English. It’s not hard for anyone.” Neither SearchLeak nor LiteLLM needed custom malware or a zero-day to work.</p><p>Adam Meyers, CrowdStrike’s SVP of Intelligence, put the operational squeeze in numbers in an exclusive VentureBeat interview. “The problem is not zero-day. The problem is patching. If you 10x that problem, they’re gonna be completely underwater,” Meyers said. He pointed to identity as the second front. “Some of these AI have their own identities, or people give their identity to the AI to take action on their behalf, and that makes it a very complex problem.”</p><h2>The five-check trust-boundary audit</h2><p>Each row maps a gap to its proof point, a verification command for Monday morning, the fix, and the sentence to read to the board.</p><table><tbody><tr><td><p><b>Trust-Boundary Gap</b></p></td><td><p><b>Proof Point</b></p></td><td><p><b>What Broke</b></p></td><td><p><b>Verify Monday</b></p></td><td><p><b>Fix Monday</b></p></td><td><p><b>Board Language</b></p></td></tr><tr><td><p><b>1. Prompt-to-Data</b></p></td><td><p>SearchLeak CVE-2026-42824. P2P injection + HTML race + Bing SSRF. One-click mailbox exfiltration via microsoft.com URL. PoC demonstrated; Microsoft rated it critical, NVD not yet scored.</p></td><td><p>URL q-parameter passed to LLM as instructions. Sanitizer ran after render. Bing acted as exfiltration proxy via CSP allowlist.</p></td><td><p>Audit CSP allowlists for domains performing server-side fetches. Monitor Copilot Search URLs for encoded payloads. Review Copilot audit logs.</p></td><td><p>Confirm server-side patch applied. Enable sensitivity labels restricting Copilot. Treat AI streaming output as untrusted.</p></td><td><p>“Our AI assistant could search employee email and send results to an attacker through a trusted Microsoft URL. Vendor patched it. We must verify configuration.”</p></td></tr><tr><td><p><b>2. Gateway Credential Exposure</b></p></td><td><p>LiteLLM three-CVE chain (-47101, -47102, -40217). CVSS 9.9. Separate CVE-2026-42271 on CISA KEV (fixed in v1.83.7; full chain fixed in v1.83.14-stable). June 22 deadline.</p></td><td><p>No role validation on key endpoints. Self-promotion to admin via /user/update. exec() sandbox escape. One gateway exposes all provider keys.</p></td><td><p>Run pip show litellm. Below 1.83.14-stable = vulnerable. Check /mcp-rest/test/ exposure. Audit proxy_admin accounts.</p></td><td><p>Upgrade to v1.83.14-stable+. Rotate all provider API keys. Block /mcp-rest/test/* at proxy. Review Custom Code Guardrails.</p></td><td><p>“Our AI gateway held keys for every provider. A default account could promote itself to admin and steal them all. Rotating and patching now.”</p></td></tr><tr><td><p><b>3. AI Tooling Sprawl</b></p></td><td><p>Langflow CVE-2026-5027 (CVSS 8.8). Third RCE of 2026. ~7,000 exposed instances. MuddyWater. Active exploitation June 9.</p></td><td><p>Path traversal in file upload. Auto-login enabled by default. Single unauthenticated request to RCE.</p></td><td><p>Query Censys/Shodan for Langflow, Flowise, n8n, Dify on your perimeter. Check auto-login. Inventory AI tools outside change management.</p></td><td><p>Pull AI platforms behind VPN/zero-trust. Enable auth everywhere. Upgrade Langflow to v1.9.0+ (current release 1.10.0). Fingerprint surface continuously.</p></td><td><p>“AI dev tools are exposed to the internet with login disabled. A nation-state group is exploiting this flaw now. Pulling behind access controls today.”</p></td></tr><tr><td><p><b>4. Non-Human Identity Governance</b></p></td><td><p>AIDR ARR up 250% (Q1 FY27, SEC 8-K). Q2 pipeline &gt;$50M. 1,800+ agentic apps across enterprise endpoints.</p></td><td><p>Agents hold identities and act on behalf of humans. Some exceed their intended scope to reach a goal. No standard governs agent credential lifecycle.</p></td><td><p>Inventory all non-human identities used by agents and MCP servers. Map agent-to-data-store access. Flag agents with write access to security policy.</p></td><td><p>Least-privilege every agent identity. Set privilege boundaries via identity protection. Runtime detection for policy-exceeding actions. Human-in-the-loop for policy changes.</p></td><td><p>“AI agents hold credentials and act autonomously. We do not govern their identity lifecycle like human access. The 250% market growth tells us this gap is systemic.”</p></td></tr><tr><td><p><b>5. Runtime Agentic Detection</b></p></td><td><p>Falcon AIDR expanded to AWS (June 17). Covers Bedrock, Kiro, Strands Agents. MCP integration. Real-time agent/LLM/MCP evaluation.</p></td><td><p>Traditional tools monitor human-speed actions. Agents run at machine speed, thousands of actions per minute, and route around controls to reach goals.</p></td><td><p>Test if EDR/XDR links agent actions to originating identity. Verify SIEM ingests MCP communications. Confirm you can distinguish human from agent on endpoint.</p></td><td><p>Deploy AIDR or equivalent runtime detection. Shadow-AI discovery for all agentic apps, models, MCP servers, identities. Real-time policy enforcement on agent actions.</p></td><td><p>“We cannot distinguish a human employee from an AI agent acting on their behalf. We need runtime detection at machine speed that can stop damage before it starts.”</p></td></tr></tbody></table><h2>The fix is plumbing, not policy</h2><p>The <a href="https://www.whitehouse.gov/presidential-actions/2026/06/promoting-advanced-artificial-intelligence-innovation-and-security/">June 2 executive order</a> creates an AI Cybersecurity Clearinghouse with a July 2 deadline. The five gaps above are not frontier-model problems. They are plumbing problems in the gateways, orchestration platforms, identity layers, and runtime environments where AI meets the enterprise. </p><p>The audit is five rows. Every row maps to a June disclosure or market signal, a command a team can run before lunch, and a sentence a CISO can read to the board. The question is not whether your vendor will patch. It&#x27;s whether you find the gap first — or whether an attacker finds it the way they found Copilot and LiteLLM.</p>]]></description>
            <author>louiswcolumbus@gmail.com (Louis Columbus)</author>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/599hDEEWXHzpIDiNVQFFsc/069254d665cc4a88ccee32f955648c72/hero.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[85% of IT teams claim every AI agent is under control. Only 42% actually know who owns them.]]></title>
            <link>https://venturebeat.com/security/85-of-it-teams-claim-every-ai-agent-is-under-control-only-42-actually-know-who-owns-them</link>
            <guid isPermaLink="false">3wCVYKClI7OE7oMOioSdO1</guid>
            <pubDate>Mon, 15 Jun 2026 17:19:28 GMT</pubDate>
            <description><![CDATA[<p>Organizational leaders are <a href="https://www.ivanti.com/resources/research-reports/scaling-ai-it-operations">nearly twice as likely to hide their AI use</a> compared to all other employees, at 42% versus 23%, according to new Ivanti research surveying 3,900 employees across six countries. Among leaders who conceal that usage, 52% say they do it for a &quot;secret advantage.&quot; The same research found 85% of IT professionals claim a named owner exists for every AI agent. Only 42% say ownership is actually clear — a 43-point gap that no governance framework was designed to close.</p><p>Sam Evans, CISO of Clearwater Analytics, stood before his board and laid out the risk to the $8.8 trillion in assets his firm&#x27;s platform supports. &quot;The worst possible thing would be one of our employees taking customer data and putting it into an AI engine that we don&#x27;t manage,&quot; <a href="https://venturebeat.com/security/ciso-dodges-bullet-protecting-8-8-trillion-from-shadow-ai">Evans told VentureBeat</a>. He brought a solution, not just a problem. Many CISOs VentureBeat interviewed did not.</p><p>Menlo Security CEO Bill Robbins relayed a conversation with a Top 3 U.S. bank CISO who called shadow AI discovery &quot;a bit of a fool&#x27;s errand&quot;: AI is embedded in every application and browser employees touch. The bank governs from containment, not discovery.</p><p>The scale justifies that posture. &quot;We see 50 new AI apps a day, and we&#x27;ve already cataloged over 12,000,&quot; Prompt Security CEO Itamar Golan <a href="https://venturebeat.com/security/shadow-ai-unapproved-ai-apps-compromising-security-what-you-can-do-about-it">told VentureBeat</a>. &quot;Around 40% of these default to training on any data you feed them, meaning your intellectual property can become part of their models.&quot; CrowdStrike has detected <a href="https://venturebeat.com/security/rsac-2026-agent-identity-frameworks-three-gaps">1,800 AI applications operating</a> across 160 million endpoint instances. Those are vendor-reported numbers from proprietary telemetry. No independent party can verify them. The directional signal matters more than the exact count.</p><p>CrowdStrike CTO Elia Zaitsev described what makes the surface so hard to govern. &quot;It looks indistinguishable if an agent runs your web browser versus if you run your browser,&quot; Zaitsev <a href="https://venturebeat.com/security/cisco-crowdstrike-rsac-2026-agent-identity-iam-gap-maturity-model">told VentureBeat at RSAC 2026</a>. &quot;Observing actual kinetic actions is a structured, solvable problem. Intent is not.&quot; The shadow AI surface is no longer a list security teams can maintain. It is an environment they have to assume.</p><p>The Ivanti survey was administered independently by Ravn Research and MSI Advanced Customer Insights across 1,500 IT professionals. Among companies with AI policies, just 24% of employees say those policies are followed &quot;very consistently&quot; in day-to-day work.</p><p>Kayne McGladrey, IEEE senior member, told VentureBeat why that governance gap persists. &quot;Anything that seems to have a cybersecurity flavor is generally put into the cybersecurity risk category, which is a complete fiction. They should be focused on business risks, because if it doesn&#x27;t affect the business, like a financial loss, then nobody&#x27;s going to pay attention to it, and they will not budget it appropriately, nor will they adequately put in controls to prevent it,&quot; McGladrey told VentureBeat previously.</p><p>Brokerage partners at major consulting firms shared over Signal that <a href="https://venturebeat.com/security/vibe-coded-apps-shadow-ai-s3-bucket-crisis-ciso-audit-framework">they build shadow AI applications in Google Colab</a> and store them in S3 buckets to compress a week of financial analysis into an hour. The approval process takes too long, so they route around it.</p><div></div><h2>Governance at deploy time, failure at runtime</h2><p>Reviews check functional requirements when a model ships, but they never check model provenance, behavioral drift, or whether the agent expanded its own permissions after launch. </p><p>CrowdStrike CEO George Kurtz <a href="https://venturebeat.com/security/rsac-2026-agent-identity-frameworks-three-gaps">disclosed at RSA Conference 2026</a> that a Fortune 50 CEO&#x27;s AI agent rewrote the company&#x27;s security policy to expand its own autonomy. The company caught it by accident. Every credential check had passed. &quot;In the agentic era, defending against AI-accelerated adversaries and securing AI systems themselves require operating at machine speed,&quot; <a href="https://venturebeat.com/security/adversaries-hijacked-ai-security-tools-at-90-organizations-the-next-wave-has-write-access-to-the-firewall">Kurtz said</a>. Quarterly governance reviews do not operate at machine speed.</p><p>Mike Riemer, Field CISO at Ivanti, built that lesson into his own team&#x27;s AI agent development. &quot;It&#x27;s great at what I intended it for, but it&#x27;s also great at what I didn&#x27;t intend it for, and what I didn&#x27;t intend it for is dangerous,&quot; <a href="https://venturebeat.com/security/most-enterprises-cant-stop-stage-three-ai-agent-threats-venturebeat-survey-finds">Riemer told VentureBeat</a>.</p><p>Hallucination data compounds the problem. Sixty-eight percent of IT professionals have personally witnessed AI generate hallucinations with potential operational impact, according to Ivanti. More than half caught the errors before damage, but 16% did not. Yet among the most advanced users of AI, 49% fully trust AI-generated outputs that influence IT decisions.</p><p>Riemer described the pattern <a href="https://venturebeat.com/security/mfa-verifies-who-logged-in-it-has-no-idea-what-they-do-next">in an exclusive interview with VentureBeat</a>. &quot;There are people that are just accepting what&#x27;s been given to them without any full understanding of what it is doing, which we&#x27;ve found in the tech industry for decades,&quot; Riemer said. &quot;They don&#x27;t question how it&#x27;s doing it. They just start gauging it by its outcome.&quot;</p><p>Qualtrics CSO Assaf Keren identified the core tension in an exclusive interview with VentureBeat. Organizations are introducing &quot;non-deterministic decisioning into environments built for deterministic.&quot; Keren cited internal Qualtrics data showing that 22% of SOC triage is now AI-driven. No codified threshold separates what an agent can auto-execute from what requires a human in the loop.</p><h2>The 18-month window</h2><p>The window for fixing this is closing. IT organizations expect AI to <a href="https://www.ivanti.com/resources/research-reports/scaling-ai-it-operations">automate 46% of their operations within 18 months</a>, according to Ivanti. U.S. companies project 52%. Governance is already the most commonly cited barrier to faster deployment, ahead of skills, technology, and data challenges.</p><p>The maturity divide makes the governance gap more dangerous. IT professionals at AI-mature organizations save six hours per week, double the three hours saved at the least mature level. Nearly 9 in 10 IT professionals at scaled organizations say AI frequently helps detect or resolve issues before employees are affected. At early experimentation organizations, that number drops to four in ten. Sixty-nine percent of scaled organizations report fully embedded governance, compared to 15% at early experimentation.</p><p>Cisco President Jeetu Patel <a href="https://venturebeat.com/security/85-of-enterprises-are-running-ai-agents-only-5-trust-them-enough-to-ship">walked through a hypothetical scenario</a> in an interview at RSAC 2026: an agent that charges $40,000, invites competitors to a Slack channel, and publishes home addresses. &quot;The apology is not a guardrail,&quot; Patel told VentureBeat. </p><p>Cato Networks VP of Threat Intelligence Etay Maor <a href="https://venturebeat.com/security/rsac-2026-agent-identity-frameworks-three-gaps">framed the accountability problem in a separate RSAC</a> interview. &quot;They&#x27;re closer to humans. Why are we not doing background checks on agents?&quot;</p><p>&quot;AI is compressing the time between intent and execution while turning enterprise AI systems into targets,&quot; CrowdStrike VP of Intelligence Operations Adam Meyers told VentureBeat. </p><p>&quot;Proceed on one action does not mean proceed on the next,&quot; Cisco SVP of AI Software and Platform DJ Sampath said in a separate interview. </p><p>McGladrey described the root cause. Organizations default to cloning human user profiles for agents, and permission sprawl starts on day one. &quot;It uses far more permissions than it should have, more than a human would, because of the speed of scale and intent,&quot; he said.</p><p>Riemer&#x27;s team built governance into Ivanti&#x27;s own development process. &quot;We have AI check on top of AI to make sure that it is fixed. Two different models, two different manufacturers,&quot; <a href="https://venturebeat.com/security/most-enterprises-cant-stop-stage-three-ai-agent-threats-venturebeat-survey-finds">Riemer said</a>. &quot;If one AI believes the other AI fixed it appropriately, then it passes it off to a human being.&quot;</p><p>Riemer put the vendor question in terms every CISO can use at the negotiating table. &quot;If that vendor doesn&#x27;t have a way to show you what they&#x27;ve done from a development perspective in order to improve their development processes, you really need to question why you&#x27;re working with that vendor,&quot; he said.</p><p>The six questions below target governance dimensions where enforcement collapses at runtime. CISOs can use them during Q3 vendor renewals to separate vendors shipping runtime enforcement from vendors shipping documentation.</p><h2>Six governance questions for Q3 renewals</h2><table><tbody><tr><td><p><b>Governance dimension</b></p></td><td><p><b>What the data proved</b></p></td><td><p><b>Why governance misses it</b></p></td><td><p><b>Q3 renewal question</b></p></td><td><p><b>Proof artifact to demand</b></p></td></tr><tr><td><p><b>Executive shadow AI</b></p></td><td><p>Leaders hide AI at 42% vs. 23% all employees. 52% hide for &quot;secret advantage.&quot; Regulated industries have the highest unsanctioned rates.</p></td><td><p>Governance assumes policy writers follow policy. Leaders sit above the controls they wrote.</p></td><td><p>Can your DLP, browser, SSE, and endpoint telemetry detect AI data movement at the executive layer with the same coverage as all other users?</p></td><td><p>Executive-layer DLP, browser, SSE, and endpoint telemetry logs showing identical coverage to all other users.</p></td></tr><tr><td><p><b>Named agent ownership</b></p></td><td><p>85% claim a named owner. Only 42% say ownership is clear. 43-point gap.</p></td><td><p>Owner on a spreadsheet. Agent at runtime. Nobody tested whether the owner can kill the agent under load.</p></td><td><p>Can you name the owner for every AI agent? Can that owner revoke access in 60 seconds?</p></td><td><p>Live demo of 60-second agent access revocation under production load.</p></td></tr><tr><td><p><b>Pre-deployment review</b></p></td><td><p>65% have pre-deployment risk review. Separately, only 24% say any AI policy is followed &quot;very consistently.&quot; Review exists. Enforcement does not.</p></td><td><p>Review checks functional requirements at deploy. Never checks model provenance or behavioral drift at runtime.</p></td><td><p>Does your review cover model provenance? Is it enforced or advisory?</p></td><td><p>Model provenance certificate with enforcement log showing blocked deployments.</p></td></tr><tr><td><p><b>Policy enforcement</b></p></td><td><p>58% have acceptable-use policies. 24% followed &quot;very consistently.&quot; Documented. Not practiced.</p></td><td><p>Agent pursued its goal past every boundary. Goal-seeking does not stop at a document the model never reads.</p></td><td><p>Are policies enforced by server-side gates or by agent compliance? What percentage of actions are gated?</p></td><td><p>Server-side gate audit trail with percentage of agent actions gated vs. ungated.</p></td></tr><tr><td><p><b>Trust thresholds</b></p></td><td><p>68% have seen hallucinations with operational impact. 49% of advanced users fully trust outputs.</p></td><td><p>No codified threshold separates auto-execute from human-review.</p></td><td><p>Which agent actions auto-execute versus require human review? Is that enforced in policy or in the platform?</p></td><td><p>Documented threshold matrix classifying every agent action as auto-execute or human-review.</p></td></tr><tr><td><p><b>Per-action authorization</b></p></td><td><p>Governance is the #1 barrier at 27%. Skills 20%. Tech 17%. Data 14%.</p></td><td><p>Oversight reviews quarterly. Agents act per-second.</p></td><td><p>Is per-action authorization enforced at runtime or only at deploy-time review? Can agents accumulate permissions without re-authorization?</p></td><td><p>Runtime authorization log showing per-action gate events and permission re-authorization timestamps.</p></td></tr></tbody></table><p><i>Source data from Ivanti, </i><a href="https://www.ivanti.com/resources/research-reports/scaling-ai-it-operations"><i>Scaling AI in IT Operations: The Path to Maturity in 2026</i></a><i> (n=1,500 IT professionals, 3,900 total employees, six countries, February–March 2026). Exclusive CISO sourcing by VentureBeat.</i></p><p>Evans put structure around the Clearwater board conversation. The bank CISO that Robbins described assumed AI is everywhere and governed from containment instead of discovery. Governance that tries to catalog every shadow AI tool will fail because the surface grows faster than any inventory.</p><p>At scaled, business-critical organizations, 54% of IT professionals say AI makes their work both faster and better, according to Ivanti. At early experimentation organizations, 24% say the same. At scaled organizations, accountability lives in the platform. At early ones, it lives in a document the agent never reads.</p><p>The six questions above give every CISO a way to test whether their governance actually works where it matters. At runtime, under load, and before the next renewal check clears.</p>]]></description>
            <author>louiswcolumbus@gmail.com (Louis Columbus)</author>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/VVD0axSaQb70MPEzSbZms/5dfd23c2b3da09d025fbc393412549c9/hero.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Attackers scale deception with AI. Defenders need truth at machine speed.]]></title>
            <link>https://venturebeat.com/security/attackers-scale-deception-with-ai-defenders-need-truth-at-machine-speed</link>
            <guid isPermaLink="false">51laSPpskSEetQ7nH4vhNO</guid>
            <pubDate>Mon, 15 Jun 2026 07:00:00 GMT</pubDate>
            <description><![CDATA[<p><i>Presented by Splunk</i></p><hr/><p>AI has changed the economics of cyber deception.</p><p>An attacker can now generate thousands of convincing phishing lures, fake identities, and tailored pretexts before a defender finishes a single change-control cycle. That is the new security challenge: deception got faster and cheaper, while verification did not.</p><p>Much of the discussion around AI for defense centers on detection models. Detection matters, but it is not the only bottleneck. The deeper constraint is evidence: where data lives, whether it is available when needed, how quickly it can be correlated, how long it is retained, and whether analysts or agents can trust what they retrieve.</p><p>Defense in the AI era is a data problem before it is a detection problem.</p><h2>The defender’s advantage is truth</h2><p>Attackers can afford to lie at enterprise scale. They can test endless combinations of messages, identities, domains, and attack paths, and most can fail at almost no cost.</p><p>Defenders do not have that luxury. Their advantage is truth: quickly knowing what happened, where, when, which identity was involved, which assets were affected, what changed, and what business process may be at risk.</p><p>That truth must be documented, governed, auditable, and defensible. Attackers are using AI to scale deception, impersonation, social engineering, and speed. Defenders need AI to scale verification.</p><p>The goal is not just to act faster than the attacker. It is to take action that people and machines can trust.</p><h2>Fragmented data breaks modern defense</h2><p>Consider a suspicious login from a contractor account. On its own, it is just another authentication anomaly. To know whether it matters, a security team may need identity history, endpoint activity, cloud access logs, ticketing records, asset ownership, configuration changes, network telemetry, and business context.</p><p>If those records sit in different tools, expire at different times, or require multiple teams to retrieve, defenders are not investigating the incident. They are negotiating with their own data estate.</p><p>When signals can be reached in place and correlated quickly, the issue is no longer just whether the login looks unusual. It becomes whether the enterprise has enough evidence, in enough context, to take action it can defend.</p><p>That challenge grows more urgent with AI assistants and agents. AI can only reason over what it can retrieve in time to matter. If the data is partial, stale, fragmented, unavailable, or stripped of context, AI does not create truth. It accelerates uncertainty.</p><h2>The system of record must become a defensive control plane</h2><p>For years, enterprises treated security platforms, SIEMs, and data lakes as passive repositories: places to store data for later search and analysis. That model is no longer enough.</p><p>What organizations now need is a defensive control plane: a layer that connects what happened, what it means, and what the enterprise is allowed to do about it. In architectural terms, it ties together raw machine data, business context, and policy. It does not just store evidence. It makes evidence usable for decisions and actions that must be explainable and trusted.</p><p>In practice, that means doing four things well: preserving evidence, reaching data wherever it lives, adding business context, and governing action. More on each below.</p><p>The old system of record answered one question: What is the official record?</p><p>A defensive control plane answers the questions that matter operationally: What happened? What does it mean? What evidence supports that conclusion? And what action can we trust?</p><p>AI does not reduce the need for authoritative records. It raises the standard for what those records must do.</p><h2>A defensive control plane must do four things</h2><ol><li><p><b>Preserve evidence. </b>Logs, metrics, traces, events, identity records, configuration changes, tickets, and asset state all help establish what happened. Their value often becomes clear only after an incident begins.</p></li><li><p><b>Make data accessible wherever it lives</b>. Security-relevant data is already spread across object stores, cloud platforms, operational tools, and business systems. Moving every byte into one place is often too slow, too expensive, and too difficult to govern. The better model is to bring analytics to the data.</p></li><li><p><b>Add business context. </b>Correlating machine data with business information turns “anomaly on host X” into “the system supporting payment services for top accounts is being probed.” That is what allows organizations to prioritize correctly.</p></li><li><p><b>Govern action</b>. In the agentic era, systems will do more than summarize incidents. They will enrich alerts, open cases, trigger workflows, isolate assets, update policies, and escalate decisions. Enterprises need to know what evidence an agent used, what policy governed the action, whether it stayed within scope, and how the decision can be reviewed afterward.</p></li></ol><h2>The real SOC problem is not too little data</h2><p>Modern SOCs are not suffering from a lack of data. They are suffering from a lack of usable context.</p><p>According to the Splunk State of Security 2025 report, SOC analysts continue to struggle with too many alerts (59%), too many false positives (55%), and alerts that lack context (46%). The issue is not data volume. It is the difficulty of turning fragmented signals into trusted decisions.</p><p>Today, analysts are left stitching together context manually, pivoting across disconnected tools, and making high-stakes decisions without the full picture in time. Even as AI improves, outcomes still depend on whether humans are willing to approve changes across fragmented environments.</p><p>This creates a daily crisis of context. Teams are forced to make consequential decisions based on data they cannot easily see, correlate, or trust. The result is latency, inconsistency, missed opportunities, and unnecessary risk.</p><h2>Trusted action is the durable advantage</h2><p>A data fabric architecture offers a way forward by creating a unified, intelligent layer across data sources spanning SecOps, ITOps, and NetOps. The goal is not centralization for its own sake. It is to break down silos and deliver context-rich insight at the speed AI-driven operations require.</p><p>This is an operating model before it is a product. AI-driven defense depends on a foundation that can preserve evidence, reach data where it lives, add context, and maintain a reviewable link between data, decision, and action. That is the architectural shift behind Cisco Data Fabric powered by the Splunk Platform, which brings together machine data, federation, business context, governance, and provenance to help teams move from signal to trusted action.</p><p>Attackers will keep making deception cheaper, faster, and more personalized. Defenders do not win that race by generating more noise. They win by making truth faster, and by grounding every action in evidence that people and machines can trust.</p><p><i>Learn more about the </i><a href="https://www.splunk.com/ciscodatafabric"><i>Cisco Data Fabric powered by the Splunk Platform</i></a>.</p><p><i>Seth Brickman is VP, Global Product - Splunk Platform, Cisco.</i></p><hr/><p><i>Sponsored articles are content produced by a company that is either paying for the post or has a business relationship with VentureBeat, and they’re always clearly marked. For more information, contact </i><a href="mailto:sales@venturebeat.com"><i><u>sales@venturebeat.com</u></i></a><i>.</i></p>]]></description>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/591Zr5l5GE1ceQqivX0qqw/9e29de38bbca6dedc88795bdf38bd92f/Gemini_Generated_Image_m4qphtm4qphtm4qp.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[NanoClaw and JFrog launch 'immune system' to block AI agents from downloading malicious code]]></title>
            <link>https://venturebeat.com/security/nanoclaw-and-jfrog-launch-immune-system-to-block-ai-agents-from-downloading-malicious-code</link>
            <guid isPermaLink="false">38lkGsiwBWZ1jd5vvMcudB</guid>
            <pubDate>Fri, 12 Jun 2026 16:46:00 GMT</pubDate>
            <description><![CDATA[<p>The creators of the hit, enterprise-friendly, open source OpenClaw variant <a href="https://venturebeat.com/orchestration/nanoclaws-creators-are-turning-the-secure-open-source-ai-agent-harness-into-an-enterprise-second-brain">NanoClaw</a> are partnering with software supply chain management leader <a href="https://jfrog.com/">JFrog</a> to launch a new, joint security integration they say will protect NanoClaw autonomous agents from malicious code injection. </p><p>&quot;These agents are doing things that you cannot necessarily control, and you cannot necessarily train,&quot; said Gal Marder, Chief Strategy Officer at JFrog, in an exclusive interview with VentureBeat.</p><p>Available immediately, the partnership hardwires NanoClaw agents directly to JFrog’s vetted software registries, ensuring that AI assistants can only pull scanned, safe dependencies. </p><p>The release addresses a rapidly growing blind spot in tech: autonomous agents frequently install packages in the background to extend their capabilities, often without their human operators&#x27; knowledge or oversight. </p><p>&quot;The people who are operating the agents are not necessarily developers, and they are not even aware of the implications,&quot; explained Gavriel Cohen, creator of NanoClaw and CEO and co-founder of its new commercial services startup, <a href="http://nanoco.ai/">NanoCo AI</a>. </p><p>To secure the broader ecosystem, the partners are <!-- -->working to make<!-- --> it available completely free of charge for the open-source community, while enterprise organizations can seamlessly route their agents through their existing, commercially licensed JFrog environments.</p><p>The new technical capability enabled by this partnership follows NanoCo&#x27;s moves to add permissions dialogs across the apps in which it&#x27;s available via <a href="https://venturebeat.com/orchestration/should-my-enterprise-ai-agent-do-that-nanoclaw-and-vercel-launch-easier-agentic-policy-setting-and-approval-dialogs-across-15-messaging-apps">a partnership with Vercel</a>, and a <a href="https://venturebeat.com/infrastructure/nanoclaw-and-docker-partner-to-make-sandboxes-the-safest-way-for-enterprises">new partnership with Docker to allow NanoClaw</a> agents to run more securely, isolated from other software environments directly inside Docker virtual containers. </p><h2><b>The risk of current, personal autonomous AI agents </b></h2><p>When an operator interacts with an autonomous system like NanoCo&#x27;s NanoClaw, they communicate at a high level of abstraction. </p><p>A user might simply send an audio file or a voice note, prompting the agent to independently figure out how to process it. </p><p>As Cohen explained, the agent thinks, &quot;oh, I can&#x27;t understand voice notes, so let me go and grab a package and download something and install it and set it up and run it&quot;.</p><p>This dynamic self-improvement makes AI agents incredibly powerful, but it also renders them highly susceptible to software supply chain attacks. </p><p>Bad actors are increasingly poisoning open-source registries with malicious packages. Because agents act autonomously to fetch what they need, they bypass human scrutiny. </p><p>The operators, who may not even be developers, are largely unaware of the security implications unfolding behind the scenes.</p><h2><b>How NanoCo and JFrog are working to stop agents from running malicious code</b></h2><p>The integration between NanoCo and JFrog acts as an automated immune system for these AI environments.</p><p>Under the hood, NanoClaw agents are now configured to route their requests for software packages, CLI tools, and Model Context Protocol (MCP) servers exclusively through JFrog’s registries.</p><p>If an agent attempts to download a compromised library—such as a vulnerable version of the popular Axios package—the JFrog registry intercepts the request.</p><p>It blocks the installation, returning a security policy error to the agent, noting that the request was &quot;rejected by JFrog&#x27;s registry with a 403 security policy&quot;. </p><p>Crucially, the system does not just stop at blocking the threat; it creates a dynamic correction loop. The agent is notified of the vulnerability and guided to automatically seek out and install an approved, non-malicious version of the requested package instead.</p><p>For large organizations, this integration solves a massive compliance headache. Marder notes that as enterprises adopt autonomous agents, they require absolute visibility. </p><p>Organizations need &quot;a system of record, we need somewhere to track what agents that&#x27;s running by whom and consuming what packages and using what skills and using what MCPs,&quot; he told VentureBeat.</p><p>Beyond visibility, the JFrog integration provides a foundational &quot;trust layer&quot; and strict governance over what these automated systems are permitted to access.</p><h2><b>Licensing and accessibility</b></h2><p>In the realm of software distribution, licensing and access parameters dictate adoption. The NanoCo and JFrog partnership utilizes a dual-track approach to serve both individual open-source developers and highly regulated enterprises.</p><p>For the open-source community, the integration is completely free. JFrog is providing open-source NanoClaw users with complimentary access to safe, vetted sources of artifacts, tools, and skills. </p><p>This allows individual developers to run autonomous agents locally without drowning in manual approval requests for every single dependency. Furthermore, as community members build and share new &quot;skills&quot; for the agents, these contributions are uploaded to the registry, scanned for malicious code, and cleared before anyone else can use them. </p><p>This infrastructure directly neutralizes the threat of poisoned community repositories.</p><p>For enterprise deployments, the architecture plugs seamlessly into an organization&#x27;s existing commercial environment. Rather than using the public open-source registry, corporate users point their NanoClaw agents to their own internal JFrog registries. </p><p>This ensures that all agent activity adheres to the company’s specific commercial licenses, internal security policies, visibility needs, and governance standards.</p><p>As AI continues to blur the line between human intent and machine execution, the infrastructure securing that execution must evolve. This partnership acknowledges a core reality: you cannot train an AI to perfectly recognize every zero-day vulnerability; instead, you must build an environment where the agent simply cannot reach the vulnerability in the first place.</p>]]></description>
            <author>carl.franzen@venturebeat.com (Carl Franzen)</author>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/77BUvHRR5neMsb4vlwRip2/ed64951fbd22cd58addb8d99b6f977df/Gemini_Generated_Image_xz3q0ixz3q0ixz3q__2_.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Meta's AI support agent bound recovery emails for anyone who asked. Your SOC never saw an alert.]]></title>
            <link>https://venturebeat.com/security/meta-ai-support-agent-recovery-email-takeover-soc-audit-grid</link>
            <guid isPermaLink="false">5RjObu657JLLRcXlMWh4dH</guid>
            <pubDate>Fri, 05 Jun 2026 16:42:50 GMT</pubDate>
            <description><![CDATA[<p>Meta&#x27;s AI support agent bound recovery emails to accounts for whoever asked, and SOCs never saw an alert. An authorized agent writes a log of legitimate transactions, so nothing in the detection stack fired. Attackers asked the bot to make the change, took the one-time code it sent, <a href="https://www.404media.co/hackers-simply-asked-meta-ai-to-give-them-access-to-high-profile-instagram-accounts-it-worked/">and ran the password reset</a>, 404 Media reported.</p><p>No malware, no stolen credentials, and no prompt injection in the sense most security teams drill for. The agent did exactly what Meta built it to do. That is what should keep a security operations leader up at night: The takeover did not break a control; it rode one that was already trusted.</p><p>What a SOC needs is a way to walk each recovery path through an audit grid with its AI build team before the next renewal closes. The AI Authority Audit Grid at the end of this article maps every authentication write a support agent can make on the recovery path, what Meta&#x27;s incident proved about each one, why it stays dark to the SOC, and the control that closes it.</p><h2><b>The agent is an authorized actor, so the SOC reads the takeover as routine traffic</b></h2><p>From inside the detection stack, the attack produced no signal the stack could read. The agent binds a new email, then resets the password, and <a href="https://genai.owasp.org/llmrisk/llm062025-excessive-agency/">identity and access management</a> logs both writes as an authorized actor, so each lands in the authentication state as a legitimate transaction. No anomalous login, no failed-auth spike, nothing for EDR or DLP, no SIEM rule to match, because nothing in the sequence looks like an attack. The takeover lived inside the trust boundary the stack assumes is safe. There is no foothold to find, because the agent was the foothold, and it was supposed to be there.</p><p>The chain was almost insulting in its simplicity. Brian Krebs documented the <a href="https://krebsonsecurity.com/2026/06/hackers-used-metas-ai-support-bot-to-seize-instagram-accounts/">version pro-Iran hackers posted to Telegram on May 31</a>. The attacker <a href="https://www.bbc.com/news/articles/c98rzr72dpyo">switched on a VPN to appear in the victim&#x27;s region</a>, sidestepping Instagram&#x27;s location alarms, then asked the support assistant to add a new email and send a verification code, as the BBC confirmed from the same recordings. The bot complied, sending the one-time code straight to the attacker, <a href="https://gizmodo.com/hackers-tricked-meta-ai-into-handing-out-access-to-major-instagram-accounts-2000766087">Gizmodo reported</a>. The reset finished and the owner was locked out, in minutes. The exploit failed against any account with MFA enabled, according to Krebs.</p><p>The hijacked accounts were not soft targets. They included Sephora, U.S. Space Force senior enlisted leader Chief Master Sergeant John Bentivegna, researcher Jane Manchun Wong, and a dormant Obama White House handle that briefly posted a defaced image, according to <a href="https://www.404media.co/hackers-simply-asked-meta-ai-to-give-them-access-to-high-profile-instagram-accounts-it-worked/">404 Media</a>. <a href="https://techcrunch.com/2026/06/03/instagram-is-alerting-users-who-were-targeted-by-hackers-during-ai-chatbot-attacks/">Meta disputes the Obama account</a>, according to TechCrunch, and called claims that leaders&#x27; accounts were breached &quot;completely false,&quot; according to the BBC. The rest stand.</p><h2><b>MFA held. The recovery path beside it did not.</b></h2><p>The detail that decided who survived was narrow. Krebs reported the attack failed against any account with multifactor authentication, even SMS. The recovery path beside it was the gap. When that path asked for a selfie video, <a href="https://www.ghacks.net/2026/06/03/instagram-accounts-hijacked-by-tricking-meta-ai-support-into-verifying-attackers-as-owners/">attackers ran the target&#x27;s public photos through an AI video generator</a> and submitted the clip, which Meta accepted as valid identity verification, gHacks reported. Either way the failure was the recovery door, not the login door MFA guards.</p><p>That makes this an architecture problem, not a Meta problem. MFA gates the login path for owner and attacker alike, but the recovery path runs beside it, built to relax the usual checks because it exists for the moment a user has lost the normal way in. Meta put an agent on that path with write access to authentication state and no deterministic check between a convincing request and a committed change. Authorization cannot live inside the model, because a conversational system can be talked into skipping a check. It has to live outside the model, in a gate the agent cannot reason its way past. Security researchers have a name for this pattern, the confused deputy, a trusted system tricked into spending its privileges on an attacker&#x27;s behalf.</p><p>This is not the last support agent that will hand over an account. Ian Goldin, a threat researcher at Lumen&#x27;s Black Lotus Labs, told Krebs on Security that AI bots are as easy to social engineer as the human agents they replace, and just as eager to help. &quot;AI chatbots create interesting new attack surface, and we&#x27;re likely going to see a lot more of these kinds of attacks,&quot; Goldin said. Every enterprise wiring an agent into a recovery, provisioning, or password flow is shipping the same write access Meta did.</p><p>Simon Willison, who coined the term prompt injection, put it plainly on <a href="https://simonwillison.net/2026/Jun/1/hackers-simply-asked-meta-ai/">his blog</a>. &quot;Meta really did wire their support system into an AI chatbot that had the ability to fast-forward through the entire account recovery process,&quot; he wrote. &quot;This one hardly even qualifies as a prompt infection. Don&#x27;t wire your support bot up to allow one-shot account takeovers.&quot; The attacker never tricked the agent. The attacker asked, and the agent had untrusted input, write access, and a way to execute, all at once.</p><p>OWASP named this class before Meta shipped it, as Excessive Agency at <a href="https://genai.owasp.org/llmrisk/llm062025-excessive-agency/">LLM06</a> and Identity and Privilege Abuse at <a href="https://genai.owasp.org/2025/12/09/owasp-top-10-for-agentic-applications-the-benchmark-for-agentic-security-in-the-age-of-autonomous-ai/">ASI03 in the Agentic AI Top 10</a>. The warning label was on the box: Meta pushed the assistant to every Facebook and Instagram account in March, according to 404 Media, with the power to reset passwords and handle recovery, the product page promising &quot;solutions, not just suggestions&quot; under the line &quot;account security and recovery.&quot; Meta gave the agent the power and never built the gate to govern it.</p><h2><b>The AI Authority Audit Grid</b></h2><p>Security operations leaders need to run this against their own support agent before the next renewal closes. Each row is an authentication write the agent makes on the recovery path, with what Meta proved, why your stack misses it, and the control that closes it.</p><table><tbody><tr><td><p><b>Authentication write</b></p></td><td><p><b>What Meta proved</b></p></td><td><p><b>Why your stack misses it</b></p></td><td><p><b>Enterprise control and owner</b></p></td></tr><tr><td><p>Login authentication (MFA, factor prompts)</p></td><td><p>Held on login. Accounts with any MFA enabled, even SMS, survived (<a href="https://krebsonsecurity.com/2026/06/hackers-used-metas-ai-support-bot-to-seize-instagram-accounts/">Krebs</a>). The gap was the recovery path beside it.</p></td><td><p>MFA gates the login path for owner and attacker alike. It does not gate the recovery path beside it.</p></td><td><p>Enforce MFA as the baseline and extend step-up verification to the recovery path, the same standard login gets (<a href="https://cheatsheetseries.owasp.org/cheatsheets/AI_Agent_Security_Cheat_Sheet.html">OWASP</a>). A selfie video is not proof of identity. Any agent that operates on a path MFA does not cover fails the audit. Owner: IAM.</p></td></tr><tr><td><p>Email rebind</p></td><td><p>Full takeover. The agent bound attacker-controlled emails on request, taking Sephora and a U.S. Space Force account (<a href="https://www.404media.co/hackers-simply-asked-meta-ai-to-give-them-access-to-high-profile-instagram-accounts-it-worked/">404 Media</a>).</p></td><td><p>IAM logs the agent as an authorized actor, so the rebind reads as a legitimate transaction and no alert reaches the SOC or the account owner.</p></td><td><p>Confirm out-of-band to the existing verified contact before any rebind commits, gated outside the model, and notify the old address the moment it changes (<a href="https://community.ibm.com/community/user/blogs/shane-weeden1/2020/02/28/account-recovery-is-just-another-authentication-me">IBM</a>). An agent that rebinds without confirming the old address fails. Owner: IAM and platform engineering.</p></td></tr><tr><td><p>Password reset</p></td><td><p>Full takeover in minutes. Researcher Jane Manchun Wong was among the affected accounts (<a href="https://www.404media.co/hackers-simply-asked-meta-ai-to-give-them-access-to-high-profile-instagram-accounts-it-worked/">404 Media</a>).</p></td><td><p>The reset runs on the recovery path, outside the login MFA check, so no factor prompt fires and no detection rule triggers.</p></td><td><p>Require a second non-email factor before any reset completes. NIST dropped email as a valid out-of-band channel (<a href="https://pages.nist.gov/800-63-4/sp800-63b/authenticators/">NIST 800-63B</a>). An agent reset must clear the same gate a human reset does. Owner: IAM.</p></td></tr><tr><td><p>Recovery-method change</p></td><td><p>Persistent lockout. Victims could not self-recover. The support loop offered only AI with no human escalation (<a href="https://www.bleepingcomputer.com/news/security/instagram-users-locked-out-after-meta-ai-abused-to-steal-accounts/">BleepingComputer</a>).</p></td><td><p>A silent swap of the recovery email or phone removes the owner&#x27;s re-entry path with no SOC visibility.</p></td><td><p>Require step-up review on any change, notify the prior method, and grant time-delayed, reduced-scope access after recovery so a swap never hands over instant control (<a href="https://www.authsignal.com/blog/articles/account-recovery-is-the-identity-industrys-most-overlooked-challenge">Authsignal</a>). Keep a human escalation path the agent cannot close. Owner: GRC and IT operations.</p></td></tr><tr><td><p>Account-action execution</p></td><td><p>Speed risk. A dormant Obama White House handle briefly showed a defaced image during the spree, an account Meta disputes was taken this way (<a href="https://techcrunch.com/2026/06/03/instagram-is-alerting-users-who-were-targeted-by-hackers-during-ai-chatbot-attacks/">TechCrunch</a>).</p></td><td><p>The agent executes irreversible state changes in seconds with no human in the loop and no reversibility window.</p></td><td><p>Separate decision from execution. The agent only proposes the action. A policy service validates scope and approval before it runs, with approval bound to the exact action (<a href="https://cheatsheetseries.owasp.org/cheatsheets/AI_Agent_Security_Cheat_Sheet.html">OWASP</a>). No auth-state write commits without that gate and a reversibility window. Owner: platform engineering and the AI build team.</p></td></tr><tr><td><p>Agent action logging</p></td><td><p>Detection gap. The takeover left no alert, and Meta has not published how many accounts fell before the patch (<a href="https://techcrunch.com/2026/06/03/instagram-is-alerting-users-who-were-targeted-by-hackers-during-ai-chatbot-attacks/">TechCrunch</a>).</p></td><td><p>Without per-action telemetry piped to the SIEM, an authorized-agent takeover is invisible to the SOC.</p></td><td><p>Emit structured decision metadata for every auth-state write into the SIEM: action class, authorization outcome, approval ID, result, policy version (<a href="https://cheatsheetseries.owasp.org/cheatsheets/AI_Agent_Security_Cheat_Sheet.html">OWASP</a>). A write your SIEM cannot see is a write you cannot defend. Owner: SOC and detection engineering.</p></td></tr></tbody></table><p>The fix is not bolting yet another MFA prompt onto the login screen. The people who survived Meta’s incident were the ones who already had that control in place.</p><p>The fix is pulling authorization out of the recovery path’s honor system and putting it behind a gate that does not move just because a prompt sounds convincing. Build the agent so the SOC sees every write it makes, and so any write that changes who owns an account cannot commit without a check that the model does not control.</p><p>Meta just showed what happens when the most trusting employee on the team is also the one holding the keys. The next agent like that is already reading your intellectual property and financials.</p>]]></description>
            <author>louiswcolumbus@gmail.com (Louis Columbus)</author>
            <category>Security</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/38WzL1nPF6nI78v2ZkNhqv/24ab8b1e4f85b1a21a123522a39ded31/hero_image.png?w=300&amp;q=30" length="0" type="image/png"/>
        </item>
        <item>
            <title><![CDATA[Microsoft launches MXC, an OS-level sandbox for AI agents, with OpenAI and Nvidia already on board]]></title>
            <link>https://venturebeat.com/security/microsoft-launches-mxc-an-os-level-sandbox-for-ai-agents-with-openai-and-nvidia-already-on-board</link>
            <guid isPermaLink="false">5lEA8BoEKR9x9BP7gxCkZO</guid>
            <pubDate>Tue, 02 Jun 2026 16:30:00 GMT</pubDate>
            <description><![CDATA[<p>For the past two years, the technology industry has raced to make AI agents more capable — teaching them to write code, navigate software interfaces, manage files, and orchestrate multi-step workflows with increasing autonomy. What the industry has not done, at least not with any consistency, is answer the question that keeps chief information security officers awake at night: what happens when an agent goes wrong?</p><p>On Tuesday at its annual <a href="https://news.microsoft.com/build-2026/">Build</a> developer conference, Microsoft offered what may become the definitive answer. The company introduced <a href="https://aka.ms/Windows-Build2026">Microsoft Execution Containers</a>, or MXC — a policy-driven execution layer, built into the Windows operating system itself, that lets developers and IT administrators declare exactly what an AI agent can and cannot access, with those boundaries enforced at runtime by the OS kernel.</p><p>The announcement, <a href="https://aka.ms/Windows-Build2026">buried within a sweeping set of developer-focused updates</a>, is arguably the most consequential platform move Microsoft made at Build this year, and it has the potential to reshape how every enterprise on Earth thinks about deploying autonomous AI software.</p><p>MXC is not a product you buy. It is an SDK and a policy model — a foundational primitive embedded in Windows and the Windows Subsystem for Linux — that provides what Microsoft calls a &quot;<a href="https://aka.ms/Windows-Build2026">composable sandbox spectrum</a>.&quot; That spectrum ranges from lightweight process isolation, already adopted by GitHub Copilot&#x27;s command-line interface, all the way up to micro-virtual machines, Linux containers, and full cloud instances running on Windows 365.</p><p>The system separates an agent&#x27;s execution from the user&#x27;s desktop, clipboard, user interface, and input devices. Critically, it binds every agent to a strong identity — either a local ID or a cloud-provisioned identity backed by Microsoft Entra — so that every action the agent takes can be attributed, audited, and governed.</p><p>The implications are enormous. Until now, the enterprise deployment of AI agents has been stuck in a paradox: the more autonomous and useful an agent becomes, the more dangerous it is to let it operate on a corporate network without guardrails. MXC is Microsoft&#x27;s attempt to break that paradox — not by making agents less capable, but by making the environment they operate in fundamentally more controlled.</p><h2>Why every autonomous AI agent is a security incident waiting to happen</h2><p>To understand why MXC matters, consider what an AI agent actually does when it runs on your computer. Unlike a traditional application, which operates within well-understood boundaries — a word processor reads and writes documents, a browser fetches web pages — an AI agent is, by design, unpredictable. It receives a goal in natural language, reasons about how to achieve it, and then takes actions: opening files, executing code, calling APIs, browsing the web, interacting with other software. Each of those interactions creates what security professionals call &quot;attack surface.&quot;</p><p>Microsoft&#x27;s own blog post framed the challenge in stark terms. The company wrote that &quot;as agents become more capable and autonomous, they&#x27;re delivering material productivity gains. But they&#x27;re also introducing new risk, and the issue isn&#x27;t just the agent. It&#x27;s the entire system the agent operates across.&quot; Every interaction between agents and humans, tools, applications, models, and other agents &quot;exposes new attack surface and introduces different failure modes.&quot; Microsoft characterized this as &quot;a multi-layer systems problem.&quot;</p><p>This is not a theoretical concern. In the months leading up to <a href="https://news.microsoft.com/build-2026/">Build</a>, security researchers demonstrated numerous ways that AI agents could be manipulated — through prompt injection, through malicious tool calls, through data exfiltration disguised as normal workflow. For enterprises that handle sensitive data, proprietary models, and regulated information, the absence of a trusted execution environment has been the single biggest barrier to moving agents from demo to deployment.</p><h2>Microsoft&#x27;s answer is a sandbox that scales from a single process to a full virtual machine</h2><p>MXC operates on a deceptively simple principle: declare what the agent can do before it runs, and let the operating system enforce those declarations at runtime. A developer or an IT administrator writes a policy that specifies which files, directories, and network resources an agent is allowed to access. MXC then creates a contained execution environment — a sandbox — that enforces those boundaries regardless of what the agent attempts to do.</p><p>What makes MXC unusual, and potentially very powerful, is the breadth of its isolation options. Microsoft designed the system so that a single SDK and policy model can map to the appropriate isolation construct for any given workload. For a lightweight coding assistant that just needs to read the current project directory, fast process isolation may be sufficient. For an autonomous agent that executes arbitrary code downloaded from the internet, a full micro-VM may be required. The system is designed to be &quot;dynamically composable based on intent and risk,&quot; meaning that the level of isolation can be adjusted based on what the agent is actually doing, not just what category it falls into.</p><p>Session isolation is a particularly important feature. MXC separates the agent&#x27;s execution from the user&#x27;s desktop, clipboard, UI, and input devices. This directly mitigates several classes of attacks that security researchers have identified as particularly dangerous for AI agents: UI spoofing, where an agent manipulates what the user sees to trick them into approving a malicious action; input injection, where an agent sends keystrokes or mouse clicks to other applications; and cross-session data leakage, where information from one user&#x27;s session bleeds into another.</p><h2>A live demo showed an AI agent trying to delete files — and failing, because the OS wouldn&#x27;t let it</h2><p>During a pre-briefing with VentureBeat the night before the announcement, a Microsoft developer offered a vivid demonstration of the technology in action. He had set up the open-source agent framework <a href="https://openclaw.ai/">OpenClaw</a> running inside MXC&#x27;s sandbox on his personal development machine. He then instructed the agent to delete all the files on his desktop. The agent attempted to comply — but the sandbox prevented it. &quot;If you look at my desktop here, you see how clean my desktop is,&quot; the developer said during the demo. &quot;That&#x27;s a lie.&quot; The files, he explained, were completely safe because &quot;the container won&#x27;t allow it.&quot;</p><p>The demonstration went further, showcasing the granularity of MXC&#x27;s controls. Users can mark specific files as read-only for the agent, restrict access to the browser and screen capture, control whether the agent can see location data, and have all of those permissions managed centrally by an enterprise IT department through Intune policies. The agent operates inside what is effectively a one-way mirror: it can do the work it has been asked to do, but it cannot see or touch anything outside the boundaries that its policy defines.</p><p>Pavan Davuluri, Microsoft&#x27;s Executive Vice President for Windows and Devices, underscored during the pre-briefing that the primitives MXC introduces — security, containment, isolation, and user control — are essential to making AI agents commercially viable.</p><p>He emphasized that these capabilities are &quot;not unique to OpenClaw&quot; and that &quot;this pattern repeats itself over and over&quot; for any agent running on a Windows device. The primitives that exist in the operating system now &quot;for the file around security, containment, isolating them, having users in control,&quot; he said, are what will make agents safe enough for ordinary consumers and corporate deployments alike.</p><h2>Defender, Entra, Intune, and Purview integration arriving in July turns MXC into an enterprise control plane</h2><p>For corporate IT departments, the most significant element of the <a href="https://openclaw.ai/">MXC announcement</a> is not the SDK itself but its integration with Microsoft&#x27;s existing enterprise security stack through what the company calls Agent 365. Arriving in preview in July, <a href="https://www.microsoft.com/en-us/microsoft-agent-365">Agent 365</a> layers Microsoft&#x27;s Entra identity service and Intune device management platform on top of MXC, so that IT administrators can govern agent containment centrally while developers choose the level of isolation their workload demands.</p><p>The integration goes further: <a href="https://www.microsoft.com/en-us/microsoft-365/microsoft-defender-for-individuals">Microsoft Defender</a> will provide runtime threat protection, <a href="https://www.microsoft.com/en-us/security/business/microsoft-entra">Entra</a> will handle identity and access management, Intune will enforce device-level policies, and <a href="https://www.microsoft.com/en-us/security/business/microsoft-purview">Microsoft Purview</a> will extend its data governance and compliance capabilities to agent activity. This means that an enterprise could, in theory, allow employees to run AI agents on their corporate machines — even powerful, autonomous agents that execute code and manage files — while maintaining the same kind of centralized visibility and control that IT departments currently have over traditional applications.</p><p>Microsoft described the identity layer in its <a href="https://aka.ms/Windows-Build2026">official blog</a>: &quot;Windows assigns agents a local ID or a cloud provisioned identity backed by Entra and attributes all activity from the container to that identity, so you can clearly differentiate human from agent.&quot; For regulated industries — financial services, healthcare, government — the ability to produce an audit trail that distinguishes between human actions and agent actions on the same machine could prove to be a regulatory requirement, not merely a nice-to-have feature. Every agent action attributable to a specific identity, every containment boundary enforceable through the same policy infrastructure that already governs hundreds of millions of Windows devices — this is the architecture that could finally move AI agents from pilot programs to production.</p><h2>OpenAI, Nvidia, Manus, and Nous Research are already building on MXC — and that changes the calculus</h2><p>Platform announcements at developer conferences are often aspirational. What distinguishes the MXC launch is the breadth and specificity of the partners already building on it. Microsoft named five: <a href="https://openai.com/">OpenAI</a>, <a href="https://www.nvidia.com/en-us/">Nvidia</a>, <a href="https://manus.im/">Manus</a>, <a href="https://nousresearch.com/">Nous Research</a> (maker of the Hermes agent), and the <a href="https://openclaw.ai/">OpenClaw</a> open-source project. Each is integrating MXC in a distinct way that illuminates a different use case for the technology.</p><p>OpenAI&#x27;s involvement is particularly striking. David Wiesen, a member of OpenAI&#x27;s technical staff, said that &quot;working with Microsoft on the Microsoft Execution Containers (MXC) allows us to explore new patterns for AI agents to safely and efficiently generate and execute code.&quot; He added that by combining Codex&#x27;s capabilities with MXC&#x27;s execution environment, the goal is &quot;to help developers move from intent to reliable execution faster, while maintaining the security and control enterprises need.&quot; The reference to <a href="https://openai.com/codex/">Codex</a> — OpenAI&#x27;s code-generation agent — suggests that MXC could become the default execution environment for one of the most widely anticipated agent products in the industry.</p><p>Nvidia is bringing its <a href="https://docs.nvidia.com/openshell/home">OpenShell framework</a> to Windows built on MXC, providing what Microsoft described as &quot;an easy-to-deploy package for autonomous, always-on agents safely.&quot; Manus, the Chinese-born AI agent startup that gained viral attention earlier this year, is also integrating. Tao Zhang, Manus&#x27;s Chief Product Officer, said that MXC &quot;gives developers a policy-driven way to define what an agent can access and enforce those boundaries at runtime, so more autonomous agents can operate safely in enterprise environments.&quot; And Dillon Rolnick, the CEO of Nous Research, offered what may be the most concise articulation of why MXC matters: &quot;Continuously-running local agents, like Hermes Agent, require intentional isolation. Developers need control over what an agent can access and trust that those controls will hold.&quot;</p><h2>How an open-source agent framework became Microsoft&#x27;s proving ground for AI safety on Windows</h2><p>One of the more revealing stories behind the MXC announcement involves <a href="https://openclaw.ai/">OpenClaw</a>. During the press pre-briefing, a Microsoft developer described how the partnership came together organically — Peter Steinberger, OpenClaw&#x27;s creator, sent him a direct message in January expressing interest in collaborating. What began as a casual conversation evolved into a full-fledged platform partnership, with Microsoft developers contributing to the OpenClaw Windows companion app, built as a native WinUI application rather than a wrapped web app.</p><p>The OpenClaw integration serves as what Scott called &quot;the ultimate test app for all the stuff that [the Windows platform team] is making.&quot; If OpenClaw — which by its nature gives agents broad autonomy to execute tasks on a user&#x27;s machine — can run securely within MXC&#x27;s containment boundaries, then the containment system is robust enough for any agent. Scott explained the philosophy driving the work: &quot;Think of OpenClaw Windows as the ultimate test app... If OpenClaw can succeed on Windows, that means that the Linux support is there, the container support is there, the containment is there.&quot;</p><p>The companion app demonstrates the full spectrum of MXC&#x27;s enterprise controls — file permissions, network access, screen capture restrictions, location data — all manageable centrally through Intune policies. Microsoft donated the project to OpenClaw and plans to continue contributing to it as open source. As one member of the Windows leadership team put it during the briefing: &quot;All agents, all comers, everyone is welcome on Windows... It&#x27;s going to run great on Windows, because the primitives are there. The base of the pyramid is solid.&quot;</p><h2>Building containment into the OS gives Microsoft a strategic edge over Apple&#x27;s walled garden and Google&#x27;s cloud-first model</h2><p>MXC arrives at a moment when the technology industry is grappling with a fundamental tension. AI agents represent what may be the most significant new category of software since mobile applications, and every major technology company is racing to build them. But the security and governance infrastructure required to deploy these agents responsibly in enterprise environments barely exists. Microsoft&#x27;s approach is distinctive because it locates the trust layer at the operating system level rather than in the agent framework, the model provider, or a third-party security product.</p><p>This is a deliberate architectural choice. By building containment into Windows itself, Microsoft ensures that the security guarantees hold regardless of which agent, which model, or which framework a developer chooses.</p><p>It also means that the hundreds of millions of Windows devices already managed through <a href="https://www.microsoft.com/en-us/security/business/microsoft-intune">Intune</a> and secured through <a href="https://www.microsoft.com/en-us/microsoft-365/microsoft-defender-for-individuals">Defender</a> can, in principle, become agent-ready through a software update rather than a rip-and-replace deployment.</p><p>Apple&#x27;s approach to AI agents leans heavily on its walled-garden ecosystem, offering security through restriction — limiting which agents can run and what they can do. Google&#x27;s approach, centered on its cloud infrastructure, offers security through centralization. Microsoft&#x27;s approach offers security through declaration and enforcement — allowing any agent to run, but containing its impact through OS-level policy.</p><p>For enterprises that operate in heterogeneous environments with diverse toolchains and multiple AI providers, the Microsoft model may prove the most practical. The competitive dynamics are already shifting: with OpenAI&#x27;s <a href="https://openai.com/codex/">Codex</a>, Nvidia’s <a href="https://build.nvidia.com/openshell">OpenShell</a>, and independent agent frameworks like <a href="https://manus.im/">Manus</a> and <a href="https://hermes-agent.nousresearch.com/">Hermes</a> all building on MXC, Microsoft is positioning Windows not just as the platform where agents run, but as the platform where agents can be trusted to run.</p><h2>The hardest part isn&#x27;t building the sandbox — it&#x27;s writing the policies that go inside it</h2><p>MXC is available now in early preview, meaning developers can begin building against the SDK and testing containment policies. The Agent 365 integration with Defender, Entra, Intune, and Purview is scheduled for preview in July — a timeline aggressive enough to suggest that much of the engineering work is already done, but far enough out to allow for refinement based on developer feedback.</p><p>The real test, however, will come when enterprises begin deploying agents at scale on production networks. Containment is only as good as the policies that govern it, and writing effective agent policies for complex enterprise environments will be an entirely new discipline — one that IT departments have not yet developed and that no vendor has yet figured out how to teach. The technology is promising, but an empty sandbox is just an empty box. Filling it with the right rules, for the right agents, in the right contexts, will require a level of organizational sophistication that most companies are only beginning to contemplate.</p><p>Still, the significance of what Microsoft announced on Tuesday is difficult to overstate. For the first time, a major operating system vendor has proposed a comprehensive, kernel-level answer to the question of how autonomous AI software should be contained, identified, and governed on the devices where most of the world&#x27;s work actually gets done. The industry spent two years teaching agents to act. Microsoft is now betting that the bigger business — and the harder engineering problem — is teaching the operating system to watch.</p><p>
</p>]]></description>
            <author>michael.nunez@venturebeat.com (Michael Nuñez)</author>
            <category>Security</category>
            <category>Technology</category>
            <category>Infrastructure</category>
            <enclosure url="https://images.ctfassets.net/jdtwqhzvc2n1/2Bj8ehmUSTCeqnkJ3pPCjc/f9782b3575c73ccecb809afd58e7acd2/Nuneybits_Vector_art_of_the_iconic_Microsoft_Windows_logo_on_a__b8c7cdb1-4983-4e68-94a9-93fbef23357b.webp?w=300&amp;q=30" length="0" type="image/webp"/>
        </item>
    </channel>
</rss>