VentureBeat | Transformative tech coverage that matters

May 29, 2026

nuneybits Retro CRT monitor projecting a holographic burnt oran 0a706390-7ee6-462d-b5c8-ca2ac337272e — Credit: VentureBeat made with Midjourney

Technology

Mistral AI launches Vibe, expands into industrial AI and announces data center push to challenge OpenAI

May 28, 2026

test-time scaling strategy — Image credit: VentureBeat with ChatGPT

Orchestration

Researchers automated LLM reasoning strategy design and cut token usage by 69.5%

May 28, 2026

Orchestration

Open source — CleoP made with Midjourney

Pinterest cut AI costs 90% by gutting a frontier model's vision layer

At 620M users, frontier model API calls aren't viable. Pinterest CTO Matt Madrigal on gutting Qwen3-VL's vision layer — and the 90% cost cut that followed.

Taryn Plumb

May 29, 2026

26AIT Temporal sg 01 12 25 09cleaned — Preeti Somal, Senior VP Engineering, Temporal Technologies

VB Event

AI agents are entering their rebuild era as enterprises confront the reliability problem

As enterprise AI agents move into production, organizations are confronting a growing reliability problem. Many teams are discovering that LLM performance alone does not determine whether agents succeed in production. Long-running AI workflows must survive crashes, preserve state, recover from failures, manage inference costs, and coordinate across APIs, tools, and enterprise systems.

VB Staff

May 29, 2026

AI agents are quietly generating chaos engineering failures enterprises don’t track yet

There is a category of production incident that engineering teams are not tracking yet — because it doesn't fit any existing postmortem template.

Sayali Patil

May 24, 2026

ai agent using the terminal — Image credit: VentureBeat with ChatGPT

Your AI agents need a terminal, not just a vector database

DCI lets AI agents grep, trace, and verify data directly — no embeddings needed. Researchers say it's faster and cheaper than vector search for complex tasks.

May 22, 2026

Infrastructure

Merck and Mastercard are seeing real agentic AI results. Both say the plumbing came first.

Merck shrank a drug discovery cycle by a year. Mastercard is rebuilding how fraud disputes get resolved. Both say agents only work if the infrastructure is already there.

Taryn Plumb

May 27, 2026

Nuneybits Vector art of a single neon yellow-green AI eye open e84a55ca-5cf2-49ff-9486-757f960ca6dc — Credit: VentureBeat made with Midjourney

Resolve AI says the AI coding boom is breaking production systems. It wants to fix that.

The centerpiece of the release is a new multi-agent investigation system developed by Resolve AI's in-house research lab. Instead of deploying a single AI agent to diagnose a production failure — analogous to a lone engineer pulling an on-call shift — the platform now dispatches a coordinated team of specialized agents that pursue multiple hypotheses in parallel, independently verify each other's conclusions, and construct complete causal chains from root cause to symptom. The company says the architecture delivers more than a twofold improvement in root cause accuracy on its internal evaluation benchmarks compared to earlier versions of its platform.

May 21, 2026

Nuneybits Vector art of cobalt chip towering servers in burnt o e4e68375-d5c6-4559-87a7-d92ffb2bf67a-1 — Credit: VentureBeat made with Midjourney

Cerebras says its chips run a trillion-parameter AI model nearly 7 times faster than GPU clouds

Less than a week after completing the largest tech IPO of 2026, Cerebras Systems is making its most aggressive play yet to dominate the fast-growing AI inference market. On Monday, the Sunnyvale-based chipmaker announced that it is now running Kimi K2.6 — a trillion-parameter open-weight model developed by Beijing-based Moonshot AI — for enterprise customers at nearly 1,000 tokens per second, a speed no GPU-based provider has come close to matching.

May 20, 2026

Man stands in massive data center painting landscape of it — Credit: VentureBeat made with OpenAI ChatGPT-Images-2.0

AWS nabs white hot gen AI media creation startup fal, becoming its preferred cloud provider

For large media conglomerates, this managed service approach allows them to experiment with the latest state-of-the-art tools securely, without the risk of exposing proprietary data or intellectual property.

May 20, 2026

Events

Tue, Jul 14, 2026 - Wed, Jul 15, 2026

Hotel Nia, Menlo Park

VB Transform 2026

View event

Data

Mining SQL queries — Credit: Image generated by VentureBeat with FLUX-2-Pro

SQL query logs hold the context AI agents need to stop hallucinating joins

AI agents hallucinate joins when they can't see your query history. DataHub is mining that history to fix it.

Sean Michael Kerner

May 28, 2026

Partner Content

Control within connection: How data sovereignty is rewriting the rules of critical infrastructure

Presented by Equinix

Shane Paladin, Equinix

May 28, 2026

48431D37-24D9-411B-94BB-87FD6B379801 — Credit: DataGrail

DataGrail report finds your vendor may be sending data to AI models you never approved

The data processing agreement (DPA) — the bedrock contract companies use to evaluate how vendors handle personal data — can no longer be trusted at face value. That is the central, and arguably most alarming, conclusion of DataGrail's Privacy and AI Trends Report 2026, released today.

May 27, 2026

DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole

For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same. OpenAI's GPT-5 family, Anthropic's Claude Opus, and Google's Gemini Pro have clustered within a narrow band on Scale AI's SWE-Bench Pro leaderboard, making it nearly impossible for engineering leaders to determine which agent will actually perform best inside their codebases.

May 26, 2026

Security

DataGrail report finds your vendor may be sending data to AI models you never approved

May 27, 2026

The most active attacker in financial services this year didn't steal credentials. It called your help desk — VentureBeat created with Imagen

The attack dominating financial services doesn't steal passwords. It resets MFA and steals the token.

Three 2026 reports, same finding: financial services attackers don't steal passwords anymore. They reset MFA and capture tokens.

Louis Columbus

May 26, 2026

633 malicious npm packages passed Sigstore provenance checks with legitimate signing certificates — VentureBeat made with Imagen

Valid certificates, stolen accounts: how attackers broke npm's last trust signal

633 malicious npm packages cleared every provenance check. Stolen accounts, valid certificates — and no single vendor covers all seven attack surfaces that failed.

Louis Columbus

May 22, 2026

Partner Content

Americans can’t spot a deepfake, and that’s a business crisis, not just a consumer problem

Presented by Veriff

VB Staff

May 21, 2026

Newsroom

Daversa Appoints Maggie Fair to Managing Director

FDA Grants Coredio Breakthrough Designation for AI Platform Bringing Advanced Heart Failure Assessment Beyond the Hospital

Qevlar Introduces AI Agents Unifying SOC and Vulnerability Operations as Exploitation Windows Collapse

TDK Ventures Invests in C2i Semiconductors to Revolutionize AI Data Center Power Delivery

Video

VB Latest

Building a 30% Better AI: The Taste Graph Moat

VB in Conversation

Trust is the real bottleneck in agentic AI

Cisco’s Michael Dickman explains why enforceable trust — not just better models or more compute — is becoming the critical requirement for agentic AI in production.

VB in Conversation

Securing AI at scale starts inside the code

VB talks with Cisco’s Anthony Grieco about why AI-generated code is breaking traditional security models, and forcing enforcement into the development loop.

Technology

Greeks staring up at Mount Olympus with computer displaying 4.8 — Credit: VentureBeat made with ChatGPT-Images-2.0

Anthropic's Claude Opus 4.8 is here with 3X cheaper fast mode and near-Mythos level alignment

Opus 4.8 shows a growing tendency to reason explicitly about how its outputs will be graded, including in environments where it wasn't told it was being evaluated.

May 28, 2026

Are designers the new SWEs? Figma Make's new two-way GitHub integration turns designs into live, production code — with built-in governance

From an enterprise governance perspective, this means visual AI edits are subject to the exact same continuous integration pipelines, security checks, and code reviews as any traditional engineering commit.

May 28, 2026

M3 flying saucer over Shanghai — Credit: VentureBeat made with Google Gemini 3 Pro Image

MiniMax teases upcoming M3 model with new sparse attention mechanism and 15.6X long-context response speed boost

It directly solves the exact bottleneck that normally makes AI chatbots freeze or stutter when handling massive amounts of information.

May 27, 2026

DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole

May 26, 2026

Why prompt debt, retrieval debt, and evaluation debt are quietly reshaping enterprise AI risk

Over the past two decades, technical debt meant outdated architecture, messy code, and poorly maintained documentation. That definition is no longer sufficient in the AI era, where failure modes are more subtle and often non-linear. AI systems are introducing new layers of technical debt that live across prompts, models, and data dependencies — making these layers less visible, harder to measure, and often more dangerous than traditional debt.

Vikram Venkat

May 25, 2026

D&B's database of 642 million businesses was built for humans, not AI agents. So they rebuilt it.

AI agents were failing on business identity. D&B rebuilt its 642-million-company database from scratch — and the lessons apply to every enterprise.

Sean Michael Kerner

May 22, 2026

Capybara with glasses typing on laptop while piloting mecha — Credit: VentureBeat made with OpenAI ChatGPT-2.0-Images

Alibaba's proprietary Qwen3.7-Max can run for 35 hours autonomously and supports external harnesses like Anthropic's Claude Code

On the Apex Math Reasoning benchmark, Qwen3.7-Max scored 44.5, eclipsing Claude Opus-4.6 Max's score of 34.5 and DeepSeek V4-Pro Max's 38.3.

May 21, 2026

lightweight llm memory adapter — Image credit: VentureBeat with Nano Banana

A 0.12% parameter add-on gives AI agents the working memory RAG can't

A new memory module lets AI agents retain context across long interactions — adding just 0.12% of model parameters with no architectural changes.

May 21, 2026

Enterprise AI agents keep failing because they forget what they learned

Most enterprise AI agents never make it out of the pilot phase. The problem isn't the model — it's that agents forget what they learned.

Taryn Plumb

May 21, 2026

MFA verifies who logged in. It has no idea what they do next.

MFA authenticates at the front door and never looks again. That's the gap attackers are exploiting — and most enterprises haven't closed it.

Louis Columbus

May 21, 2026

Nuneybits Vector art of robot agents in society d92ca346-3082-41e3-a601-5f3b18018036 — Credit: VentureBeat made with Midjourney

Kore.ai launches Artemis AI agent platform, takes on Salesforce and ServiceNow

The platform arrives at a moment when every major technology vendor — from Microsoft and Salesforce to Google and ServiceNow — is racing to become the default infrastructure for enterprise AI agents. Kore.ai's answer to that crowded field is a bet on neutrality, a proprietary intermediary language for defining agents, and a philosophy that AI, not human developers, should do most of the heavy lifting.