Ben Dickson

Guest Author

Ben Dickson is a software engineer and the founder of TechTalks, a blog that explores the ways technology is solving and creating problems. He writes about technology, business and politics. Follow him on Twitter: @BenDee983.

recursive multi-agent system — Image credit: VentureBeat with ChatGPT

How RecursiveMAS speeds up multi-agent inference by 2.4x and reduces token usage by 75%

A new framework from UIUC and Stanford lets AI agents share embeddings instead of text — slashing token usage and cutting training costs by more than half.

Ben Dickson

May 15, 2026

LLM data corruption — Image credit: VentureBeat with Nano Banana

Frontier AI models don't just delete document content — they rewrite it, and the errors are nearly impossible to catch

Weaker AI models delete document content when they fail. Frontier models rewrite it — subtly and silently, making errors far harder for human reviewers to catch.

Ben Dickson

May 13, 2026

RL conductor — Image credit: VentureBeat with ChatGPT

How Sakana trained a 7B model to orchestrate GPT, Claude and Gemini LLMs

A 7B model that learns to route tasks across GPT-5, Claude Sonnet 4, and Gemini 2.5 Pro — using RL instead of hardcoded workflows.

Ben Dickson

May 7, 2026

LLM tool-use abstention — Image credit: VentureBeat with Nano Banana

Alibaba's Metis agent cuts redundant AI tool calls from 98% to 2% — and gets more accurate doing it

Most AI agents call tools even when they don't need to. Alibaba's new RL framework teaches them when to stop — and accuracy goes up, not down.

Ben Dickson

April 30, 2026

How to build custom reasoning agents with a fraction of the compute

The technique, called Reinforcement Learning with Verifiable Rewards with Self-Distillation (RLSD), combines the reliable performance tracking of reinforcement learning with the granular feedback of self-distillation.

Ben Dickson

April 28, 2026

automated ai r and d — Image credit: VentureBeat with ChatGPT

New AI framework autonomously optimizes training data, architectures and algorithms — outperforming human baselines

A self-improving AI framework beat human-designed baselines across data, architecture, and reinforcement learning — with no manual intervention.

Ben Dickson

April 27, 2026

single vs multi-agent systems — Image credit: VentureBeat with ChatGPT

Are you paying an AI ‘swarm tax’? Why single agents often beat complex systems

New Stanford research challenges the assumption that more agents means better AI — and introduces a simple compute-budget fix that changes the calculus.

Ben Dickson

April 22, 2026

LLM multi-response sampling — Image credit: VentureBeat with Nano Bannana Pro

Train-to-Test scaling explained: How to optimize your end-to-end AI compute budget for inference

AI reasoning does not necessarily require spending huge amounts on frontier models. Instead, smaller models can yield stronger performance on complex tasks while keeping per-query inference costs manageable

Ben Dickson

April 17, 2026

Meta researchers introduce 'hyperagents' to unlock self-improving AI for non-coding tasks

Creating self-improving AI systems is an important step toward deploying agents in dynamic environments, especially in enterprise production environments, where tasks are not always predictable, nor consistent.

Ben Dickson

April 15, 2026

Self-evolving agents — Image credit: VentureBeat with Nano Banana

New framework lets AI agents rewrite their own skills without retraining the underlying model

A multi-university research team built a framework that teaches agents to fix their own failure modes — no human intervention required.

Ben Dickson

April 8, 2026

agentic code reasoning — Image credit: VentureBeat with Nano Banana

Meta's new structured prompting technique makes LLMs significantly better at code review — boosting accuracy to 93% in some cases

This technique can be used out-of-the-box, requiring no model training or special packaging. It is code-execution free, which means you do not need to add additional tools to your LLM environment.

Ben Dickson

April 1, 2026

LLM attention optimization — Image credit: VentureBeat with ChatGPT

IndexCache, a new sparse attention optimizer, delivers 1.82x faster inference on long-context AI models

The technique works by detecting that adjacent model layers repeat the same token selections — then caching the result instead of recalculating.

Ben Dickson

March 27, 2026