You can check the Wayback Machine and it's the same definition back over 5 years, which makes sense, or the wayback snapshot is calling the current AP

Agents: An optimistic painting - Adam Browne

submited by

Style Pass

2025-07-30 20:00:04

You can check the Wayback Machine and it's the same definition back over 5 years, which makes sense, or the wayback snapshot is calling the current API.

Most of today's "agents" are semi-autonomous apps that orchestrate LLMs, third-party APIs, and SDKs to achieve a narrowly defined goal. They might edit an IDE’s buffer, issue DOM commands in a browser, or drive other software. Given the same architecture, it could extend into the physical world.

There are parallels with agents and with RTS (Real-time strategy) games; one such example is StarCraft, which is one of the most famous:

In StarCraft, the best players in the world have very high APM (actions per minute) and the lowest action latency. With a ranking chart:

APM has the highest correlation with win-rate of any variable at p=0.65. We also see that the action latency of GrandMasters is the lowest of all the other ranks.

We know from StarCraft the more decisively correct actions you perform per minute will make you better. But there's something we must acknowledge about current Agents: they're built on LLMs, which are constantly evolving. This is both good and bad. On one hand, you have to update your system instructions and get used to the new changes. You also need to scrutinize the model itself. The AI labs already do this with their evaluations and testing, but the user (director) needs to evaluate it for themselves.

Agents: An optimistic painting - Adam Browne

Leave a Comment

Related Posts

Recent Posts

Kernel 6.16: Fine-tuning the details

Rust, Python, and TypeScript: the new trifecta · baby steps

Speed up suspend/resume for FreeBSD

ITALY ARMED TO THE TEETH: MY TOP NINE POLIZIOTTESCHI

Computer Science > Artificial Intelligence

Anthony Enzor-DeMeo Promoted to General Manager, Firefox

Search code, repositories, users, issues, pull requests...

The Private Equity Value Creation Report: 2025

How to Live Without a Smartphone in 2024

U.S. Senators Introduce New Pirate Site Blocking Bill: Block BEARD

Inside a Real Clickfix Attack: How This Social Engineering Hack Unfolds

Apache Flink 2.1.0: Ushers in a New Era of Unified Real-Time Data + AI with Comprehensive Upgrades

How Judoscale's Utilization-Based Autoscaling Works

Djangonaut Space is looking for contributors to be mentors

Search code, repositories, users, issues, pull requests...

Why Today’s Young Adults Are 4 Times More Likely To Have Rectal Cancer

Physics > Physics and Society

Hijacking multi-agent systems in your PajaMAS

Subpixel Rendering For Impossibly Small Terminal Text

Global Constraint Catalog