A new framework called A-LEMS redefines how energy consumption is measured for agentic AI systems, shifting the metric from energy per inference to Energy per Successful Goal (EpG) so that multi-step orchestration, retries, and failures are counted. In tests across eight task families, agentic workflows consumed 888.1 joules per successful goal versus 205.3 joules for linear baselines, roughly 4.3 times as much, with orchestration structure rather than raw compute driving the overhead.
Why it matters: As AI systems become more complex and agentic, per-inference energy benchmarks understate the true operational cost of completing a goal. A goal-level metric such as EpG gives a more accurate basis for sustainability assessment and cost forecasting in production AI systems.
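
To make the EpG idea concrete, here is a minimal sketch of how such a metric could be computed. It assumes one plausible reading of the definition: the total energy spent across all attempts, including failed ones and retries, divided by the number of goals that succeeded. The summary above does not give A-LEMS's exact formula, so the `Attempt` class, its field names, and the sample numbers are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Attempt:
    """One attempt at a goal. Field names are hypothetical, not from A-LEMS."""
    energy_joules: float  # energy measured over the whole attempt, retries included
    succeeded: bool


def energy_per_successful_goal(attempts):
    """Assumed EpG: total energy over all attempts / number of successful goals."""
    total = sum(a.energy_joules for a in attempts)
    successes = sum(1 for a in attempts if a.succeeded)
    if successes == 0:
        # No goal was completed, so the cost per success is unbounded.
        return float("inf")
    return total / successes


# Made-up illustrative numbers, not data from the A-LEMS tests.
runs = [Attempt(300.0, True), Attempt(250.0, False), Attempt(350.0, True)]
epg = energy_per_successful_goal(runs)  # (300 + 250 + 350) / 2 successes
print(epg)
```

Under this reading, the energy of the failed attempt is still charged to the two goals that succeeded, which is how a goal-level figure can exceed a per-inference figure even when raw compute per step is unchanged.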