Google releases ReasoningBank, enabling intelligent agents to extract reasoning strategies from success and failure experiences


CryptoWorld News reports, citing Beating Monitoring, that Google Research has released ReasoningBank, an agent memory framework that lets large-model-driven agents keep learning after deployment. The core idea is to distill both successful and failed task experiences into general reasoning strategies stored in a memory bank; when a similar task comes up later, the agent first retrieves the relevant strategies and then executes. The paper was published at ICLR, and the code has been open-sourced on GitHub.
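The retrieve-then-execute loop described above can be sketched as follows. This is a minimal illustration, not the released code: the `MemoryItem` fields mirror the three-part schema the article describes, but the word-overlap retrieval is a hypothetical stand-in for whatever similarity search the real system uses.

```python
from dataclasses import dataclass

@dataclass
class MemoryItem:
    # Structured three-part schema described in the article.
    title: str
    description: str
    content: str

def retrieve(bank, query, k=2):
    """Rank memory items by naive word overlap with the task query.
    (A real system would likely use embedding similarity; this is a stand-in.)"""
    q = set(query.lower().split())
    scored = sorted(
        bank,
        key=lambda m: len(q & set((m.title + " " + m.description).lower().split())),
        reverse=True,
    )
    return scored[:k]

# Hypothetical memory bank contents for a web-navigation agent.
bank = [
    MemoryItem("Verify page state before pagination",
               "web navigation pagination pitfall",
               "Check the page identifier before clicking load-more to avoid infinite scrolling."),
    MemoryItem("Prefer site search over manual browsing",
               "web navigation search strategy",
               "Use the site's search box when looking for a specific item."),
]

# Before executing a new task, the agent retrieves relevant strategies first.
hits = retrieve(bank, "navigate pagination on a web page", k=1)
```

The retrieved items would then be injected into the agent's prompt before it acts.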

Two earlier mainstream approaches each had drawbacks: Synapse records complete action trajectories, which are too fine-grained to transfer, while Agent Workflow Memory extracts workflows only from successful cases. ReasoningBank makes two changes. First, it stores "reasoning patterns" instead of "action sequences," with each memory item structured as three fields: a title, a description, and content. Second, failed trajectories are also used for learning: after execution, another large model judges the trajectory, and failure experiences are distilled into pitfall-avoidance rules, for example upgrading from "click the Load More button when you see it" to "first verify the current page identifier to avoid getting stuck in infinite scrolling, then click Load More." The paper also proposes Memory-aware Test-time Scaling (MaTTS), which spends more compute at inference time to try a task repeatedly and stores the exploration process in the memory bank.
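The judge-then-distill step can be sketched like this. Everything here is illustrative: the `judge` function is a placeholder for the LLM self-judgment call the article mentions, and the trajectory dictionary fields (`task`, `goal_reached`, `lesson`) are assumptions, not the framework's actual data format.

```python
from dataclasses import dataclass

@dataclass
class MemoryItem:
    title: str
    description: str
    content: str

def judge(trajectory):
    """Stand-in for the LLM self-judgment step: the real framework asks a
    model whether the trajectory achieved the task; here we just read a flag."""
    return trajectory.get("goal_reached", False)

def distill(trajectory):
    """Turn a judged trajectory into a reasoning-pattern memory item.
    Successes become strategies; failures become pitfall-avoidance rules."""
    if judge(trajectory):
        return MemoryItem(
            title=f"Strategy: {trajectory['task']}",
            description="distilled from a successful trajectory",
            content=trajectory["lesson"],
        )
    return MemoryItem(
        title=f"Pitfall: {trajectory['task']}",
        description="distilled from a failed trajectory",
        content="Avoid: " + trajectory["lesson"],
    )

# A failed run still yields a usable memory item, per the article's example.
failed = {"task": "load more results", "goal_reached": False,
          "lesson": "clicking load-more without checking the page id loops forever"}
item = distill(failed)
```

The key design point the article highlights is that failures are not discarded; they produce negative rules that steer future runs away from the same trap.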

MaTTS comes in two variants. Parallel scaling runs multiple different trajectories for the same task and extracts more robust strategies through self-contrast; sequential scaling refines a single trajectory repeatedly, recording the intermediate reasoning into the memory bank. On the WebArena browser benchmark and the SWE-Bench-Verified coding benchmark, with a ReAct agent built on Gemini 2.5 Flash, ReasoningBank achieves an 8.3% higher success rate on WebArena and a 4.6% higher success rate on SWE-Bench-Verified than a memoryless baseline, while taking about 3 fewer steps per task on average. Adding MaTTS parallel scaling (k=5) raises the WebArena success rate by another 3 percentage points and cuts another 0.4 steps.
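The parallel variant can be sketched as a best-of-k loop. This is a toy sketch under stated assumptions: `rollout` is a hypothetical seeded stand-in for an agent trajectory, and majority voting here stands in for the paper's self-contrast step, which would compare trajectories with an LLM and distill all of them into memory.

```python
import random

def rollout(task, seed):
    """Hypothetical single agent trajectory; returns (answer, step_count).
    Seeded so this sketch is reproducible."""
    rng = random.Random(seed)
    steps = rng.randint(3, 10)
    answer = "A" if rng.random() < 0.7 else "B"
    return answer, steps

def matts_parallel(task, k=5):
    """Memory-aware test-time scaling, parallel variant: run k trajectories
    for the same task, then contrast them (majority vote as a stand-in) to
    pick an answer; all k trajectories would feed the memory bank."""
    trajectories = [rollout(task, seed) for seed in range(k)]
    answers = [a for a, _ in trajectories]
    best = max(set(answers), key=answers.count)
    return best, trajectories

answer, trajs = matts_parallel("demo task", k=5)
```

The extra compute buys diversity: strategies that hold across several independent trajectories are more likely to transfer than ones observed only once.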
