DeepAgent: A General Reasoning Agent with Scalable Toolsets (AI updates on arXiv.org)

February 7, 2026 | Tech Jacks Solutions

arXiv:2510.21618v3 Announce Type: replace
Abstract: Large reasoning models have demonstrated strong problem-solving abilities, yet real-world tasks often require external tools and long-horizon interactions. Existing agent frameworks typically follow predefined workflows, which limit autonomous and global task completion. In this paper, we introduce DeepAgent, an end-to-end deep reasoning agent that performs autonomous thinking, tool discovery, and action execution within a single, coherent reasoning process. To manage long-horizon interactions, we introduce an autonomous memory folding mechanism that compresses past interactions into structured episodic, working, and tool memories, reducing error accumulation while preserving critical information. To teach general-purpose tool use efficiently and stably, we develop an end-to-end reinforcement learning strategy, namely ToolPO, that leverages LLM-simulated APIs and applies tool-call advantage attribution to assign fine-grained credit to the tool invocation tokens. Extensive experiments on eight benchmarks, including general tool-use tasks (ToolBench, API-Bank, TMDB, Spotify, ToolHop) and downstream applications (ALFWorld, WebShop, GAIA, HLE), demonstrate that DeepAgent consistently outperforms baselines across both labeled-tool and open-set tool retrieval scenarios. The code and demo are available at https://github.com/RUC-NLPIR/DeepAgent.
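
To make the phrase "tool-call advantage attribution" more concrete, here is a minimal Python sketch of one way such fine-grained credit could be assigned. It is not taken from the DeepAgent repository: the `Rollout` fields, the two reward signals, the group mean-centering, and the `tool_weight` parameter are all assumptions made for illustration. The idea shown is that every token receives a task-level advantage, while only tokens inside a tool invocation receive an extra tool-level term.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Rollout:
    """One sampled trajectory. All field names here are hypothetical."""
    tokens: List[str]
    tool_call_mask: List[bool]  # True where the token is part of a tool invocation
    task_reward: float          # assumed: 1.0 if the final task succeeded
    tool_reward: float          # assumed: quality score of the tool calls


def centered(values: List[float]) -> List[float]:
    """Subtract the group mean so each advantage is relative to sibling rollouts."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def token_advantages(group: List[Rollout], tool_weight: float = 1.0) -> List[List[float]]:
    """Return one advantage per token for every rollout in the group."""
    task_adv = centered([r.task_reward for r in group])
    tool_adv = centered([r.tool_reward for r in group])
    result = []
    for rollout, a_task, a_tool in zip(group, task_adv, tool_adv):
        # Task-level credit goes to every token; tool-level credit is
        # added only where the mask marks a tool-invocation token.
        per_token = [
            a_task + (tool_weight * a_tool if is_tool else 0.0)
            for is_tool in rollout.tool_call_mask
        ]
        result.append(per_token)
    return result


# Two toy rollouts: the first succeeds with a good tool call, the second fails.
mask = [False, True, True, True, False]
group = [
    Rollout(["think", "<call>", "search", "</call>", "answer"], mask, 1.0, 1.0),
    Rollout(["think", "<call>", "bad", "</call>", "answer"], mask, 0.0, 0.0),
]
adv = token_advantages(group)
```

In this toy group the tool-invocation tokens of the successful rollout end up with a larger advantage than its reasoning tokens, which is the kind of targeted credit the abstract describes. The actual ToolPO objective may differ in how rewards are defined and normalized.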
