SkyworkAI / DeepResearchAgent
- понедельник, 15 сентября 2025 г. в 00:00:02
DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coordinate multiple specialized lower-level agents, enabling automated task decomposition and efficient execution across diverse and complex domains.
image.png DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coordinate multiple specialized lower-level agents, enabling automated task decomposition and efficient execution across diverse and complex domains.
🌐 Check out our interactive website: https://skyworkai.github.io/DeepResearchAgent/ - Explore the architecture, view experiments, and learn more about our research!
The system adopts a two-layer structure:
Deep Analyzer
Deep Researcher
Browser Use
MCP Manager Agent
General Tool Calling Agent
Image and Video Examples:
# poetry install environment
conda create -n dra python=3.11
conda activate dra
make install
# (Optional) You can also use requirements.txt
conda create -n dra python=3.11
conda activate dra
make install-requirements
# playwright install if needed
pip install playwright
playwright install chromium --with-deps --no-shell
Please refer to the .env.template
file and create a .env
file in the root directory of the project. This file is used to configure API keys and other environment variables.
Refer to the following instructions to obtain the necessary google gemini-2.5-pro API key and set it in the .env
file:
brew install --cask google-cloud-sdk
gcloud init
gcloud auth application-default login
A simple example to demonstrate the usage of the DeepResearchAgent framework.
python main.py
A simple example to demonstrate the usage of a single agent, such as a general tool calling agent.
python examples/run_general.py
# Download GAIA
mkdir data && cd data
git clone https://huggingface.co/datasets/gaia-benchmark/GAIA
# Run
python examples/run_gaia.py
We evaluated our agent on both GAIA validation and test sets, achieving state-of-the-art performance. Our system demonstrates superior performance across all difficulty levels.
With the integration of the Computer Use and MCP Manager Agent, which now enables pixel-level control of the browser, our system demonstrates remarkable evolutionary capabilities. The agents can dynamically acquire and enhance their abilities through learning and adaptation, leading to significantly improved performance. The latest results show:
Our framework now supports:
Update your config:
model_id = "qwen2.5-7b-instruct"
If problems occur, reinstall:
pip install "browser-use[memory]"==0.1.48
pip install playwright
playwright install chromium --with-deps --no-shell
Function-calling is now supported natively by GPT-4.1 / Gemini 2.5 Pro. Claude-3.7-Sonnet is also recommended.
We provide huggingface as a shortcut to the local model. Also provide vllm as a way to start services so that parallel acceleration can be provided.
nohup bash -c 'CUDA_VISIBLE_DEVICES=0,1 python -m vllm.entrypoints.openai.api_server \
--model /input0/Qwen3-32B \
--served-model-name Qwen \
--host 0.0.0.0 \
--port 8000 \
--max-num-seqs 16 \
--enable-auto-tool-choice \
--tool-call-parser hermes \
--tensor_parallel_size 2' > vllm_qwen.log 2>&1 &
Update .env
:
QWEN_API_BASE=http://localhost:8000/v1
QWEN_API_KEY="abc"
python main.py
Example command:
Use deep_researcher_agent to search the latest papers on the topic of 'AI Agent' and then summarize it.
DeepResearchAgent is primarily inspired by the architecture of smolagents. The following improvements have been made:
We would like to express our gratitude to the following open source projects, which have greatly contributed to the development of this work:
We sincerely appreciate the efforts of all contributors and maintainers of these projects for their commitment to advancing AI technologies and making them available to the wider community.
Contributions and suggestions are welcome! Feel free to open issues or submit pull requests.
@misc{zhang2025agentorchestrahierarchicalmultiagentframework,
title={AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving},
author={Wentao Zhang, Liang Zeng, Yuzhen Xiao, Yongcong Li, Ce Cui, Yilei Zhao, Rui Hu, Yang Liu, Yahui Zhou, Bo An},
year={2025},
eprint={2506.12508},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2506.12508},
}
如果你更习惯阅读中文说明文档,请查阅 README_CN.md。