HKUDS / AutoAgent
- Wednesday, 10 September 2025, 00:00:03
"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"
Welcome to AutoAgent! AutoAgent is a fully automated, highly self-developing framework that lets users create and deploy LLM agents through natural language alone.
Top Performer on the GAIA Benchmark
AutoAgent delivers performance comparable to many Deep Research agents.
Agent and Workflow Creation with Ease
AutoAgent leverages natural language to effortlessly build ready-to-use tools, agents and workflows - no coding required.
Agentic-RAG with Native Self-Managing Vector Database
AutoAgent, equipped with a native self-managing vector database, outperforms industry-leading solutions like LangChain.
Universal LLM Support
AutoAgent seamlessly integrates with a wide range of LLMs (e.g., OpenAI, Anthropic, Deepseek, vLLM, Grok, Huggingface ...)
Flexible Interaction
Benefit from support for both function-calling and ReAct interaction modes.
Dynamic, Extensible, Lightweight
AutoAgent is your personal AI assistant, designed to be dynamic, extensible, customizable, and lightweight.
Unlock the Future of LLM Agents. Try AutoAgent Now!
AutoAgent has an out-of-the-box multi-agent system, which you can use by choosing user mode on the start page. This multi-agent system is a general AI assistant with the same functionality as OpenAI's Deep Research and comparable performance on the GAIA benchmark.
Deep Research (aka User Mode)
The most distinctive feature of AutoAgent is its natural language customization capability. Unlike other agent frameworks, AutoAgent allows you to create tools, agents, and workflows using natural language alone. Simply choose agent editor or workflow editor mode to start your journey of building agents through conversations.
You can use agent editor as shown in the following figure.
1. Input what kind of agent you want to create.
2. Automated agent profiling.
3. Output the agent profiles.
4. Create the desired tools.
5. Input what you want to complete with the agent. (Optional)
6. Create the desired agent(s) and go to the next step.
You can also create agent workflows from a natural language description with the workflow editor mode, as shown in the following figure. (Tip: this mode does not currently support tool creation.)
1. Input what kind of workflow you want to create.
2. Automated workflow profiling.
3. Output the workflow profiles.
4. Input what you want to complete with the workflow. (Optional)
5. Create the desired workflow(s) and go to the next step.
git clone https://github.com/HKUDS/AutoAgent.git
cd AutoAgent
pip install -e .
We use Docker to containerize the agent-interactive environment, so please install Docker first. You don't need to pull the pre-built image manually, because Auto-Deep-Research automatically pulls the pre-built image that matches your machine's architecture.
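To picture what "based on your machine's architecture" can mean in practice, here is a minimal Python sketch that maps the local machine type onto the two common Docker platform names. The alias table and the function name `docker_arch` are illustrative assumptions; the README does not show how the project itself selects an image, nor which image names or tags it uses.

```python
import platform

# Assumption: common machine names mapped to Docker platform names.
# The project's real image names and tags are not shown in this README.
ARCH_ALIASES = {
    "x86_64": "amd64",
    "amd64": "amd64",
    "aarch64": "arm64",
    "arm64": "arm64",
}


def docker_arch(machine=None):
    """Return 'amd64' or 'arm64' for a machine name (defaults to this host)."""
    name = (machine or platform.machine()).lower()
    if name not in ARCH_ALIASES:
        raise ValueError(f"unrecognized architecture: {name!r}")
    return ARCH_ALIASES[name]


if __name__ == "__main__":
    try:
        print("Docker architecture:", docker_arch())
    except ValueError as exc:
        print(exc)
```

A check like this is only useful for debugging a failed pull; in normal use the image selection is automatic.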
Create an environment variable file modeled on .env.template and set the API keys for the LLMs you want to use. Not every LLM API key is required; set only the ones you need.
# Required Github Tokens of your own
GITHUB_AI_TOKEN=
# Optional API Keys
OPENAI_API_KEY=
DEEPSEEK_API_KEY=
ANTHROPIC_API_KEY=
GEMINI_API_KEY=
HUGGINGFACE_API_KEY=
GROQ_API_KEY=
XAI_API_KEY=
[News] We have added an easier-to-use command for starting CLI mode and fixed the bugs with different LLM providers that were reported in issues. Follow the steps below to start CLI mode with different LLM providers using much less configuration.
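As a quick sanity check on the .env file described above, the following Python sketch reports which keys are filled in. The key names come from the listing above; the parsing rules (plain KEY=VALUE lines, `#` comments, no quoting) and the helper names are simplifying assumptions, not AutoAgent's own loader.

```python
from pathlib import Path

# Key names copied from the .env listing; everything else is illustrative.
REQUIRED_KEYS = ["GITHUB_AI_TOKEN"]
OPTIONAL_KEYS = [
    "OPENAI_API_KEY", "DEEPSEEK_API_KEY", "ANTHROPIC_API_KEY",
    "GEMINI_API_KEY", "HUGGINGFACE_API_KEY", "GROQ_API_KEY", "XAI_API_KEY",
]


def parse_env_text(text):
    """Parse KEY=VALUE lines, skipping blanks and '#' comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def summarize_keys(values):
    """Return (missing required keys, optional keys that have a value)."""
    missing_required = [k for k in REQUIRED_KEYS if not values.get(k)]
    configured_optional = [k for k in OPTIONAL_KEYS if values.get(k)]
    return missing_required, configured_optional


if __name__ == "__main__":
    env_path = Path(".env")
    text = env_path.read_text() if env_path.exists() else ""
    missing, configured = summarize_keys(parse_env_text(text))
    print("Missing required keys:", missing)
    print("Configured optional keys:", configured)
```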
You can run auto main to start the full AutoAgent, including user mode, agent editor, and workflow editor. You can also run auto deep-research to start the more lightweight user mode, just like the Auto-Deep-Research project. Some configuration options for this command are shown below.
- --container_name: Name of the Docker container (default: 'deepresearch')
- --port: Port for the container (default: 12346)
- COMPLETION_MODEL: The LLM to use; follow LiteLLM's model naming (default: claude-3-5-sonnet-20241022)
- DEBUG: Enable debug mode for detailed logs (default: False)
- API_BASE_URL: The base URL for the LLM provider (default: None)
- FN_CALL: Enable function calling (default: None). Most of the time you can ignore this option, because the default is already set based on the model name.
- git_clone: Clone the AutoAgent repository to the local environment (only supported with the auto main command, default: True)
- test_pull_name: The name of the test pull (only supported with the auto main command, default: 'autoagent_mirror')
In agent editor and workflow editor mode, AutoAgent clones a mirror of its own repository into the local agent-interactive environment so that it can update itself automatically, for example by creating new tools, agents, and workflows. To use these modes, set git_clone to True and set test_pull_name to 'autoagent_mirror' or another branch.
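The options above mix command-line flags with environment variables, which can be easy to confuse. The Python sketch below assembles both for an auto main launch without running anything. The helper names `build_launch` and `as_shell_line` are hypothetical, and the "True"/"False" string format for DEBUG is an assumption; only the flag and variable names come from the list above.

```python
import shlex


def build_launch(model=None, api_base_url=None, debug=None,
                 container_name=None, port=None):
    """Return (env_overrides, argv) for an `auto main` launch."""
    env = {}
    if model is not None:
        env["COMPLETION_MODEL"] = model
    if api_base_url is not None:
        env["API_BASE_URL"] = api_base_url
    if debug is not None:
        # Assumption: DEBUG is passed as the string "True" or "False".
        env["DEBUG"] = "True" if debug else "False"
    argv = ["auto", "main"]
    if container_name is not None:
        argv += ["--container_name", container_name]
    if port is not None:
        argv += ["--port", str(port)]
    return env, argv


def as_shell_line(env, argv):
    """Render the launch as a single shell line: VAR=value ... command."""
    prefix = " ".join(f"{k}={shlex.quote(v)}" for k, v in env.items())
    command = " ".join(shlex.quote(a) for a in argv)
    return f"{prefix} {command}".strip()


if __name__ == "__main__":
    env, argv = build_launch(model="gpt-4o", port=12346)
    print(as_shell_line(env, argv))
```

To launch for real, the returned overrides could be merged into the current environment and passed to `subprocess.run` together with `argv`.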
The following shows how to use the full AutoAgent with the auto main command and different LLM providers. If you want to use the auto deep-research command, refer to the Auto-Deep-Research project for more details.
Set ANTHROPIC_API_KEY in the .env file.
ANTHROPIC_API_KEY=your_anthropic_api_key
auto main # default model is claude-3-5-sonnet-20241022
Set OPENAI_API_KEY in the .env file.
OPENAI_API_KEY=your_openai_api_key
COMPLETION_MODEL=gpt-4o auto main
Set MISTRAL_API_KEY in the .env file.
MISTRAL_API_KEY=your_mistral_api_key
COMPLETION_MODEL=mistral/mistral-large-2407 auto main
Set GEMINI_API_KEY in the .env file.
GEMINI_API_KEY=your_gemini_api_key
COMPLETION_MODEL=gemini/gemini-2.0-flash auto main
Set HUGGINGFACE_API_KEY in the .env file.
HUGGINGFACE_API_KEY=your_huggingface_api_key
COMPLETION_MODEL=huggingface/meta-llama/Llama-3.3-70B-Instruct auto main
Set GROQ_API_KEY in the .env file.
GROQ_API_KEY=your_groq_api_key
COMPLETION_MODEL=groq/deepseek-r1-distill-llama-70b auto main
For OpenAI-compatible endpoints, set OPENAI_API_KEY in the .env file.
OPENAI_API_KEY=your_api_key_for_openai_compatible_endpoints
COMPLETION_MODEL=openai/grok-2-latest API_BASE_URL=https://api.x.ai/v1 auto main
For now, we recommend using OpenRouter as the LLM provider for DeepSeek-R1, because the official DeepSeek-R1 API cannot be used efficiently.
Set OPENROUTER_API_KEY in the .env file.
OPENROUTER_API_KEY=your_openrouter_api_key
COMPLETION_MODEL=openrouter/deepseek/deepseek-r1 auto main
Set DEEPSEEK_API_KEY in the .env file.
DEEPSEEK_API_KEY=your_deepseek_api_key
COMPLETION_MODEL=deepseek/deepseek-chat auto main
After CLI mode starts, you can see the start page of AutoAgent.
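The provider examples follow a pattern: the prefix of COMPLETION_MODEL indicates which API key must be set. The Python sketch below encodes that pattern as a lookup. The table is read off the examples in this README, not taken from AutoAgent's source code, and the function name `required_key` is hypothetical.

```python
from typing import Optional

# Prefix -> key pairs as they appear in the README examples (assumption:
# the same pairing holds for other models from the same provider).
KEY_BY_PREFIX = {
    "mistral": "MISTRAL_API_KEY",
    "gemini": "GEMINI_API_KEY",
    "huggingface": "HUGGINGFACE_API_KEY",
    "groq": "GROQ_API_KEY",
    "openai": "OPENAI_API_KEY",  # used for OpenAI-compatible endpoints
    "openrouter": "OPENROUTER_API_KEY",
    "deepseek": "DEEPSEEK_API_KEY",
}


def required_key(model: str) -> Optional[str]:
    """Guess which API key a COMPLETION_MODEL value needs, or None."""
    prefix, sep, _ = model.partition("/")
    if sep:
        return KEY_BY_PREFIX.get(prefix)
    # Unprefixed names in the examples: gpt-4o and claude-3-5-sonnet-...
    if model.startswith("gpt-"):
        return "OPENAI_API_KEY"
    if model.startswith("claude-"):
        return "ANTHROPIC_API_KEY"
    return None


if __name__ == "__main__":
    for name in ["gpt-4o", "gemini/gemini-2.0-flash",
                 "openrouter/deepseek/deepseek-r1"]:
        print(name, "->", required_key(name))
```

A lookup like this can be paired with a check of the .env file to catch a missing key before the container starts.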
You can import the browser cookies to the browser environment to let the agent better access some specific websites. For more details, please refer to the cookies folder.
If you want to create tools from third-party tool platforms such as RapidAPI, subscribe to the tools on the platform and add your own API keys by running process_tool_docs.py.
python process_tool_docs.py
More features coming soon! Web GUI interface under development.
AutoAgent is continuously evolving! Here's what's coming:
Have ideas or suggestions? Feel free to open an issue! Stay tuned for more exciting updates!
For the GAIA benchmark, you can run the following command to run the inference.
cd path/to/AutoAgent && sh evaluation/gaia/scripts/run_infer.sh
For the evaluation, you can run the following command.
cd path/to/AutoAgent && python evaluation/gaia/get_score.py
For the Agentic-RAG task, you can run the following command to run the inference.
Step 1. Go to this page and download the data. Save it to your data path.
Step 2. Run the following command to run the inference.
cd path/to/AutoAgent && sh evaluation/multihoprag/scripts/run_rag.sh
Step 3. The result will be saved in evaluation/multihoprag/result.json.
More detailed documentation is coming soon, and we will publish it on the Documentation page.
We want to build a community for AutoAgent, and we welcome everyone to join us. You can join our community by:
Rome wasn't built in a day. AutoAgent stands on the shoulders of giants, and we are deeply grateful for the outstanding work that came before us. Our framework architecture draws inspiration from OpenAI Swarm, while our user mode's three-agent design benefits from Magentic-one's insights. We've also learned from OpenHands for documentation structure and many other excellent projects for agent-environment interaction design, among others. We express our sincere gratitude and respect to all these pioneering works that have been instrumental in shaping AutoAgent.
@misc{AutoAgent,
title={{AutoAgent: A Fully-Automated and Zero-Code Framework for LLM Agents}},
author={Jiabin Tang and Tianyu Fan and Chao Huang},
year={2025},
eprint={2502.05957},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2502.05957},
}