alibaba / page-agent
- Π²ΠΎΡΠΊΡΠ΅ΡΠ΅Π½ΡΠ΅, 8 ΠΌΠ°ΡΡΠ° 2026β―Π³. Π² 00:00:04
JavaScript in-page GUI agent. Control web interfaces with natural language.
The GUI Agent Living in Your Webpage. Control web interfaces with natural language.
π English | δΈζ
π π Demo | π Documentation | π’ Join HN Discussion
browser extension / python / headless browser.Fastest way to try PageAgent with our free Demo LLM:
<script src="{URL}" crossorigin="true"></script>| Mirrors | URL |
|---|---|
| Global | https://cdn.jsdelivr.net/npm/page-agent@1.5.2/dist/iife/page-agent.demo.js |
| China | https://registry.npmmirror.com/page-agent/1.5.2/files/dist/iife/page-agent.demo.js |
β οΈ For technical evaluation only. This demo CDN uses our free testing LLM API. By using it, you agree to its terms.
npm install page-agentimport { PageAgent } from 'page-agent'
const agent = new PageAgent({
model: 'qwen3.5-plus',
baseURL: 'https://dashscope.aliyuncs.com/compatible-mode/v1',
apiKey: 'YOUR_API_KEY',
language: 'en-US',
})
await agent.execute('Click the login button')For more programmatic usage, see π Documentations.
We welcome contributions from the community! Follow our instructions in CONTRIBUTING.md for environment setup and local development.
Please read Code of Conduct before contributing.
This project builds upon the excellent work of browser-use.
PageAgent is designed for client-side web enhancement, not server-side automation.
DOM processing components and prompt are derived from browser-use:
Browser Use
Copyright (c) 2024 Gregor Zunic
Licensed under the MIT License
Original browser-use project: <https://github.com/browser-use/browser-use>
We gratefully acknowledge the browser-use project and its contributors for their
excellent work on web automation and DOM interaction patterns that helped make
this project possible.
Third-party dependencies and their licenses can be found in the package.json
file and in the node_modules directory after installation.
β Star this repo if you find PageAgent helpful!