midscene vs page-agent
midscene and page-agent are both browser & computer use tools. midscene offers AI-powered, vision-driven UI automation for every platform, while page-agent is a JavaScript in-page GUI agent that controls web interfaces with natural language. Here's an independent, side-by-side look at how they compare and which one fits.
Side by side
| Spec | midscene | page-agent |
|---|---|---|
| Type | Agent | Agent |
| Model | Open source | Open source |
| Pricing | Open source | Open source |
| GitHub stars | 13,925 | 21,592 |
| Language | TypeScript | TypeScript |
| License | MIT | MIT |
| Last activity | Jul 2026 | Jul 2026 |
Choose page-agent if you want the more widely adopted project (about 22k GitHub stars, versus about 14k for midscene).
About page-agent
JavaScript in-page GUI agent. Control web interfaces with natural language.
More browser & computer use comparisons
- Browser Use Cloud vs page-agent
- Browser Use Cloud vs midscene
- ChatGPT Agent vs page-agent
- midscene vs ChatGPT Agent
- page-agent vs UI-TARS-desktop
- midscene vs UI-TARS-desktop