Agent Search Engine

Issue 001 / A living technical almanac

Browser & Computer Use · Head-to-head

midscene vs UI-TARS-desktop

midscene and UI-TARS-desktop are both open-source agents in the browser & computer use category. midscene describes itself as AI-powered, vision-driven UI automation for every platform, while UI-TARS-desktop describes itself as the open-source multimodal AI agent stack, connecting cutting-edge AI models and agent infra. Here's an independent, side-by-side look at how they compare and which one fits.

midscene

Agent · Open source

AI-powered, vision-driven UI automation for every platform.

UI-TARS-desktop

Agent · Open source

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

Side by side

Spec            midscene       UI-TARS-desktop
Type            Agent          Agent
Model           Open source    Open source
Pricing         Open source    Open source
GitHub stars    13,925         37,575
Language        TypeScript     TypeScript
License         MIT            Apache-2.0
Last activity   Jul 2026       Jul 2026

Choose midscene if

you want AI-powered, vision-driven UI automation for every platform.

Choose UI-TARS-desktop if

you want the project with more GitHub stars (37,575 versus 13,925 for midscene).
