Agent Search Engine

Issue 001 / A living technical almanac

Open source · Python

Best Python research & data agents

Open-source research & data agents written in Python, drawn from the Agent Search Engine index and ranked by maintained adoption (GitHub stars and recent commit activity), never by sponsorship. Every project here is Python-first, self-hostable, and free to run; the trade-off is that you host and maintain it yourself.

18 open-source records

01 Scrapling (Infrastructure)
An adaptive web scraping framework that handles everything from a single request to a full-scale crawl!
Python · 68k stars · BSD-3-Clause · Jun 2026

02 TrendRadar (Agent)
AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts. Say goodbye to information overload: your AI public-opinion monitoring assistant and trending-topic filter! Aggregates trending topics from multiple platforms + RS…
Python · 60k stars · GPL-3.0 · Jul 2026

03 BettaFish (Agent)
微舆 (Weiyu): a multi-agent public-opinion analysis assistant anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Built from scratch, with no dependency on any framework.
Python · 42k stars · GPL-2.0 · May 2026

04 khoj (Agent)
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Tu…
Python · 35k stars · AGPL-3.0 · Jun 2026

05 storm (Agent)
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Python · 30k stars · MIT · Sep 2025

06 gpt-researcher (Agent)
An autonomous agent that conducts deep research on any data using any LLM provider.
Python · 28k stars · Apache-2.0 · Jun 2026

07 DeepResearch (Agent)
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Python · 20k stars · Apache-2.0 · Feb 2026

08 local-deep-research (Agent)
~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google,...). 10+ search engines - arX…
Python · 8.6k stars · MIT · Jul 2026

09 MiroThinker (Agent)
MiroThinker is a deep research agent optimized for complex research and prediction tasks. Our latest models, MiroThinker-1.7, achieves 74.0…
Python · 8.3k stars · Apache-2.0 · Apr 2026

10 deep-searcher (Agent)
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Python · 7.9k stars · Apache-2.0 · Nov 2025

11 unstract (Platform)
LLM-Driven Extraction of Unstructured Data: Built for API Deployments & ETL Pipeline Workflows
Python · 6.7k stars · AGPL-3.0 · Jul 2026

12 ms-agent (Framework)
MS-Agent: a lightweight framework to empower agentic execution of complex tasks
Python · 4.3k stars · Apache-2.0 · Jul 2026

Frequently asked

What are the best Python research & data agents?
The best are the Python projects listed above, ranked by maintained adoption (GitHub stars plus recent commit activity), so you see what is genuinely used and still maintained rather than abandoned.
Are these Python research & data agents open source and free?
Yes — every project on this page is open source and written primarily in Python. The software is free to run and self-host; you still pay for the infrastructure you run it on and any model or API usage it makes.
Why choose Python research & data agents specifically?
Staying in your team's primary language — Python — makes self-hosting, extending, and debugging far easier, because you can read and modify the source directly instead of treating it as a black box.
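
For readers who want to reproduce a similar ordering on their own shortlist, here is a minimal Python sketch of one way "maintained adoption" could be computed. The page does not publish its exact formula, so everything here is an assumption: the `Project` record, the `rank_by_maintained_adoption` function, the 12-month idle cutoff, and the `as_of` date are all hypothetical, and the star counts are the rounded figures from the listing. The `stale-example` entry is invented purely to show the filter at work.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Project:
    name: str
    stars: int          # rounded star count, as shown in the listing
    last_commit: date   # first day of the month shown in the listing


def months_between(earlier: date, later: date) -> int:
    """Whole calendar months from `earlier` to `later`."""
    return (later.year - earlier.year) * 12 + (later.month - earlier.month)


def rank_by_maintained_adoption(projects, as_of, max_idle_months=12):
    """Assumed rule: drop projects idle past the cutoff, then sort by stars."""
    maintained = [
        p for p in projects
        if months_between(p.last_commit, as_of) <= max_idle_months
    ]
    return sorted(maintained, key=lambda p: p.stars, reverse=True)


projects = [
    Project("storm", 30_000, date(2025, 9, 1)),
    Project("Scrapling", 68_000, date(2026, 6, 1)),
    Project("khoj", 35_000, date(2026, 6, 1)),
    # Hypothetical entry: many stars, but no commits for years.
    Project("stale-example", 90_000, date(2023, 1, 1)),
]

ranked = rank_by_maintained_adoption(projects, as_of=date(2026, 7, 1))
print([p.name for p in ranked])
```

A hard cutoff followed by a sort on stars is the simplest rule consistent with the order shown on this page; a weighted score that decays with idle time would be an equally plausible alternative.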