Skip to main content
AIDiveForge AIDiveForge
Visit local-deep-research

Get This Tool

License: MIT Any use incl. commercial
Local-run terms: MIT License permits free use, modification, and distribution for any purpose (including commercial) provided attribution is included. Full source code is available on GitHub and via PyPI. Users can run the tool entirely offline on local hardware without any vendor involvement.

Share This Tool

Compare This Tool
📋 Embed this tool on your site

Copy this code to embed a compact tool card:

local-deep-research

FreeOpen SourceAPISelf-HostedAgentic

Pricing

Model
Free

Summary

Most cloud research tools process your documents on someone else's server — which is a problem the moment a document contains anything confidential. Local Deep Research runs the entire pipeline on your own hardware, with encrypted local processing and no data leaving the machine.

The tool autonomously plans and executes multi-step research tasks: it queries sources, follows citations, synthesizes findings, and returns results with full attribution — all without a cloud handoff. The vendor reports ~95% on SimpleQA benchmarks using models like Qwen3-27B on a single RTX 3090, which gives you a concrete hardware target. It pulls from 10+ search backends including arXiv, PubMed, and private document collections. Where it breaks: running capable local models demands real GPU headroom, and teams without that hardware will either throttle to weaker models or route queries to cloud LLMs — at which point the privacy guarantee depends entirely on which cloud endpoint they configure. The 109 open issues and 210 open pull requests on GitHub signal an active but fast-moving codebase; production stability requires version pinning.

Bottom line: Pick this if you're a researcher or enterprise team that needs cited, multi-source deep research with provable data isolation — but plan for meaningful GPU investment before the benchmark accuracy claims become relevant to your workflow.

Community Performance Report Card

No community ratings yet. Be the first to rate this tool!

Best For: Researchers requiring private, high-accuracy deep research, Organizations handling sensitive or confidential information, Teams needing full control over data and inference infrastructure, Technical users who want to avoid cloud vendor lock-in, Academic and enterprise research workflows

Community Benchmarks Community

No community benchmarks yet. Be the first to share a real-world data point.

  • Encrypted, fully local processing means documents never leave your infrastructure, so regulated or confidential data can be fed directly into research workflows without legal review of a vendor's data handling terms.
  • Provider-agnostic model routing — llama.cpp, Ollama, OpenAI, Google, and others through a single config — so migrating from cloud to local inference when privacy requirements tighten is a configuration change, not a rewrite.
  • 10+ search backends including arXiv and PubMed alongside private document collections, so a single research query can span published literature and internal proprietary data in one agent run rather than requiring two separate tools.
  • Full source citations on every synthesized output, which means research results arrive with attribution intact — no manual provenance chase before you can use the findings in a paper or internal report.
  • MIT license with self-hosted deployment means no vendor lock-in and no per-query costs as research volume scales, so teams running high-throughput literature reviews are not watching an API bill grow with every job.
  • Benchmark-level accuracy (~95% on SimpleQA) is tied to running Qwen3-27B on a GPU like the RTX 3090; teams without comparable hardware that fall back to smaller models or CPU inference will see meaningfully lower result quality, and the gap is not documented per-model in the scraped source.
  • With 109 open issues and 210 open pull requests, the codebase changes fast — teams that deploy this into production pipelines without pinning to a specific release version will encounter breaking changes between upgrades, and there is no paid support tier to escalate when something breaks.
  • The project has no commercial backing, only donations and grants; teams that need SLA-backed uptime, security patches on a defined schedule, or vendor-supported integrations will eventually migrate to a commercial research agent — the community-only support model is the condition that triggers that switch.

Community Reviews

No reviews yet. Be the first to share your experience.

About

Platforms
Linux, macOS, Windows (via Docker, WSL2, or direct installation)
API Available
Yes
Self-Hosted
Yes
Last Updated
2026-06-09T06:14:27.068Z

Best For

Who it's for

  • Researchers requiring private, high-accuracy deep research
  • Organizations handling sensitive or confidential information
  • Teams needing full control over data and inference infrastructure
  • Technical users who want to avoid cloud vendor lock-in
  • Academic and enterprise research workflows

What it does well

  • Conducting literature reviews across arXiv and academic databases
  • Analyzing private documents with encrypted local processing
  • Performing research queries with full source citations and attribution
  • Building custom knowledge bases with proprietary or sensitive data
  • Running agentic research workflows entirely offline

Integrations

Ollamallama.cppGoogle Generative AIAnthropic ClaudeOpenAI GPTSerperSearXNGarXivPubMedprivate document sources

Discussion Community

No discussion yet. Sign in to start the conversation.

Compare local-deep-research

Spotted incorrect or missing data? Join our community of contributors.

Sign Up to Contribute

Community Notes & Tips Community

Be the first to contribute. General notes, observations, gotchas, and tips from people who use this tool day-to-day.

Frequently Asked Questions

Is local-deep-research free?
Yes — local-deep-research is fully free to use. There is no paid tier.
Is local-deep-research open source?
Yes. local-deep-research is open source.
Does local-deep-research have an API?
Yes. local-deep-research exposes a developer API. See the official documentation at https://github.com/learningcircuit/local-deep-research for details.
Can I self-host local-deep-research?
Yes. local-deep-research supports self-hosting on your own infrastructure.
When was local-deep-research released?
local-deep-research was first released in 2024.
What platforms does local-deep-research support?
local-deep-research is available on: Linux, macOS, Windows (via Docker, WSL2, or direct installation).

Hours Saved & ROI Stories Community

Be the first to contribute. Concrete time/cost savings, with context. e.g. "Cut my code review backlog from 4h to 45m per week."

local-deep-research

Cloud-based research assistants process your queries and documents on vendor infrastructure — acceptable for public information, a liability for anything proprietary. Local Deep Research is an MIT-licensed, self-hosted research agent that plans its own search strategy, queries multiple sources in a loop, and returns synthesized answers with full source citations. The core workflow: you submit a research question, the agent autonomously breaks it into sub-queries, fans out across configured search engines and document collections, retrieves and ranks results, then assembles a cited report — entirely on infrastructure you control.

The differentiating feature is the combination of local-first encrypted processing with broad model compatibility. The tool runs against llama.cpp, Ollama, and major cloud LLMs through a unified interface, so teams can start on OpenAI during prototyping and migrate to a local Qwen or LLaMA deployment without rewriting their workflows. The vendor states ~95% accuracy on SimpleQA benchmarks with Qwen3-27B on a single RTX 3090, which anchors what ‘local’ actually requires in hardware terms. The 10+ search backends span academic databases (arXiv, PubMed), general web search, and private document stores — covering both public literature review and proprietary knowledge base queries in the same agent loop.

This fits tightly into two scenarios: academic researchers conducting literature reviews who need citation integrity and reproducibility, and enterprise teams handling documents that cannot leave the building. It does not fit teams without GPU resources expecting benchmark-level accuracy from CPU inference — the accuracy numbers are tied to specific hardware. The open-issues count on GitHub (109 issues, 210 pull requests as of the repository snapshot) indicates the project moves fast; teams depending on this in production pipelines need explicit version pinning and regression testing on upgrades.

Deployment is self-hosted via Docker (Unraid templates are included in the repository), and an API is available for integrating the research agent into larger pipelines. The project runs on donations and grants with no commercial offering, which means there is no paid support tier — community forums and GitHub Issues are the escalation path.