Open Source  ·  Zero Backend  ·  v1.07

The AI workspace that
lives in one file

Multi-agent orchestration, document RAG, live web research, and human-in-the-loop review — with zero servers, zero installs, and zero data leaving your machine.

↓ Download on GitHub See how it works
Free & open source  ·  Chrome · Edge · Firefox · Safari
✓ No server required ✓ No account needed ✓ 13 AI providers ✓ Offline WebGPU 100% client-side AES-256 encrypted keys Zero telemetry
Nemilia (neh-MEE-lee-ah) — Nahuatl: to think, to remember, to imagine

Everything you need.
Nothing you don't.

Open the HTML file. That's the entire setup. No npm, no Docker, no Python environment.

🤖
Multi-Agent Orchestration
Define specialist agents — researcher, analyst, writer, reviewer, synthesizer — and wire them into automated DAG pipelines. Each output is scored 1–10 and retried automatically if it falls short.
📄
Document RAG
Upload PDFs, DOCX, TXT, or CSV files. Nemilia chunks and embeds them with 384-dimensional vectors using hybrid semantic + BM25 search. Agents answer from your actual documents.
🌐
Live Web Research
NEXUS searches the internet in real-time via Brave, Serper, Tavily, or SearXNG. Findings are cited inline and passed to all downstream agents as grounded context.
👁
Human-in-the-Loop
Pause any pipeline at any agent. Approve the output, edit it inline, or send feedback to re-run that step. Loop as many times as needed before the pipeline continues.
🧠
Persistent Memory
Agents write MEMORY[key] = value in their outputs. These facts are stored and injected into all future agent prompts — building institutional knowledge across sessions.
📡
Offline AI via WebGPU
Run Llama, Phi, Qwen, Gemma, or Mistral entirely in your browser. No API key, no internet after model download. Models cache locally for instant future loads.
🔁
Drag-and-Drop Workflows
Build pipelines visually. Reorder agents by dragging, toggle HITL checkpoints per agent, and preview the execution flow as a live SVG DAG diagram before you run.
🎨
Visual Generation
VISION generates Chart.js charts, SVG diagrams, and HTML infographics — no image API required. Optional AI image generation via DALL·E, FLUX, or Stable Diffusion.
📁
Workspace Sync
Sync to a real folder on disk. Agents, prompts, and workflows become plain JSON and Markdown — editable in VS Code, version-controllable in Git, shareable with anyone.

From prompt to polished output in one click

The orchestrator handles everything — decomposition, parallel execution, quality scoring, and final synthesis.

1
Decompose
The orchestrator receives your prompt and assigns each agent a specific sub-task tailored to its role.
2
Execute in stages
Search agents run first, analysis agents run in parallel, synthesis agents run last — faster and rate-limit safe.
3
Validate & retry
Each output is scored 1–10. Scores below 6 trigger automatic retries with targeted improvement prompts.
4
Synthesize
The orchestrator merges all outputs into a final polished result. Export as Markdown or download as PDF.

Seven specialists. One team.

Each agent has a distinct role, personality, and system prompt. Create unlimited custom agents for any domain.

🔍
SCOUT
Deep research & information gathering
🔬
LENS
Analysis & critical insight extraction
✍️
QUILL
Long-form writing & content creation
⚒️
FORGE
Quality review & fact-checking
🧵
WEAVE
Final synthesis & executive summary
🌐
NEXUS
Live web search with cited sources
🎨
VISION
Charts, diagrams & visual generation

13 AI providers.
Switch with one click.

Your workflows never lock you in. Swap from GPT-4 to Llama to DeepSeek without rewriting a thing.

OpenAIAnthropicGroq Google GeminiDeepSeekMistral xAI GrokKimiOpenRouter Replicate Ollama (local) LM Studio (local) WebGPU (in-browser)

Your data never leaves
your machine.

Built for people who need powerful AI without the compliance liability of cloud infrastructure. Everything runs in your browser.

🔐
AES-256-GCM key encryption — API keys encrypted with PBKDF2 (200,000 iterations). Never written as plaintext.
🚫
Zero telemetry — No analytics, no tracking pixels, no beacons, no phone-home code. Verify it: it's one file.
💻
100% client-side processing — Document parsing, vector embeddings, memory, orchestration, charts. All local.
📴
Fully offline capable — With WebGPU and cached documents, Nemilia runs with no internet after initial setup.

Open the file.
Start building.

No waitlist. No onboarding. No account. Download the HTML file and open it in your browser — that's it.

Questions? Reach out at luis@nemilia.com

✉️

Get in Touch

For press inquiries, enterprise discussions, partnership opportunities, or just to say hello — Luis reads every email personally.

✉  luis@nemilia.com
Response time: typically within 24 hours