Helios

Open-source, local-first context intelligence engine for AI coding agents. Indexes your codebases, project dependencies, and documentation — makes everything searchable via hybrid keyword + semantic search through MCP. All data stays on your machine.

Think of it as a self-hosted, free alternative to Nia — with the added ability to index your actual installed dependency source code for version-correct search.

What It Does

Your Project Code  ──→  helios_index   ──┐
Installed Packages ──→  helios_deps    ──┤──→  Chunk  ──→  Embed  ──→  SQLite + FTS5
Documentation URLs ──→  helios_web     ──┘         (Ollama)              ↑
                                                                         │
Any AI Agent  ←──  MCP  ←──  helios_search / helios_context  ───────────┘

Helios runs as an MCP server. Any agent that supports MCP (Claude Code, Cursor, Windsurf, Continue.dev, Cline, etc.) connects to it and gets access to:

Your project code — indexed, chunked, and searchable
Every library you use — the actual installed source code from site-packages / node_modules, not stale training data
Documentation sites — crawled and indexed locally
Hybrid search — FTS5 keyword matching + vector semantic similarity via Ollama embeddings, fused with Reciprocal Rank Fusion
Live file watching — auto-reindexes when your code changes
Auto-docs discovery — finds documentation URLs for your Python dependencies from package metadata

Quick Start

# Clone and install
git clone https://github.com/rinaldofesta/helios.git
cd helios
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

# With numpy acceleration for vector search (recommended)
pip install -e ".[dev,numpy]"

# Start the MCP server (for testing)
helios serve

Connect to Claude Code

Add to ~/.claude.json:

{
  "mcpServers": {
    "helios": {
      "type": "stdio",
      "command": "/path/to/helios/.venv/bin/python",
      "args": ["-m", "helios.server"]
    }
  }
}

Or for any MCP-compatible agent, point it at python -m helios.server via stdio transport.

Index Your Project

Once connected, the agent can use these tools:

# Index a codebase (auto-watches for changes)
helios_index(path="/path/to/myproject")

# Index all project dependencies (finds site-packages/node_modules automatically)
helios_deps(path="/path/to/myproject")

# Index a documentation site
helios_web(url="https://docs.pydantic.dev")

# Search across everything
helios_search(query="how to validate nested models")

# Assemble multi-source context for a task
helios_context(task="Add OAuth2 login to the FastAPI app")

MCP Tools

Tool	Description
`helios_index`	Index a directory with scan + chunk + embed pipeline. Auto-watches for changes.
`helios_deps`	Auto-detect and index all project dependencies. Discovers documentation URLs from package metadata.
`helios_web`	Crawl and index a documentation site or web page.
`helios_search`	Hybrid keyword + semantic search across all indexed sources.
`helios_context`	Multi-source context assembly — searches project, deps, and docs, returns results grouped by source type.
`helios_status`	Show all indexed sources with stats (files, chunks, embeddings).
`helios_read`	Read the full contents of an indexed file or web page.
`helios_explore`	Browse the file structure of an indexed source.
`helios_remove`	Remove an indexed source and all its data.

MCP Resources

Agents can also read these MCP resources passively:

Resource	Description
`helios://status`	Current index status — sources, file counts, chunk counts, embeddings.
`helios://sources`	List of all indexed source names for use with the `source` filter.

How Search Works

Helios uses a three-layer search architecture:

FTS5 keyword search — SQLite full-text search with BM25 ranking. Fast, exact matches.
Vector semantic search — Ollama embeddings (nomic-embed-text, 768 dims) with cosine similarity. Finds conceptually similar content even with different wording. Accelerated with numpy when installed.
Reciprocal Rank Fusion — Merges keyword and semantic rankings into a single result set. Gets the best of both.

Search modes via helios_search:

"auto" (default) — hybrid if embeddings exist, keyword-only otherwise
"keyword" — FTS5 only
"semantic" — vector only

An in-memory embedding cache avoids reloading vectors from SQLite on repeated queries. The cache is automatically invalidated when embeddings change.

Dependency Intelligence

helios_deps is the killer feature. It:

Parses your dependency files (pyproject.toml, requirements.txt, package.json)
Finds the installed source code in your venv's site-packages or node_modules
Indexes each package with the full pipeline (scan → chunk → embed)
Names them dep:<package> so you can search within specific libraries
Discovers documentation URLs from package metadata and reports them

This means your AI agent has access to the actual installed version of every library — not stale training data. No more hallucinated APIs.

# Search within a specific dependency
helios_search(query="OAuth2PasswordBearer", source="dep:fastapi")

# Search across all dependencies
helios_search(query="connection pool configuration")

After running helios_deps, you'll see discovered documentation URLs:

Documentation URLs discovered (14):
  pydantic: https://docs.pydantic.dev
  typer: https://typer.tiangolo.com
  rich: https://rich.readthedocs.io/en/latest/
  ...
Use helios_web(url=...) to index any of these.

Live File Watching

After indexing, Helios watches directories for changes using watchdog. When files change:

Debounces changes (2-second window)
Re-scans only modified files (content-hash based)
Re-chunks only changed documents
Generates embeddings only for new chunks

The watcher runs as a background task inside the MCP server process.

Architecture

src/helios/
  indexing/        Content intelligence pipeline
    store.py       SQLite + FTS5 for documents, chunks, and embeddings
    scanner.py     Directory scanner with 60+ language mappings
    chunker.py     Language-aware document chunking
    embeddings.py  Ollama embedding generation + cosine similarity
    crawler.py     URL crawler with HTML text extraction
    dependencies.py Dependency detection, source path resolution, docs URL discovery
    watcher.py     File watcher with debounced auto-reindexing
  server/          MCP server
    app.py         FastMCP server with 9 tools + 2 resources
    __main__.py    Entry point (python -m helios.server)
  core/            Task orchestration engine (Orchestrator/Sub-agent/Refiner)
  providers/       LLM provider abstractions (Ollama, Groq, OpenAI-compatible, llama.cpp)
  memory/          Session persistence (SQLite + JSON), skills system
  cli/             Typer CLI (run, resume, serve, sessions, models, config, web)
  web/             FastAPI dashboard with WebSocket streaming
  config/          TOML + env var + CLI flag config loading
  tools/           Tool protocol and 7 built-in orchestration tools
  models/          HuggingFace Hub integration (search + GGUF download)

Tech Stack

Python 3.11+ with async-first design
SQLite + FTS5 for all persistence and full-text search
MCP SDK (mcp package) for agent integration via tools and resources
Ollama for local embeddings (nomic-embed-text) and LLM inference
Pydantic v2 for all data models
aiosqlite for async database access
watchdog for file system monitoring
httpx for URL crawling (transitive dep from mcp)
numpy (optional) for accelerated vector similarity search
Typer for CLI, Rich for console output
FastAPI for web dashboard (optional)

Installation Options

# Core (keyword search + embeddings via Ollama)
pip install -e .

# With numpy acceleration for vector search (recommended)
pip install -e ".[numpy]"

# With local GGUF model support
pip install -e ".[local]"

# With web dashboard
pip install -e ".[web]"

# Everything
pip install -e ".[all,dev]"

Configuration

Helios uses layered configuration (each overrides the previous):

Built-in defaults
Config file (helios.toml or ~/.config/helios/config.toml)
Environment variables (GROQ_API_KEY, etc.)
CLI flags

See helios.toml.example for all options.

Running Tests

pip install -e ".[dev]"
pytest

152 tests covering indexing, chunking, search, embeddings, dependencies, crawler, and file watcher.

Requirements

Python 3.11+
Ollama (for embeddings and local LLM inference) — optional but recommended
No cloud APIs required — everything runs locally

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Helios

What It Does

Quick Start

Connect to Claude Code

Index Your Project

MCP Tools

MCP Resources

How Search Works

Dependency Intelligence

Live File Watching

Architecture

Tech Stack

Installation Options

Configuration

Running Tests

Requirements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
helios		helios
legacy		legacy
src/helios		src/helios
tests		tests
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md
helios.toml.example		helios.toml.example
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Helios

What It Does

Quick Start

Connect to Claude Code

Index Your Project

MCP Tools

MCP Resources

How Search Works

Dependency Intelligence

Live File Watching

Architecture

Tech Stack

Installation Options

Configuration

Running Tests

Requirements

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages