fleet-rlm combines a ReAct chat orchestrator with recursive long-context execution over a shared Daytona-backed interpreter runtime.
ReAct chat orchestrator
FleetAgent (wrapped by AgentRuntime) is the interactive orchestrator. It:
- Receives user requests from the CLI, HTTP API, or WebSocket transport.
- Decides tool actions using dspy.ReAct.
- Streams intermediate and final events back to the client.
- Maintains conversation history and document context.
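The responsibilities listed above can be pictured as a small reason-act loop. The following is a minimal sketch in plain Python, not the FleetAgent implementation: `Event`, `TOOLS`, `decide`, and `run_turn` are hypothetical names, and `decide` is a hard-coded stand-in for the decision step that dspy.ReAct delegates to a language model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterator, List, Tuple


@dataclass
class Event:
    kind: str      # "tool_call", "tool_result", or "final" (assumed event kinds)
    payload: str


# Hypothetical tool registry used only for this sketch.
TOOLS: Dict[str, Callable[[str], str]] = {
    "word_count": lambda text: str(len(text.split())),
}


def decide(request: str, observations: List[str]) -> Tuple[str, str]:
    """Stand-in for the model's decision step: pick a tool or finish."""
    if not observations:
        return "word_count", request
    return "finish", f"The request has {observations[-1]} words."


def run_turn(
    request: str,
    history: List[Tuple[str, str]],
    max_iters: int = 5,
) -> Iterator[Event]:
    """Run one chat turn, yielding intermediate and final events."""
    observations: List[str] = []
    for _ in range(max_iters):
        action, arg = decide(request, observations)
        if action == "finish":
            history.append((request, arg))  # maintain conversation history
            yield Event("final", arg)
            return
        yield Event("tool_call", f"{action}({arg!r})")
        result = TOOLS[action](arg)
        observations.append(result)
        yield Event("tool_result", result)
    yield Event("final", "Stopped: iteration limit reached.")
```

Because `run_turn` is a generator, a transport layer can forward each event to the client as soon as it is produced instead of waiting for the final answer.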
Recursive long-context execution
For tasks that exceed a single ReAct context window, the agent delegates to a bounded dspy.RLM running inside a child Daytona sandbox. The recursive engine implements Algorithm 1 from arXiv 2512.24601v2:
- Inputs are stored as REPL variables inside the child sandbox.
- Sub-queries are dispatched recursively, bounded by max_iterations and max_llm_calls.
- Sandboxes are isolated per delegation.
- A single shared semantic-call budget covers the entire recursive tree.
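To illustrate how one budget can cover a whole recursive tree, here is a minimal sketch in plain Python. It is not the algorithm from the paper or the fleet-rlm engine: `CallBudget`, `BudgetExceeded`, `fake_llm`, and `recursive_query` are hypothetical names, a plain dictionary stands in for the REPL variables of a child sandbox, and treating max_iterations as a cap on sub-queries per level is a simplifying assumption.

```python
from dataclasses import dataclass


class BudgetExceeded(RuntimeError):
    """Raised when the shared call budget is used up."""


@dataclass
class CallBudget:
    """One mutable counter shared by every node in the recursive tree."""
    max_llm_calls: int
    used: int = 0

    def spend(self) -> None:
        if self.used >= self.max_llm_calls:
            raise BudgetExceeded(f"budget of {self.max_llm_calls} calls exhausted")
        self.used += 1


def fake_llm(prompt: str) -> str:
    # Hypothetical stand-in for a model call.
    return f"summary({len(prompt)} chars)"


def recursive_query(
    text: str,
    budget: CallBudget,
    chunk_size: int = 100,
    max_iterations: int = 4,
) -> str:
    # A fresh dict per delegation stands in for an isolated child sandbox
    # that holds its input as a variable.
    variables = {"context": text}
    context = variables["context"]

    if len(context) <= chunk_size:
        budget.spend()
        return fake_llm(context)

    partials = []
    for iteration, start in enumerate(range(0, len(context), chunk_size)):
        if iteration >= max_iterations:  # per-level bound (assumed semantics)
            break
        chunk = context[start:start + chunk_size]
        partials.append(recursive_query(chunk, budget, chunk_size, max_iterations))

    budget.spend()  # the merge step also counts against the same budget
    return fake_llm(" ".join(partials))
```

Passing the same `CallBudget` object down every branch is what makes the limit global: a deep or wide tree exhausts it just as a long flat loop would.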
Interpreter runtime backends
Interpreter backends provide isolated remote execution. The current contract is Daytona-only:
- Sandbox isolation from the host environment.
- Persistent storage via durable mounted volumes (memory/, artifacts/, buffers/, meta/).
- Controlled execution profiles for root and delegate behavior.
The dspy.RLM runtime serves the CLI, HTTP API, and Web UI.
Runtime surfaces
| Surface | Command |
|---|---|
| Web UI + API | uv run fleet web |
| Terminal chat | uv run fleet or uv run fleet-rlm chat |
| API server only | uv run fleet-rlm serve-api |
Observability and state
The system emits two WebSocket streams:
- /api/v1/ws/execution: chat stream events
- /api/v1/ws/execution/events: execution graph events
The state payload restores dspy.History turns, agent core memory, loaded documents, and Daytona interpreter state.
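A client needs full WebSocket URLs for the two stream paths named above. The helper below is a small sketch using only the standard library; `ws_url` and the constant names are hypothetical, and the host and port in the usage note are assumptions rather than documented defaults.

```python
from urllib.parse import urlsplit, urlunsplit

# Paths taken from the list of streams above.
CHAT_STREAM_PATH = "/api/v1/ws/execution"
EXECUTION_EVENTS_PATH = "/api/v1/ws/execution/events"


def ws_url(base_url: str, path: str) -> str:
    """Derive a ws:// or wss:// URL from the server's HTTP base URL."""
    parts = urlsplit(base_url)
    scheme = "wss" if parts.scheme == "https" else "ws"
    return urlunsplit((scheme, parts.netloc, path, "", ""))
```

For a server assumed to listen on `http://localhost:8000`, `ws_url("http://localhost:8000", CHAT_STREAM_PATH)` yields the chat stream URL, and the same call with `EXECUTION_EVENTS_PATH` yields the execution graph stream URL.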
Auth and environment guardrails
Runtime behavior depends on the following environment variables:
| Variable | Purpose |
|---|---|
| APP_ENV | local, staging, or production |
| AUTH_MODE | dev or entra |
| AUTH_REQUIRED | Enforce auth on API routes |
| DATABASE_REQUIRED | Enforce Neon/Postgres connectivity |
When AUTH_MODE=entra, HTTP and WebSocket access use real Entra bearer-token validation plus Neon-backed tenant admission. Runtime settings writes are intentionally limited to APP_ENV=local.
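The guardrails above can be sketched as a small configuration loader. This is an illustration, not the project's settings code: `RuntimeConfig`, `load_config`, and `settings_writable` are hypothetical names, and both the default values and the accepted boolean spellings are assumptions.

```python
import os
from dataclasses import dataclass
from typing import Mapping

ALLOWED_APP_ENVS = {"local", "staging", "production"}
ALLOWED_AUTH_MODES = {"dev", "entra"}


@dataclass(frozen=True)
class RuntimeConfig:
    app_env: str
    auth_mode: str
    auth_required: bool
    database_required: bool

    @property
    def settings_writable(self) -> bool:
        # Runtime settings writes are limited to APP_ENV=local.
        return self.app_env == "local"


def _flag(value: str) -> bool:
    # Assumed boolean spellings; the real parser may differ.
    return value.strip().lower() in {"1", "true", "yes", "on"}


def load_config(env: Mapping[str, str] = os.environ) -> RuntimeConfig:
    app_env = env.get("APP_ENV", "local")      # default is an assumption
    auth_mode = env.get("AUTH_MODE", "dev")    # default is an assumption
    if app_env not in ALLOWED_APP_ENVS:
        raise ValueError(f"unsupported APP_ENV: {app_env!r}")
    if auth_mode not in ALLOWED_AUTH_MODES:
        raise ValueError(f"unsupported AUTH_MODE: {auth_mode!r}")
    return RuntimeConfig(
        app_env=app_env,
        auth_mode=auth_mode,
        auth_required=_flag(env.get("AUTH_REQUIRED", "false")),
        database_required=_flag(env.get("DATABASE_REQUIRED", "false")),
    )
```

Validating the values once at startup keeps a misspelled environment name from silently falling through to a permissive mode.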
Goal-first, not repo-first
Repositories are one possible source of context, alongside local files, staged documents, pasted content, and URLs. Requests may include repo_url, repo_ref, context_paths, and batch_concurrency as per-turn execution hints.
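As a rough picture of what per-turn hints might look like on the client side, the sketch below models the four fields as optional values. Only the field names come from the text above; the `ExecutionHints` class, the field types, the example values, and the idea of omitting unset fields are assumptions.

```python
from dataclasses import asdict, dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class ExecutionHints:
    """Optional per-turn hints; every field may be left unset."""
    repo_url: Optional[str] = None
    repo_ref: Optional[str] = None
    context_paths: List[str] = field(default_factory=list)
    batch_concurrency: Optional[int] = None

    def to_payload(self) -> Dict[str, Any]:
        # Omit unset hints so a goal-only request carries no repo fields.
        return {
            key: value
            for key, value in asdict(self).items()
            if value is not None and value != []
        }
```

A request that only states a goal would send `ExecutionHints().to_payload()`, which is an empty dictionary, while a repository-backed turn might set `repo_url` and `repo_ref` and leave the rest unset.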
Next
Architecture
Thin transport, runtime core, and Daytona substrate.
Recursive RLM
Delegation, isolation, and the shared call budget.