Developer Preview Ferrosa Memory 0.15 — structured, linked, auditable memory for agentic systems.
Ferrosa Memory · 0.15 · Developer Preview

Memory your agents can query, link, and trust.

Ferrosa Memory is an MCP-native memory server for LLM agents. Instead of dumping text into a vector store, it keeps a typed entity graph, bi-temporal facts, raw context segments, and Datalog-derived knowledge — retrieved with hybrid search, consolidated automatically, and forgettable with a full audit trail.

See how it works Compare alternatives
Why it exists

Most "agent memory" is a vector store with a summary on top.

That loses the things agents actually need: when a fact changed, how facts relate, why a memory exists, and how to safely remove it. Ferrosa Memory treats memory as structured, queryable data — not opaque text.

Bi-temporal facts

Record what changed over time instead of overwriting history. The current fact stays easy to retrieve, while superseded facts remain inspectable for audit, debugging, and time-travel queries.

Typed entity graph

Entities, folds, facts, skills, and tags are connected with typed edges (depends_on, contains, calls, references…) so an agent can follow relationships, not just match text.

Hybrid retrieval

Recall fuses lexical, vector, phonetic, graph, workspace, and recency signals with Reciprocal Rank Fusion, then expands around raw context or document chunks to recover neighboring evidence.

Explainable derivation

Datalog-style rules derive and explain higher-level facts from graph state, so memory behavior can be inspected and audited instead of trusted blindly.

Auditable forgetting

The 0.15 forget flow shows the blast radius before anything changes, then retracts reversibly (or hard-deletes) with an append-only journal. Memory you can remove is memory you can trust.

Durable, tiered storage

Built on Ferrosa DB: CQL storage, HNSW vector indexes, a property graph, and S3-backed durability with hot/warm/cold tiering. Memory survives restarts and scales past one process.

A thin memory layer over database, graph, and inference.

Agents talk to Ferrosa Memory through a small set of MCP tools — ingest, retrieve, link, explain, and forget. The memory layer owns memory semantics: entities, folds, temporal facts, retrieval outcomes, intentions, skills, Datalog rules, and explanations.

Underneath, Ferrosa Database provides CQL storage, vector indexes, graph traversal, and durable S3-backed tiering. A background "dream cycle" consolidates new memories — discovering connections, scoring importance, and decaying stale recall — while you sleep.

Read the full architecture →

Agent / IDE / runtime
        │ MCP tools
        ▼
Ferrosa Memory service
  ├─ entities + typed graph
  ├─ bi-temporal facts
  ├─ context segments + folds
  ├─ hybrid retrieval (RRF)
  ├─ dream-cycle consolidation
  ├─ Datalog-derived facts
  └─ auditable forget journal
        │
        ▼
Ferrosa Database
  CQL · HNSW vectors · graph · S3 tiering
Tool catalog

Two tiers: a focused default set, and the full toolbox.

Agents see a focused Tier 1 set of everyday tools by default. The full Tier 2 surface — batch operations, folds, bi-temporal facts, memo & plan tracking, skills, consolidation, and the Datalog governance plane — unlocks with include_all when an operator needs it.

🧰

Tier 1 — Everyday memory (21 tools)

Ingest and recall: smart_ingest, hybrid_search, entity lookup, typed edges, chunk & turn context, intention checks, the full session-task surface, feedback, stats, configure, and forget. Everything an agent reaches for in a normal session.

⚙️

Tier 2 — Advanced & operator (58 tools)

Batch ingest/edit, trajectory folds, bi-temporal facts, memo & plans, the intention lifecycle, stored skills, dream-cycle consolidation & graph inference, the Datalog expert system (rules, claims, approvals, explanations), and config/introspection.

Browse the full tool catalog by level →

Recall you can trust, and memory you can remove.

forget

Auditable forgetting

Propose candidates, see the blast radius (edges, temporal chains, derived facts), then confirm a reversible retraction or a hard delete — all recorded in an append-only forget journal with crash recovery.

recall

Recall-relevance hardening

Source-aware relevance guards, lexical-overlap checks, authority/PageRank adjustments, and workspace filtering so hooks prefer silence over noisy context. Native full-text indexing speeds lexical recall.

session

Durable session tasks

First-class task state — focus stack, working set, and recovery hints — survives restarts and supports multi-agent handoff, so an interrupted agent can resume where it left off.

hooks

Workspace-isolated hooks

Session-start recall derives stable, workspace-specific sessions instead of leaking process-global state, and fallback recall stays semantic/procedural — never raw episodic transcript.

0.15 is a developer-preview release. Run it locally or self-hosted; keep the workbench and visualization surfaces operator-only unless you configure TLS, auth, and tenant isolation.
How it compares

Built for structure, not just similarity.

Vector stores and most memory layers answer "what looks similar?" Ferrosa Memory also answers "what changed, what's connected, why does this exist, and can I safely remove it?"

Capability Ferrosa Memory Vector store + RAG
Typed knowledge graphYesNo — similarity only
Bi-temporal fact historyYesOverwrite / append
Hybrid retrieval (lexical+vector+graph+recency)Yes — RRF over ~11 signalsVector (± keyword)
Explainable derived factsYes — DatalogNo
Auditable forgettingYes — blast radius + journalManual delete
Automatic consolidationYes — dream cycleNo
MCP-native toolsYesFramework-dependent

See the full comparison vs. Mem0, Zep, and Letta →

Research foundation

Engineering informed by the memory literature.

Ferrosa Memory is product engineering, not a paper implementation — but the design draws on recent and classic work on agent memory, virtual context, graph retrieval, prompt compression, Datalog-style inference, and memory evaluation.

Selected references from the local research corpus. Links point to canonical arXiv records.
PaperarXivTheme
LoCoMo: Evaluating Very Long-Term Conversational Memory of LLM Agents2402.17753long-term conversational memory
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory2410.10813memory evaluation
Memory in Large Language Models: Mechanisms, Evaluation and Evolution2509.18868memory taxonomy
Episodic Memory is the Missing Piece for Long-Term LLM Agents2502.06975episodic memory
Microstructures and Accuracy of Graph Recall by Large Language Models2402.11821graph recall
From Local to Global: A Graph RAG Approach to Query-Focused Summarization2404.16130GraphRAG
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression2310.06839long-context compression
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory2402.04617context memory

Give your agents memory worth keeping.

Start local, inspect memory in the workbench, and watch raw context become linked, durable, queryable memory. Self-hosted today — from a single workgroup machine to a replicated cluster on the distributed Ferrosa store. A managed service is on the roadmap.

How it works Set up locally