MemState - Transactional Memory for AI Agents

Agents hallucinate because their memory drifts. SQL says one thing, the Vector DB says another. MemState keeps them in sync, always.

Mental Model: MemState extends database transactions to your Vector DB.
One unit. One commit. One rollback.

Why MemState exists

The Problem

AI agents usually store memory in two places:

SQL: structured facts (preferences, task history)
Vector DB: semantic search (embeddings for RAG)

These two stores drift easily. Even small failures create inconsistency:

# Step 1: SQL write succeeds
db.update("user_city", "London")

# Step 2: Vector DB update fails
vectors.upsert("User lives in London")  # ❌ failed

# Final state:
# SQL: London
# Vectors: New York (stale embedding)

Result: ghost vectors, inconsistent state, unpredictable agent behavior.
Drift accumulates silently, agents continue retrieving outdated or mismatched memory.

Why this happens

Real-world agent pipelines create drift even with correct code, because:

Vector DB upserts are not atomic
Retried writes can produce duplicates or stale embeddings
Async ingestion leads to race conditions
LLM outputs often contain non-schema JSON
Embedding model/version changes create semantic mismatch
SQL writes succeed while vector DB fails, partial updates persist

These issues are invisible until retrieval fails, making debugging extremely difficult. MemState prevents this by enforcing atomic memory operations: if any part fails, the whole operation is rolled back.

The Solution

MemState makes memory operations atomic:

SQL write + Vector upsert
→ succeed together or rollback together

Key features

Atomic commits: SQL and Vector DB stay in sync
Rollback: undo N steps across SQL and vectors
Type safety: Pydantic validation prevents malformed JSON
Append-only Fact Log: full versioned history
Crash-safe atomicity: if a vector DB write fails, the entire memory operation (SQL + vector) is rolled back. No partial writes, no ghost embeddings, no inconsistent checkpoints.
Hybrid Search: Search by meaning (Vector), filter by facts (SQL).

Proof: Benchmark under failure

1000 memory updates with 10% random vector DB failures:

METRIC	MANUAL SYNC	MEMSTATE
SQL Records	1000	900
Vector Records	910	900
DATA DRIFT	90	0
INCONSISTENCY RATE	9.0%	0.0%

Why 900 instead of 1000? MemState refuses partial writes.
If vector sync fails, SQL is rolled back automatically.

Manual sync produces silent drift.
Drift compounds over time, stale embeddings keep being retrieved forever.

Full benchmark script: benchmarks/

Ideal for

Long-running agents
LangGraph workflows
RAG systems requiring strict DB <-> embedding consistency
Local-first / offline-first setups (SQLite/Redis/PostgreSQL + Chroma/Qdrant/FAISS)
Deterministic, debuggable agentic pipelines

Storage backends

All backends participate in the same atomic commit cycle:

SQLite (JSON1)
Redis
PostgreSQL
In-memory
Custom backends via simple interface

Vector sync hooks:

ChromaDB
Qdrant
Custom hooks via simple interface

When you don't need it

Your agent is fully stateless
You store everything in a single SQL table
You never update embeddings after creation

If your pipelines depend on RAG or long-term state, consistency is required - most teams realize this only when debugging unpredictable retrieval.