Smriti

Durable, queryable long-term memory for AI agents.

Smriti (Sanskrit: memory) is a backend service that gives AI agents and applications persistent memory. It stores working, episodic, and semantic memories; generates embeddings and importance scores; consolidates duplicates; summarizes user history; and serves low-latency context for retrieval-augmented generation (RAG).

Features

Three memory tiers — working (Redis, session-scoped), episodic (discrete events), and semantic (durable facts with vector embeddings)
Synchronous RAG retrieval — POST /memories/context returns ranked, cache-aware context without touching Kafka
Async enrichment pipeline — Kafka workers handle embedding, importance scoring, summarization, consolidation, and user profiling off the request path
Vector search — Postgres + pgvector for similarity search in a single datastore
Production-oriented design — thin deployable apps, rich domain libraries, enforced Nx module boundaries, idempotent consumers with retries and DLQs
Observability — OpenTelemetry traces, Prometheus metrics, and Grafana dashboards

Architecture

flowchart LR
  client[Client / Agent] --> api[API Service]

  api --> postgres[(Postgres + pgvector)]
  api --> redis[(Redis)]
  api --> kafka[Kafka]

  api --> retrievalCore[retrieval-core]
  retrievalCore --> embeddingLib[embedding]
  retrievalCore --> rankingLib[ranking]
  retrievalCore --> postgres
  retrievalCore --> redis

  kafka --> embeddingWorker[embedding-worker]
  kafka --> importanceWorker[importance-worker]
  kafka --> summarizerWorker[summarizer-worker]
  kafka --> consolidationWorker[consolidation-worker]
  kafka --> profileWorker[profile-worker]

  scheduler[scheduler] --> kafka

  embeddingWorker --> postgres
  importanceWorker --> postgres
  summarizerWorker --> postgres
  consolidationWorker --> postgres
  profileWorker --> postgres

Write path: the client creates a memory → the API persists it and publishes a memory-created event to Kafka → workers enrich the memory asynchronously.
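
The worker side of the write path relies on idempotent consumers with retries and a dead-letter queue (DLQ). The following is a minimal sketch of that pattern, not the code in libs/kafka: every name is hypothetical, the handler is synchronous for brevity, and an in-memory Set stands in for whatever durable store tracks processed events.

```typescript
// All names are hypothetical; they are not taken from the Smriti codebase.
interface MemoryCreatedEvent {
  eventId: string; // unique per event, used as the idempotency key
  version: 1; // contract version, so consumers can reject unknown shapes
  memoryId: string;
}

type Outcome = "processed" | "duplicate" | "dead-lettered";

function createIdempotentConsumer(
  handler: (event: MemoryCreatedEvent) => void,
  deadLetter: (event: MemoryCreatedEvent, error: unknown) => void,
  maxAttempts: number
): (event: MemoryCreatedEvent) => Outcome {
  // In-memory stand-in for a durable record of processed event IDs.
  const processed = new Set<string>();

  return (event: MemoryCreatedEvent): Outcome => {
    // A redelivered event that already succeeded is skipped.
    if (processed.has(event.eventId)) return "duplicate";

    let lastError: unknown;
    for (let attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        handler(event);
        processed.add(event.eventId); // mark done only after success
        return "processed";
      } catch (error) {
        lastError = error; // keep the error and try again
      }
    }

    // Attempts exhausted: park the event rather than retrying forever.
    deadLetter(event, lastError);
    return "dead-lettered";
  };
}
```

Recording the event ID only after the handler succeeds means a crash mid-handler leads to a retry rather than a silently dropped event, which is why the handler itself should also be safe to run twice.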

Read path: client queries context → API runs the retrieval pipeline synchronously (cache → embed → search → rank → build).
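
The five read-path stages can be illustrated with a small in-memory sketch. It is an illustration under assumptions, not the retrieval-core implementation: the type names, the 0.8/0.2 score weights, the 4x over-fetch, and the cache key format are all invented here, a Map stands in for Redis, an array scan stands in for the pgvector query, and everything is synchronous where the real stages would do I/O.

```typescript
// Hypothetical shapes and weights; none of these names come from the Smriti codebase.
interface StoredMemory {
  id: string;
  content: string;
  embedding: number[];
  importance: number; // assumed to lie in [0, 1]
}

interface RetrievalDeps {
  cache: Map<string, string>; // stand-in for the Redis context cache
  embed: (text: string) => number[]; // stand-in for an embedding provider
  memories: StoredMemory[]; // stand-in for rows searched via pgvector
}

function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("vector dimensions differ");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return normA === 0 || normB === 0 ? 0 : dot / Math.sqrt(normA * normB);
}

function retrieveContext(deps: RetrievalDeps, userId: string, query: string, limit: number): string {
  // 1. cache: return a previously built context for the same request
  const cacheKey = `${userId}:${limit}:${query}`;
  const cached = deps.cache.get(cacheKey);
  if (cached !== undefined) return cached;

  // 2. embed: turn the query into a vector
  const queryVector = deps.embed(query);

  // 3. search: nearest neighbours by similarity, over-fetching candidates
  const candidates = deps.memories
    .map((memory) => ({ memory, similarity: cosineSimilarity(queryVector, memory.embedding) }))
    .sort((x, y) => y.similarity - x.similarity)
    .slice(0, limit * 4);

  // 4. rank: blend similarity with importance (weights are assumptions)
  const ranked = candidates
    .map((c) => ({ content: c.memory.content, score: 0.8 * c.similarity + 0.2 * c.memory.importance }))
    .sort((x, y) => y.score - x.score)
    .slice(0, limit);

  // 5. build: assemble the context text and cache it
  const context = ranked.map((r) => `- ${r.content}`).join("\n");
  deps.cache.set(cacheKey, context);
  return context;
}
```

Keeping the ranking step a pure function of similarity and importance makes it easy to unit test without a database, which matches the role described for libs/ranking.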

See docs/architecture/ai-memory-service-architecture.md for the full design.

Tech stack

Layer	Choice
Monorepo	Nx + pnpm
Language	TypeScript (strict)
HTTP	NestJS + Fastify
SQL	Kysely + Postgres + pgvector
Cache	Redis
Messaging	Kafka
Telemetry	OpenTelemetry, Prometheus, Grafana

Project structure

smriti/
├── apps/
│   ├── api/                    # HTTP API — create, list, delete, retrieve context
│   ├── embedding-worker/       # Generate and persist embeddings
│   ├── importance-worker/      # Score memory importance
│   ├── summarizer-worker/      # Rolling user history summaries
│   ├── consolidation-worker/   # Merge near-duplicate memories
│   ├── profile-worker/         # Structured user profiles
│   └── scheduler/              # Periodic jobs (decay, cleanup, summarize)
├── libs/
│   ├── memory-core/            # Domain entities and use cases
│   ├── retrieval-core/         # Retrieval orchestration pipeline
│   ├── ranking/                # Pure ranking/scoring functions
│   ├── embedding/              # EmbeddingProvider abstraction
│   ├── postgres/               # Kysely repositories and migrations
│   ├── redis/                  # Working memory and context cache
│   ├── kafka/                  # Producer, consumer runtime, retry/DLQ
│   ├── events/                 # Versioned event contracts
│   ├── auth/                   # Principal resolution
│   ├── observability/          # Logger, metrics, tracing
│   ├── config/                 # Validated environment configuration
│   ├── shared-types/           # Shared DTOs
│   └── testing/                # Fixtures and test harnesses
├── infra/docker/               # Docker Compose for local development
└── docs/
    ├── architecture/           # System design documents
    └── local-development-runbook.md

Prerequisites

Tool	Version
Node.js	>= 20
pnpm	11.7.0
Docker Desktop	Running

On Windows, Git Bash or WSL is recommended for loading .env.

Quick start

1. Install and configure

pnpm install
cp .env.example .env

The default .env targets local Docker services. Postgres listens on host port 55432 (not 5432) to avoid conflicts with a local Postgres install.

2. Start infrastructure

pnpm infra:up

Wait ~10 seconds for Kafka to become ready, then confirm containers are healthy:

docker compose -f infra/docker/docker-compose.yml ps

Service	Host port	Purpose
Postgres (pgvector)	55432	Primary datastore + vectors
Redis	6379	Working memory + context cache
Kafka	9092	Event bus for workers
Prometheus	9090	Metrics
Grafana	3001	Dashboards (`admin` / `admin`)
OTel Collector	4317, 4318	Traces and metrics export

3. Migrate and seed a test user

# Git Bash / WSL / macOS / Linux
set -a && source .env && set +a && pnpm db:migrate

# PowerShell
Get-Content .env | ForEach-Object {
  if ($_ -match '^\s*([^#][^=]+)=(.*)$') { Set-Item -Path "env:$($matches[1])" -Value $matches[2] }
}
pnpm db:migrate

Memories reference users.id. Seed a dev user once:

docker exec smriti-postgres-1 psql -U smriti -d smriti -c \
  "INSERT INTO users (id, name) VALUES ('22222222-2222-2222-2222-222222222222', 'Local Dev') ON CONFLICT DO NOTHING;"

4. Start the API and workers

In separate terminals (load .env in each):

set -a && source .env && set +a && pnpm dev:api

set -a && source .env && set +a && pnpm dev:workers

5. Verify

curl http://localhost:3000/health/live
curl http://localhost:3000/health/ready

Expected:

{"status":"ok"}
{"status":"ok","dependencies":{"postgres":true,"redis":true}}
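
For scripted checks, the readiness payload can be interpreted with a few lines of TypeScript. This is a sketch under an assumption: the healthy shape is taken from the expected output above, but the source does not show the unhealthy response, so the code assumes a failing dependency is reported as `false` under `dependencies`. The helper names are hypothetical.

```typescript
// Shape taken from the expected /health/ready output; helper names are hypothetical.
interface ReadinessBody {
  status: string;
  dependencies: Record<string, boolean>;
}

// Names of dependencies reported as not ready (assumes unhealthy ones are `false`).
function failingDependencies(body: ReadinessBody): string[] {
  return Object.keys(body.dependencies).filter((name) => !body.dependencies[name]);
}

// Ready only when the status is "ok" and no dependency is failing.
function isReady(body: ReadinessBody): boolean {
  return body.status === "ok" && failingDependencies(body).length === 0;
}
```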

For a full walkthrough, smoke tests, and troubleshooting, see docs/local-development-runbook.md.

API

All authenticated endpoints require the x-user-id header (UUID). API key checks via x-api-key are optional in development.

Method	Path	Description
`POST`	`/memories`	Create a memory (returns `202`)
`POST`	`/memories/context`	Retrieve ranked RAG context for a query
`GET`	`/users/:id/memories`	List memories for a user
`DELETE`	`/memories/:id`	Delete a memory
`GET`	`/health/live`	Liveness probe
`GET`	`/health/ready`	Readiness probe (Postgres, Redis)
`GET`	`/metrics`	Prometheus metrics

Create a memory

curl -X POST http://localhost:3000/memories \
  -H "Content-Type: application/json" \
  -H "x-user-id: 22222222-2222-2222-2222-222222222222" \
  -d '{
    "type": "semantic",
    "content": "I am a backend engineer learning Kafka"
  }'

Memory types: working, episodic, semantic.

Retrieve context

curl -X POST http://localhost:3000/memories/context \
  -H "Content-Type: application/json" \
  -H "x-user-id: 22222222-2222-2222-2222-222222222222" \
  -d '{
    "query": "What is the user learning?",
    "limit": 5
  }'

Embeddings are generated asynchronously, so allow a few seconds after creating a memory for the embedding worker to process it before querying context.

List memories

curl "http://localhost:3000/users/22222222-2222-2222-2222-222222222222/memories" \
  -H "x-user-id: 22222222-2222-2222-2222-222222222222"
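
The three curl calls above can also be expressed as typed request builders for use from TypeScript. This is a minimal sketch: the paths, headers, and body fields mirror the curl examples, while the function names, the `ApiRequest` shape, and the client-side check on `limit` are assumptions made for illustration.

```typescript
// Helper names are hypothetical; paths, headers, and fields follow the curl examples.
type MemoryType = "working" | "episodic" | "semantic";

interface ApiRequest {
  url: string;
  method: "GET" | "POST";
  headers: Record<string, string>;
  body?: string;
}

function createMemoryRequest(baseUrl: string, userId: string, type: MemoryType, content: string): ApiRequest {
  return {
    url: `${baseUrl}/memories`,
    method: "POST",
    headers: { "Content-Type": "application/json", "x-user-id": userId },
    body: JSON.stringify({ type, content }),
  };
}

function contextRequest(baseUrl: string, userId: string, query: string, limit: number): ApiRequest {
  // Client-side sanity check only; the server's own validation rules are not documented here.
  if (!Number.isInteger(limit) || limit < 1) {
    throw new RangeError("limit must be a positive integer");
  }
  return {
    url: `${baseUrl}/memories/context`,
    method: "POST",
    headers: { "Content-Type": "application/json", "x-user-id": userId },
    body: JSON.stringify({ query, limit }),
  };
}

function listMemoriesRequest(baseUrl: string, userId: string): ApiRequest {
  return {
    url: `${baseUrl}/users/${encodeURIComponent(userId)}/memories`,
    method: "GET",
    headers: { "x-user-id": userId },
  };
}
```

Each object can be sent with the global `fetch` that ships with Node.js 18 and later, for example `fetch(request.url, request)`. Building requests separately from sending them keeps the helpers testable without a running API.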

Environment variables

Variable	Default	Description
`NODE_ENV`	`development`	Runtime environment
`HTTP_HOST`	`0.0.0.0`	API bind address
`HTTP_PORT`	`3000`	API port
`POSTGRES_URL`	—	Postgres connection string
`POSTGRES_POOL_SIZE`	`10`	Connection pool size
`REDIS_URL`	—	Redis connection string
`KAFKA_BROKERS`	—	Comma-separated broker list
`KAFKA_CLIENT_ID`	`smriti`	Kafka client ID
`KAFKA_GROUP_ID`	`smriti-workers`	Consumer group ID
`EMBEDDING_PROVIDER`	`mock`	`mock` or `openai`
`EMBEDDING_MODEL`	`text-embedding-3-small`	OpenAI embedding model
`EMBEDDING_DIMENSIONS`	`1536`	Vector dimensions
`OPENAI_API_KEY`	—	Required when `EMBEDDING_PROVIDER=openai`
`OTEL_EXPORTER_OTLP_ENDPOINT`	`http://localhost:4318`	OTLP exporter URL
`OTEL_SERVICE_NAME`	`smriti`	Service name for telemetry

Copy .env.example as a starting point.
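
To show how the table translates into validated configuration, here is a minimal loader sketch. It is not the code in libs/config, which may well use a schema library: the `SmritiConfig` shape and `loadConfig` name are hypothetical, only a subset of the variables is covered, and the defaults and the OPENAI_API_KEY rule are copied from the table above.

```typescript
// Hypothetical config shape; defaults are copied from the environment variable table.
interface SmritiConfig {
  nodeEnv: string;
  httpHost: string;
  httpPort: number;
  postgresUrl: string;
  postgresPoolSize: number;
  redisUrl: string;
  kafkaBrokers: string[];
  embeddingProvider: "mock" | "openai";
  embeddingDimensions: number;
  openaiApiKey: string | undefined;
}

function loadConfig(env: Record<string, string | undefined>): SmritiConfig {
  // Variables without a default must be present and non-empty.
  const required = (name: string): string => {
    const value = env[name];
    if (!value) throw new Error(`Missing required environment variable: ${name}`);
    return value;
  };
  // Numeric variables fall back to the documented default when unset.
  const positiveInt = (name: string, fallback: number): number => {
    const raw = env[name];
    if (raw === undefined || raw === "") return fallback;
    const parsed = Number(raw);
    if (!Number.isInteger(parsed) || parsed <= 0) {
      throw new Error(`${name} must be a positive integer`);
    }
    return parsed;
  };

  const provider = env["EMBEDDING_PROVIDER"] || "mock";
  if (provider !== "mock" && provider !== "openai") {
    throw new Error("EMBEDDING_PROVIDER must be 'mock' or 'openai'");
  }
  const openaiApiKey = env["OPENAI_API_KEY"];
  if (provider === "openai" && !openaiApiKey) {
    throw new Error("OPENAI_API_KEY is required when EMBEDDING_PROVIDER=openai");
  }

  return {
    nodeEnv: env["NODE_ENV"] || "development",
    httpHost: env["HTTP_HOST"] || "0.0.0.0",
    httpPort: positiveInt("HTTP_PORT", 3000),
    postgresUrl: required("POSTGRES_URL"),
    postgresPoolSize: positiveInt("POSTGRES_POOL_SIZE", 10),
    redisUrl: required("REDIS_URL"),
    // Comma-separated list, tolerant of spaces around the commas.
    kafkaBrokers: required("KAFKA_BROKERS").split(",").map((b) => b.trim()).filter((b) => b.length > 0),
    embeddingProvider: provider,
    embeddingDimensions: positiveInt("EMBEDDING_DIMENSIONS", 1536),
    openaiApiKey,
  };
}
```

Failing fast at startup on a missing or malformed variable is usually easier to debug than a connection error surfacing later in a worker.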

Development

Command	Description
`pnpm dev:api`	Start API with hot reload
`pnpm dev:workers`	Start all workers and scheduler
`pnpm infra:up`	Start Docker infrastructure
`pnpm infra:down`	Stop Docker infrastructure
`pnpm db:migrate`	Apply pending SQL migrations
`pnpm build`	Build all apps
`pnpm typecheck`	Typecheck all projects
`pnpm lint`	Lint all projects
`pnpm test`	Run unit tests
`pnpm graph`	Open Nx dependency graph

Individual workers can be started with pnpm dev:embedding-worker, pnpm dev:importance-worker, and so on.

Reset local data

docker compose -f infra/docker/docker-compose.yml down -v
pnpm infra:up
set -a && source .env && set +a && pnpm db:migrate
# Re-seed the test user: down -v deletes the Postgres volume, so repeat the INSERT from step 3

Documentation

Document	Contents
Local development runbook	Step-by-step setup, smoke tests, troubleshooting
System architecture	Topology, design principles, component map
Database design	Schemas and storage model
Retrieval pipeline	Context retrieval flow
Event-driven design	Kafka topics, workers, idempotency
Observability	Metrics, traces, health checks
Development roadmap	Phased delivery plan

License

MIT
