Vettore

Vettore is a small vector toolkit for Elixir that keeps your data in ETS and uses Rust only where it helps: distance kernels, normalization, HNSW search, and MUVERA-style encodings.

Earlier versions leaned toward a Rust-owned in-memory database. That was fast, but it made the library feel less like an Elixir tool and more like an external engine with Elixir bindings. Vettore now chooses ETS as the canonical store on purpose:

records are visible and easy to inspect from Elixir
supervision, snapshots, and ownership stay simple
metadata and application values live beside vectors naturally
native indexes can be rebuilt from canonical ETS state
the public API stays small, predictable, and BEAM-friendly

The important idea is simple:

Elixir owns the records.
ETS is the source of truth.
Rust accelerates the expensive parts.
Search results say clearly what is a score and what is a distance.

That choice is not the absolute fastest possible architecture. A fully Rust-owned vector database can beat ETS for large exact scans, but Vettore optimizes for a different kind of usefulness: simple integration with ordinary Elixir systems, with Rust kept as acceleration rather than ownership.

What You Get

ETS-backed collections
exact flat search
native HNSW approximate search
Matryoshka-style funnel search
binary quantized candidate search
hybrid candidate pipelines with exact or multi-vector reranking
ColBERT-style late interaction over multi-vector records
MUVERA-style fixed-dimensional encodings
named distance, similarity, normalization, and MMR helpers
a top-level Vettore.* API, plus compatibility wrappers for the older Vettore.new/0 database-style API

Installation

def deps do
  [
    {:vettore, "~> 0.3.1"}
  ]
end

Quick Start

Create a collection, insert a few records, and search:

{:ok, collection} =
  Vettore.new(
    name: :documents,
    dimensions: 3,
    index: :flat,
    metric: :cosine,
    normalize: :l2
  )

:ok =
  Vettore.put_many(collection, [
    %{id: "east", vector: [1.0, 0.0, 0.0], metadata: %{kind: :axis}},
    %{id: "north", vector: [0.0, 1.0, 0.0]},
    %{id: "west", vector: [-1.0, 0.0, 0.0]}
  ])

{:ok, results} =
  Vettore.search(collection, [1.0, 0.0, 0.0], limit: 2)

Results are %Vettore.Result{} structs:

%Vettore.Result{
  id: "east",
  value: "east",
  score: 1.0,
  distance: 0.0,
  metric: :cosine,
  metadata: %{kind: :axis}
}

Public API

New code can stay under the top-level Vettore module:

Vettore.new(opts)
Vettore.put(collection, embedding)
Vettore.put_many(collection, embeddings)
Vettore.get(collection, id)
Vettore.delete(collection, id)
Vettore.all(collection)
Vettore.search(collection, query, opts)
Vettore.funnel_search(collection, query, opts)
Vettore.quantized_search(collection, query, opts)
Vettore.multi_vector_search(collection, query_vectors, opts)
Vettore.hybrid_search(collection, query, opts)
Vettore.snapshot(collection, path)
Vettore.load_snapshot(path, opts)

Vettore.new/1 creates a collection. Vettore.new/0 still creates the older compatibility database.

Choosing A Search Path

Start with the simplest thing that matches your job.

Use this	When
`search/3` with `index: :flat`	Small data, tests, correctness baselines, exact results
`search/3` with `index: :hnsw`	Fast approximate search over larger collections
`funnel_search/3`	Matryoshka embeddings where early dimensions are meaningful
`quantized_search/3`	Cheap sign-bit candidate search before exact reranking
`multi_vector_search/3`	ColBERT-style late interaction over token/page vectors
`hybrid_search/3`	Combine candidate generators, then rerank once

The standalone helpers are nice while exploring. For production-style retrieval, hybrid_search/3 is usually the most ergonomic surface.

Exact Search

Flat search keeps ids and vectors in a Rust resource and scores the whole exact scan in one native call. ETS remains the canonical store for values, metadata, snapshots, and usability.

{:ok, collection} =
  Vettore.new(
    name: :exact_vectors,
    dimensions: 384,
    index: :flat,
    metric: :cosine,
    normalize: :l2
  )

{:ok, results} =
  Vettore.search(collection, query_vector, limit: 10)

This path is intentionally boring. It is great for small collections, local caches, classifier centroids, deterministic tests, and recall baselines.

HNSW Search

HNSW keeps a native graph beside the ETS store. ETS remains canonical; the graph is an acceleration structure.

{:ok, collection} =
  Vettore.new(
    name: :ann_vectors,
    dimensions: 768,
    index: :hnsw,
    index_options: [
      m: 16,
      m0: 32,
      ef_construction: 100,
      ef_search: 64,
      max_level: 12
    ],
    metric: :cosine,
    normalize: :l2
  )

:ok = Vettore.put(collection, %{id: "doc-1", vector: embedding})

{:ok, results} =
  Vettore.search(collection, query_vector, limit: 10)

Supported HNSW metrics:

:l2
:cosine
:inner_product

Adaptive Candidate Search

These helpers first find a candidate set, then rerank with full stored vectors. They are useful when you want to make the first pass cheaper without changing the canonical store.

Matryoshka Funnel

Funnel search scores progressively larger vector prefixes. It works best with models trained for Matryoshka or nested embeddings.

{:ok, results} =
  Vettore.funnel_search(collection, query_vector,
    stages: [128, 256, 384],
    candidates: 200,
    limit: 10
  )

Binary Quantized Candidates

Quantized search uses stored sign bits for a cheap Hamming-distance first pass, then reranks with the collection metric.

{:ok, results} =
  Vettore.quantized_search(collection, query_vector,
    candidates: 200,
    limit: 10
  )

Vettore generates binary_vector at insert time:

{:ok, embedding} = Vettore.get(collection, "doc-1")
embedding.binary_vector
# [1, 0, 1, ...]

Hybrid Search

hybrid_search/3 lets you combine candidate generators, union their ids, fetch the canonical records from ETS, and rerank once.

{:ok, results} =
  Vettore.hybrid_search(collection, query_vector,
    generators: [
      funnel: [stages: [128, 384], candidates: 200],
      quantized: [candidates: 200]
    ],
    rerank: :exact,
    limit: 10
  )

For HNSW collections, add :hnsw as a generator:

{:ok, results} =
  Vettore.hybrid_search(collection, query_vector,
    generators: [
      hnsw: [candidates: 100],
      quantized: [candidates: 200]
    ],
    rerank: :exact,
    limit: 10
  )

The same pipeline can rerank with late interaction:

{:ok, results} =
  Vettore.hybrid_search(collection, query_vector,
    generators: [quantized: [candidates: 200]],
    rerank: {:multi_vector, query_vectors},
    limit: 10
  )

That is the general pattern:

Generate cheap candidates.
Merge them by id.
Rerank with the expensive scorer you actually care about.

Multi-Vector Search

Multi-vector search is for ColBERT-style retrieval: each record can hold many vectors, usually token vectors or page-patch vectors. A query also has many vectors. For each query vector, Vettore finds the best matching document vector and sums those best scores.

:ok =
  Vettore.put(collection, %Vettore.Embedding{
    id: "page-1",
    vectors: [
      [1.0, 0.0],
      [0.0, 1.0]
    ],
    metadata: %{source: "manual"}
  })

{:ok, results} =
  Vettore.multi_vector_search(
    collection,
    [[1.0, 0.0], [0.0, 1.0]],
    metric: :inner_product,
    limit: 10
  )

The lower-level scoring helper is available too:

Vettore.MultiVector.colbert_score(
  [[1.0, 0.0], [0.0, 1.0]],
  [[1.0, 0.0], [1.0, 1.0]],
  metric: :inner_product
)
# {:ok, 2.0}

Vettore.MultiVector.chamfer/3 is the same MaxSim-style operation under a more general name.

MUVERA-Style Encodings

MUVERA reduces multi-vector retrieval to fixed-dimensional vectors. The intended flow is:

Encode query multi-vectors into a fixed-dimensional query vector.
Encode document multi-vectors into fixed-dimensional document vectors.
Search those vectors with inner product.
Rerank candidates with exact MaxSim/Chamfer.

vectors = [
  [1.0, 0.0],
  [0.0, 1.0]
]

config = [
  num_repetitions: 1,
  num_simhash_projections: 4,
  seed: 42,
  projection_dimension: 2
]

{:ok, query_fde} = Vettore.Encoding.Muvera.encode_query(vectors, config)
{:ok, doc_fde} = Vettore.Encoding.Muvera.encode_document(vectors, config)

Config options:

:dimension - inferred from vectors by default
:num_repetitions - defaults to 1
:num_simhash_projections - defaults to 0
:seed - defaults to 1
:projection_dimension - defaults to input dimension
:final_projection_dimension - optional count-sketch compression size

Records And Storage

Records are %Vettore.Embedding{} structs or maps with equivalent keys.

%Vettore.Embedding{
  id: "doc-1",
  value: "optional external value",
  vector: [0.1, 0.2, 0.3],
  vectors: [[0.1, 0.2, 0.3], [0.0, 0.5, 0.5]],
  binary_vector: [1, 1, 1],
  metadata: %{source: "local"}
}

Useful details:

id is the preferred unique identifier.
If id is missing, a non-empty string value can be used as the id.
Duplicate ids are rejected.
Duplicate vectors are allowed.
Vectors are normalized at insertion according to the collection config.
If vectors is present but vector is omitted, Vettore stores an averaged representative vector for ordinary search/indexing.
binary_vector is generated automatically for quantized candidate search.

ETS collections can be snapshotted:

:ok = Vettore.snapshot(collection, "priv/snapshots/docs.ets")

{:ok, loaded} =
  Vettore.load_snapshot("priv/snapshots/docs.ets")

Snapshots store the ETS table: records, metadata, normalized vectors, binary vectors, multi-vectors, and collection config. Native indexes are rebuilt from ETS when loaded.

You can load the same data with a different index:

{:ok, loaded} =
  Vettore.load_snapshot("priv/snapshots/docs.ets", index: :hnsw)

ETS compression is available when you want to trade CPU for memory:

{:ok, collection} =
  Vettore.new(
    name: :compressed_documents,
    dimensions: 384,
    metric: :cosine,
    normalize: :l2,
    compressed: true
  )

Metrics And Scoring

Collection metrics:

:l2
:l2_squared
:cosine
:inner_product
:negative_inner_product
:manhattan
:chebyshev
:hamming
:jaccard

Aliases accepted by Vettore.new/1:

:euclidean -> :l2
:dot -> :inner_product
:dot_product -> :inner_product

with Vettore.Distance you can use directly all distance functions:

Vettore.Distance.l2([0.0, 0.0], [3.0, 4.0])
# {:ok, 5.0}

Vettore.Distance.cosine([1.0, 0.0], [0.0, 1.0])
# {:ok, 0.0}

Vettore.Distance.inner_product([1.0, 2.0], [3.0, 4.0])
# {:ok, 11.0}

Normalization

Supported normalization modes:

:none
:l2
:zscore
:minmax

Vettore.Distance.normalize([3.0, 4.0], :l2)
# {:ok, [0.6, 0.8]}

Collection defaults:

metric: :cosine defaults to normalize: :l2
all other metrics default to normalize: :none

Inserted vectors and query vectors are prepared with the same collection normalization mode.

Other Helpers

MMR reranking:

initial = [{"a", 0.9}, {"b", 0.8}, {"c", 0.1}]
embeddings = [{"a", [1.0, 0.0]}, {"b", [1.0, 0.0]}, {"c", [0.0, 1.0]}]

Vettore.Distance.mmr_rerank(initial, embeddings, :cosine, 0.5, 2)
# {:ok, [{"a", 0.9}, {"c", 0.1}]}

Sign compression:

Vettore.Distance.compress_f32_vector([1.0, -2.0, 0.0])
# [5]

left = Vettore.Distance.compress_f32_vector([1.0, -2.0, 0.0])
right = Vettore.Distance.compress_f32_vector([-1.0, -2.0, 0.0])
Vettore.Distance.packed_hamming(left, right, 3)
# {:ok, 1.0}

Compatibility API

The old top-level API still exists as a small compatibility layer backed by ETS collections:

db = Vettore.new()

{:ok, "legacy"} =
  Vettore.create_collection(db, "legacy", 2, :cosine)

{:ok, "a"} =
  Vettore.insert(db, "legacy", %Vettore.Embedding{
    value: "a",
    vector: [1.0, 0.0]
  })

{:ok, results} =
  Vettore.similarity_search(db, "legacy", [1.0, 0.0], limit: 1)

New code should prefer the collection-style top-level API: Vettore.new/1, Vettore.put/2, and Vettore.search/3.

Development

The tests include a real ex_fastembed integration with BAAI/bge-small-en-v1.5 over a small phrase corpus.

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.github/workflows		.github/workflows
bench		bench
lib		lib
native/vettore		native/vettore
test		test
.formatter.exs		.formatter.exs
.gitignore		.gitignore
.tool-versions		.tool-versions
LICENSE		LICENSE
README.md		README.md
Taskfile.yml		Taskfile.yml
checksum-Elixir.Vettore.Nifs.exs		checksum-Elixir.Vettore.Nifs.exs
mix.exs		mix.exs
mix.lock		mix.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vettore

What You Get

Installation

Quick Start

Public API

Choosing A Search Path

Exact Search

HNSW Search

Adaptive Candidate Search

Matryoshka Funnel

Binary Quantized Candidates

Hybrid Search

Multi-Vector Search

MUVERA-Style Encodings

Records And Storage

Metrics And Scoring

Normalization

Other Helpers

Compatibility API

Development

About

Uh oh!

Releases 18

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vettore

What You Get

Installation

Quick Start

Public API

Choosing A Search Path

Exact Search

HNSW Search

Adaptive Candidate Search

Matryoshka Funnel

Binary Quantized Candidates

Hybrid Search

Multi-Vector Search

MUVERA-Style Encodings

Records And Storage

Metrics And Scoring

Normalization

Other Helpers

Compatibility API

Development

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 18

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages