ragline

A retrieval-augmented generation pipeline with a rigorous offline evaluation harness.

ragline ingests documents, splits them into overlapping chunks, embeds them into a vector store, retrieves the most relevant chunks for a question by cosine similarity, and generates a grounded answer. Its defining feature is an offline evaluation harness that measures retrieval quality and answer faithfulness against a labeled dataset — so you can know whether the system actually works, not just that it runs.

The Problem

RAG systems are easy to assemble and hard to trust. Retrieval can return irrelevant context; generation can ignore the context it was given. ragline treats evaluation as a first-class concern, reporting precision@k, recall@k, MRR, and a faithfulness check so quality is measured, not assumed.

No API key required

Embedding and generation sit behind provider interfaces. The default providers are deterministic and local — a hash-based embedder and a template generator — so the entire pipeline, its tests, and the evaluation harness run with no API key and no network access. A real provider (e.g. OpenAI) is available as an optional dependency for production use.

Architecture

src/ragline/
  document.py        documents and chunks
  chunking.py        split documents into overlapping chunks
  providers/         Embedder + Generator interfaces; local (real) + openai (optional)
  vector_store.py    NumPy cosine-similarity store with top-k retrieval
  pipeline.py        chunk -> embed -> store -> retrieve -> generate
  evaluation/        retrieval/faithfulness metrics + the eval harness
  cli.py             ingest / query / eval

See docs/architecture.md and docs/evaluation.md.

Requirements

Python 3.10+
NumPy (installed automatically)

Setup

pip install -e ".[dev]"

Use

ragline ingest data/corpus
ragline query "How do solar panels generate electricity?"
ragline eval data/eval/qa.jsonl

Develop

ruff check .
black --check .
mypy
pytest

License

MIT — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.devcontainer		.devcontainer
.github		.github
data		data
docs		docs
scripts		scripts
src/ragline		src/ragline
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ragline

The Problem

No API key required

Architecture

Requirements

Setup

Use

Develop

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ragline

The Problem

No API key required

Architecture

Requirements

Setup

Use

Develop

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages