Docker Compose stack for scalable TEI embeddings (multi-GPU) fronted by a FastAPI proxy with a Qdrant cache. 🐳⛓️💾
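A stack like this can be sketched in Compose: one TEI container per GPU, a proxy in front, and Qdrant as the cache store. This is a minimal illustration, not the project's actual file; the image tags, model name, and port mappings are assumptions.

```yaml
# Hypothetical sketch of a multi-GPU TEI deployment with a proxy and Qdrant.
services:
  tei-gpu0:
    image: ghcr.io/huggingface/text-embeddings-inference:latest  # tag is illustrative
    command: --model-id BAAI/bge-small-en-v1.5                   # model is illustrative
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              device_ids: ["0"]        # pin this replica to GPU 0
              capabilities: [gpu]
  tei-gpu1:
    image: ghcr.io/huggingface/text-embeddings-inference:latest
    command: --model-id BAAI/bge-small-en-v1.5
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              device_ids: ["1"]        # pin this replica to GPU 1
              capabilities: [gpu]
  qdrant:
    image: qdrant/qdrant:latest        # vector store used as an embedding cache
  proxy:
    build: ./proxy                     # hypothetical FastAPI proxy service
    ports:
      - "8080:8080"
    depends_on: [tei-gpu0, tei-gpu1, qdrant]
```

The `device_ids` reservations are what split the replicas across GPUs; the proxy would round-robin between `tei-gpu0` and `tei-gpu1` and check Qdrant before forwarding a request.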
An unofficial Python wrapper library for Text Embeddings Inference (TEI), for batch processing with asyncio.
Self-hosted text embeddings server powered by Hugging Face TEI, with an OpenAI-compatible API. Supports BGE, Nomic, MiniLM and other models. Features optional API key auth, offline/air-gapped mode, and persistent model cache.
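A client for an OpenAI-compatible server like this just needs to build a standard `/v1/embeddings` request. The sketch below constructs one; the base URL, API key, and model name are placeholders, not values from the project.

```python
import json


def build_embedding_request(base_url, api_key, model, texts):
    """Construct an OpenAI-style /v1/embeddings request.

    Sketch only: base_url, api_key, and model are hypothetical values;
    the body follows the OpenAI embeddings request schema.
    """
    return {
        "url": f"{base_url}/v1/embeddings",
        "headers": {
            "Content-Type": "application/json",
            # Bearer auth matches the optional API-key feature described above.
            "Authorization": f"Bearer {api_key}",
        },
        "body": json.dumps({"model": model, "input": texts}),
    }


req = build_embedding_request(
    "http://localhost:8080",        # hypothetical self-hosted endpoint
    "sk-local-demo",                # hypothetical API key
    "BAAI/bge-small-en-v1.5",       # one of the supported BGE models
    ["hello world"],
)
print(req["url"])  # http://localhost:8080/v1/embeddings
```

Because the request shape matches OpenAI's, existing OpenAI SDKs can usually be pointed at such a server by overriding only the base URL and key.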
Repository of the project Efficient Edge Embeddings (E*3) project, subgrant 2dAI2OC07 from EU Horizon dAIEDGE
It automatically batches inference requests from independent users into a single batch for efficiency: each user sees the interface of an individual request, while internally the requests are handled as one batch.
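That kind of transparent micro-batching can be sketched with asyncio: each caller awaits its own future, while a shared queue is flushed as one batch when it fills up or a short timer expires. The class and function names below are hypothetical, and `fake_embed_batch` stands in for a real TEI batch call.

```python
import asyncio


class MicroBatcher:
    """Collects individual embed() calls and flushes them as one batch.

    Hypothetical sketch: `embed_batch` is any coroutine that maps a list
    of texts to a list of embeddings (e.g. one TEI request).
    """

    def __init__(self, embed_batch, max_batch_size=32, max_wait=0.01):
        self._embed_batch = embed_batch
        self._max_batch_size = max_batch_size
        self._max_wait = max_wait          # seconds to wait for more requests
        self._queue = []                   # pending (text, future) pairs
        self._flush_task = None

    async def embed(self, text):
        # Each caller gets its own future, so the API looks per-request.
        fut = asyncio.get_running_loop().create_future()
        self._queue.append((text, fut))
        if len(self._queue) >= self._max_batch_size:
            await self._flush()            # batch is full: flush immediately
        elif self._flush_task is None:
            # First item in a new batch: start the flush timer.
            self._flush_task = asyncio.create_task(self._delayed_flush())
        return await fut

    async def _delayed_flush(self):
        await asyncio.sleep(self._max_wait)
        await self._flush()

    async def _flush(self):
        batch, self._queue = self._queue, []
        task, self._flush_task = self._flush_task, None
        if task is not None and task is not asyncio.current_task():
            task.cancel()                  # size-triggered flush beat the timer
        if not batch:
            return
        embeddings = await self._embed_batch([t for t, _ in batch])
        for (_, fut), emb in zip(batch, embeddings):
            fut.set_result(emb)            # hand each result back to its caller


async def fake_embed_batch(texts):
    # Stand-in "model": embeds each text as its length.
    return [[float(len(t))] for t in texts]


async def main():
    batcher = MicroBatcher(fake_embed_batch, max_batch_size=4, max_wait=0.005)
    # Three independent "users" issue requests concurrently; they are
    # served by a single fake_embed_batch call.
    return await asyncio.gather(*(batcher.embed(t) for t in ["a", "bb", "ccc"]))


results = asyncio.run(main())
print(results)  # [[1.0], [2.0], [3.0]]
```

The trade-off is the usual one for dynamic batching: `max_wait` bounds the extra latency any single request pays in exchange for better GPU utilization.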
An enterprise-grade RAG Chatbot built with AI-assisted development. Features local LLMs (Ollama), LlamaIndex integration, and Entra ID SSO. Designed for secure, air-gapped PDF analysis. 📄