Skip to content
Change the repository type filter

All

    Repositories list

    • kreuzcrawl

      Public
      Rust
      Other
      0400Updated Apr 19, 2026Apr 19, 2026
    • kreuzberg

      Public
      A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and…
      Rust
      Other
      3807.6k101Updated Apr 19, 2026Apr 19, 2026
    • R-universe repository for Kreuzberg.dev
      0000Updated Apr 19, 2026Apr 19, 2026
    • homebrew-tap

      Public
      Ruby
      1010Updated Apr 19, 2026Apr 19, 2026
    • alef

      Public
      Generate fully-typed, lint-clean language bindings for Rust libraries across 11 languages
      Rust
      MIT License
      0800Updated Apr 19, 2026Apr 19, 2026
    • liter-llm

      Public
      Universal LLM API client — 142+ providers, 11 native language bindings, powered by Rust core
      Rust
      MIT License
      914800Updated Apr 19, 2026Apr 19, 2026
    • html-to-markdown

      Public
      High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engi…
      HTML
      MIT License
      5565712Updated Apr 19, 2026Apr 19, 2026
    • langchain-kreuzberg

      Public
      Langchain document loader for Kreuzberg
      Python
      MIT License
      0400Updated Apr 18, 2026Apr 18, 2026
    • tree-sitter-language-pack

      Public
      Comprehensive tree-sitter grammar compilation with polyglot bindings — Rust, Python, Node.js, Go, Java, Ruby, Elixir, PHP, C#, WASM, and CLI. 305+ languages.
      Rust
      MIT License
      5233200Updated Apr 18, 2026Apr 18, 2026
    • actions

      Public
      Shared GitHub actions
      Shell
      0000Updated Apr 17, 2026Apr 17, 2026
    • kreuzberg-txtai

      Public
      Kreuzberg integration for txtai — drop-in Textractor replacement and custom pipeline
      Python
      MIT License
      0100Updated Apr 16, 2026Apr 16, 2026
    • LlamaIndex reader and node parser integrations for kreuzberg — 88+ format document extraction with element-aware splitting
      Python
      MIT License
      0006Updated Apr 16, 2026Apr 16, 2026
    • agno

      Public
      Build, run, manage agentic software at scale.
      Python
      Apache License 2.0
      5.3k100Updated Apr 14, 2026Apr 14, 2026
    • kreuzberg-crewai

      Public
      Extract text and metadata from 88+ document formats — PDF, DOCX, XLSX, HTML, images with OCR, and more — directly from your CrewAI agents.
      Python
      MIT License
      0101Updated Apr 14, 2026Apr 14, 2026
    • .github

      Public
      Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 88+ document formats using streaming parsers and b…
      0110Updated Apr 10, 2026Apr 10, 2026
    • Spring AI DocumentReader integration for Kreuzberg document extraction engine
      Java
      MIT License
      0110Updated Apr 8, 2026Apr 8, 2026
    • Extract, chunk, and embed documents from 88+ formats directly into SurrealDB.
      Python
      MIT License
      11100Updated Mar 30, 2026Mar 30, 2026
    • 🚀 A list of Haystack Integrations, maintained by the community or deepset.
      139000Updated Mar 25, 2026Mar 25, 2026
    • Additional packages (components, document stores and the likes) to extend the capabilities of Haystack
      Python
      Apache License 2.0
      248000Updated Mar 25, 2026Mar 25, 2026
    • ai-rulez

      Public
      MIT License
      0300Updated Mar 25, 2026Mar 25, 2026
    • C++
      Apache License 2.0
      0100Updated Mar 16, 2026Mar 16, 2026
    • A high-level idiomatic Rust wrapper around Pdfium, the C++ PDF library used by the Google Chromium project.
      Rust
      Other
      118200Updated Dec 25, 2025Dec 25, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.