Pinned

  1. kreuzberg Public

    A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 76+ formats. Available for Rust, Python, Rub…

    Rust 6.7k 315

  2. html-to-markdown Public

    High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts stru…

    HTML 563 50

  3. langchain-kreuzberg Public

    Langchain document loader for Kreuzberg

    Python 4

  4. tree-sitter-language-pack Public

    A tree-sitter language pack

    Elixir 255 43

Repositories

Showing 10 of 12 repositories
  • kreuzberg Public

    A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 76+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use it via the CLI, REST API, or MCP server.

    kreuzberg-dev/kreuzberg’s past year of commit activity
    Rust 6,661 MIT 315 23 (1 issue needs help) 4 Updated Mar 10, 2026
  • homebrew-tap Public
    kreuzberg-dev/homebrew-tap’s past year of commit activity
    Ruby 0 0 1 0 Updated Mar 10, 2026
  • tree-sitter-language-pack Public

    A tree-sitter language pack

    kreuzberg-dev/tree-sitter-language-pack’s past year of commit activity
    Elixir 255 43 2 (2 issues need help) 0 Updated Mar 10, 2026
  • kreuzberg-surrealdb
    kreuzberg-dev/kreuzberg-surrealdb’s past year of commit activity
    0 MIT 0 0 1 Updated Mar 10, 2026
  • html-to-markdown Public

    High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

    kreuzberg-dev/html-to-markdown’s past year of commit activity
    HTML 563 MIT 50 1 1 Updated Mar 10, 2026
  • kreuzberg-dev.r-universe.dev Public

    R-universe repository for Kreuzberg.dev

    kreuzberg-dev/kreuzberg-dev.r-universe.dev’s past year of commit activity
    0 0 0 0 Updated Mar 8, 2026
  • ai-rulez Public
    kreuzberg-dev/ai-rulez’s past year of commit activity
    2 MIT 0 0 0 Updated Mar 8, 2026
  • haystack-core-integrations Public Forked from deepset-ai/haystack-core-integrations

    Additional packages (components, document stores and the likes) to extend the capabilities of Haystack

    kreuzberg-dev/haystack-core-integrations’s past year of commit activity
    Python 0 Apache-2.0 218 0 0 Updated Mar 6, 2026
  • langchain-kreuzberg Public

    Langchain document loader for Kreuzberg

    kreuzberg-dev/langchain-kreuzberg’s past year of commit activity
    Python 4 MIT 0 0 0 Updated Mar 4, 2026
  • .github Public

    Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 75+ document formats using streaming parsers and built-in OCR. Designed for RAG pipelines, batch workloads, and production deployments.

    kreuzberg-dev/.github’s past year of commit activity
    1 0 1 0 Updated Feb 28, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.