A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
Embed a live health badge in a README or docs page.
[](https://www.ai-tools-scout.com/projects/kreuzberg)See how this project stacks up against other framework tools.
Get the fastest-growing projects, useful MCP servers, and technical reads in one weekly email.