The fastest local AI engine for Apple Silicon: 4.2x faster than Ollama, 0.08s cached TTFT, and 100% tool-calling success. Ships 17 tool parsers, a prompt cache, reasoning separation, and cloud routing. A drop-in OpenAI replacement that works with Claude Code, Cursor, and Aider.
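Since the engine bills itself as a drop-in OpenAI replacement, a client would presumably reach it with a standard OpenAI-style chat-completions request. A minimal sketch of building such a request with the Python standard library; the local URL and model name are assumptions for illustration, not documented values:

```python
import json
import urllib.request

# Assumed local endpoint -- rapid-mlx's actual host/port are not documented here.
BASE_URL = "http://localhost:8080/v1"

# Standard OpenAI chat-completions payload shape.
payload = {
    "model": "local-model",  # placeholder model name
    "messages": [{"role": "user", "content": "Hello"}],
}

req = urllib.request.Request(
    f"{BASE_URL}/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
# urllib.request.urlopen(req) would send it to a running server.
print(req.full_url)
```

Because the request shape matches the OpenAI API, existing clients (including tools like Claude Code, Cursor, and Aider) should only need their base URL pointed at the local server.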