Vllm Mlx MCP Server

v1.0.0AI/MLstable

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

anthropicapple-siliconaudio-processingclaude-codecomputer-vision
Share:
1,222
Stars
0
Downloads
0
Weekly
0/5

About

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL, LLaVA) with continuous batching, MCP tool calling, and multimodal support. Native MLX backend, 400+ tok/s. Works with Claude Code.

Features

  • OpenAI and Anthropic compatible server for Apple Silicon. Ru

Use Cases

OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-la
waybarrios

Maintainer

LicenseApache-2.0
Languagepython
Versionv1.0.0
UpdatedMay 21, 2026
Statushealthy
Maintenanceactive

Works with

ClaudeOpenAIwindowsmacoslinux

Installation

Manual Installation

npx vllm-mlx

Examples

Basic Configuration

Configure Vllm Mlx MCP Server

npx vllm-mlx

Configuration

Configuration Details

Config File

claude_desktop_config.json

Web UI

Not available

Security

Authentication

RequiredNo

Features

Data Encryption
Audit Logging
RBAC Support

Performance

Response Metrics

Response Time< 200ms
ThroughputMedium

Resource Usage

Memory UsageLow
CPU UsageLow

More AI/ML MCP Servers

Explore other servers in the AI/ML category.

Quick Config Preview

{ "mcpServers": { "vllm-mlx": { "command": "npx", "args": ["-y", "vllm-mlx"] } } }

Add this to your claude_desktop_config.json or .cursor/mcp.json

Read the full setup guide →

Ready to use Vllm Mlx MCP Server?

Browse our complete directory of 60+ MCP servers, read setup guides for your editor, and start building with the Model Context Protocol.

60+ ServersFree & Open SourceStep-by-Step Guides