Roadmap

Public roadmap for gpu-mcp-server. Updated quarterly.

Q3 2026 (Current)

[x] Core MCP server with stdio transport
[x] list_gpus, get_gpu_metrics, gpu_summary tools
[x] get_gpu_processes tool
[x] NVML-based collector with MIG support
[x] Mock collector for testing without GPU hardware
[x] GitHub Actions CI
[x] Publish to MCP servers registry
[x] First tagged release (v0.1.0)
[x] Container image on GitHub Container Registry (ghcr.io)

Q4 2026

[ ] Streamable HTTP transport (run as a network service)
[ ] MCP resources that expose GPU info as context for agents
[ ] Per-process GPU usage (PID-level memory and compute attribution)
[ ] Prometheus metrics endpoint (optional, alongside MCP)
[ ] Integration tests with Claude Desktop and Goose

Q1 2027

[ ] Multi-node support — aggregate metrics from remote hosts
[ ] GPU event notifications (thermal throttling, ECC errors, XID events)
[ ] OpenSSF Best Practices badge
[x] Helm chart for Kubernetes deployment

Q2 2027

[ ] AMD ROCm support (via rocm-smi)
[ ] Intel GPU support (via oneAPI Level Zero)
[ ] MCP sampling integration — let agents request GPU snapshots at intervals
[ ] AAIF project proposal submission (Growth stage)

How to contribute

Pick an item from the roadmap, open an issue to discuss the approach, and submit a PR. See Contributing for details.