Skip to content

gpu-mcp-server

An MCP server that exposes NVIDIA GPU metrics as tools. Any MCP-compatible AI agent (Claude, Goose, Cursor, Windsurf) can query real-time GPU utilization, memory, temperature, power, PCIe and NVLink throughput — no Prometheus or dcgm-exporter required.

Built on the official Go MCP SDK and NVIDIA go-nvml.

Tools

Tool Description
list_gpus List all GPUs with utilization and memory info
get_gpu_metrics Detailed metrics for a GPU by index or UUID
get_gpu_processes PID-level GPU process attribution
gpu_summary Aggregate stats across all devices

All tools support MIG (Multi-Instance GPU) — MIG instances appear as separate devices with their parent GPU's shared metrics (temperature, power, PCIe).

Project info

  • License: Apache 2.0
  • Language: Go
  • AAIF alignment: MCP
  • Related: keda-gpu-scaler — GPU autoscaling for Kubernetes