Killer-Skills

gpu-cli — remote NVIDIA GPU command execution

v0.14.0

About this Skill

Perfect for AI Agents needing remote NVIDIA GPU acceleration for machine learning workflows. gpu-cli is a public-facing CLI tool for running local commands on remote NVIDIA GPUs. It works by prefixing any command with `gpu`, which triggers automatic pod provisioning, code synchronization, log streaming, and output retrieval for machine learning and development workflows.

Features

Runs local commands on remote NVIDIA GPUs via command prefixing (e.g., `gpu run uv run python train.py`)
Automatically provisions a remote pod for execution
Synchronizes local code to the remote environment
Streams logs from the remote pod in real-time
Synchronizes output files back to the local machine
Provides diagnostic commands like `gpu doctor --json` to check setup health

# Core Topics

Updated: 3/7/2026

Quality Score

50 — Excellent (Top 5%), based on code quality & docs
Installation
Universal install (auto-detects Cursor, Windsurf, and VS Code):

> npx killer-skills add gpu-cli/gpu/gpu-cli

Agent Capability Analysis

The gpu-cli MCP Server is an open-source community integration for Claude and other AI agents, enabling seamless task automation and capability expansion.

Ideal Agent Persona

Perfect for AI Agents needing remote NVIDIA GPU acceleration for machine learning workflows.

Core Value

Empowers agents to run local commands on remote NVIDIA GPUs, stream logs, and sync outputs back, with tooling such as `uv` for dependency management and automatic pod provisioning, all by prefixing commands with `gpu`.

Capabilities Granted for gpu-cli MCP Server

Automating machine learning model training on remote GPUs
Debugging GPU-accelerated workflows with `gpu doctor` and `gpu status`
Optimizing costs by monitoring running pods and available GPUs with `gpu inventory`

! Prerequisites & Limits

  • Requires NVIDIA GPU access
  • Depends on `uv` for workflow management
  • Needs proper setup and authentication for remote GPU provisioning
Project files

  • SKILL.md (7.7 KB)
  • .cursorrules (1.2 KB)
  • package.json (240 B)

SKILL.md

GPU CLI

GPU CLI runs local commands on remote NVIDIA GPUs by prefixing with `gpu`. It provisions a pod, syncs your code, streams logs, and syncs outputs back: `uv run python train.py` becomes `gpu run uv run python train.py`.

Quick diagnostics

```bash
gpu doctor --json     # Check if setup is healthy (daemon, auth, provider keys)
gpu status --json     # See running pods and costs
gpu inventory --json  # See available GPUs and pricing
```
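When scripting against these diagnostics, the `--json` output can be parsed without extra tooling. A minimal sketch, assuming a payload shape for illustration (the sample below and its `healthy` field are hypothetical; inspect the real `gpu doctor --json` output for the actual schema):

```shell
# Hypothetical sample payload standing in for `gpu doctor --json` output:
sample='{"healthy": true, "daemon": "running"}'
# Extract the "healthy" field with sed (no jq dependency):
healthy=$(printf '%s' "$sample" | sed -n 's/.*"healthy": *\([a-z]*\).*/\1/p')
echo "$healthy"
```

In a real pipeline you would substitute the command itself for the sample string, e.g. `gpu doctor --json | sed -n …`.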

Command families

Getting started

| Command | Purpose |
| --- | --- |
| `gpu login` | Browser-based authentication |
| `gpu logout [-y]` | Remove session |
| `gpu init [--gpu-type T] [--force]` | Initialize project config |
| `gpu upgrade` | Open subscription upgrade page |

Running code

| Command | Purpose |
| --- | --- |
| `gpu run <command>` | Execute on remote GPU (main command) |
| `gpu run -d <command>` | Run detached (background) |
| `gpu run -a <job_id>` | Reattach to running job |
| `gpu run --cancel <job_id>` | Cancel a running job |
| `gpu status [--json]` | Show project status, pods, costs |
| `gpu logs [-j JOB] [-f] [--tail N] [--json]` | View job output |
| `gpu attach <job_id>` | Reattach to job output stream |
| `gpu stop [POD_ID] [-y]` | Stop active pod |

Key gpu run flags: --gpu-type, --gpu-count <1-8>, --min-vram, --rebuild, -o/--output, --no-output, --sync, -p/--publish <PORT>, -e <KEY=VALUE>, -i/--interactive.
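The flags above compose freely on one command line. An illustrative invocation (GPU type, env var, and output path are placeholder values; the command is echoed rather than executed so the sketch is self-contained):

```shell
# Placeholder values throughout; substitute your own GPU type and paths.
echo 'gpu run --gpu-type "RTX 4090" --gpu-count 2 --min-vram 24 -e EPOCHS=10 -o outputs/ uv run python train.py'
```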

GPU inventory

| Command | Purpose |
| --- | --- |
| `gpu inventory [--available] [--min-vram N] [--max-price P] [--json]` | List GPUs with pricing |

Volumes

| Command | Purpose |
| --- | --- |
| `gpu volume list [--detailed] [--json]` | List network volumes |
| `gpu volume create [--name N] [--size GB] [--datacenter DC]` | Create volume |
| `gpu volume delete <VOL> [--force]` | Delete volume |
| `gpu volume extend <VOL> --size <GB>` | Increase size |
| `gpu volume set-global <VOL>` | Set default volume |
| `gpu volume status [--volume V] [--json]` | Volume usage |
| `gpu volume migrate <VOL> --to <DC>` | Migrate to datacenter |
| `gpu volume sync <SRC> <DEST>` | Sync between volumes |

Vault (encrypted storage)

| Command | Purpose |
| --- | --- |
| `gpu vault list [--json]` | List encrypted output files |
| `gpu vault export <PATH> <DEST>` | Export decrypted file |
| `gpu vault stats [--json]` | Storage usage stats |

Configuration

| Command | Purpose |
| --- | --- |
| `gpu config show [--json]` | Show merged config |
| `gpu config validate` | Validate against schema |
| `gpu config schema` | Print JSON schema |
| `gpu config set <KEY> <VALUE>` | Set global config option |
| `gpu config get <KEY>` | Get global config value |

Authentication

| Command | Purpose |
| --- | --- |
| `gpu auth login [--profile P]` | Authenticate with cloud provider |
| `gpu auth logout` | Remove credentials |
| `gpu auth status` | Show auth status |
| `gpu auth add <HUB>` | Add hub credentials (hf, civitai) |
| `gpu auth remove <HUB>` | Remove hub credentials |
| `gpu auth hubs` | List configured hubs |

Organizations

| Command | Purpose |
| --- | --- |
| `gpu org list` | List organizations |
| `gpu org create <NAME>` | Create organization |
| `gpu org switch [SLUG]` | Set active org context |
| `gpu org invite <EMAIL>` | Invite member |
| `gpu org service-account create --name N` | Create service token |
| `gpu org service-account list` | List service accounts |
| `gpu org service-account revoke <ID>` | Revoke token |

LLM inference

| Command | Purpose |
| --- | --- |
| `gpu llm run [--ollama\|--vllm] [--model M] [-y]` | Launch LLM inference |
| `gpu llm info [MODEL] [--url URL] [--json]` | Show model info |

ComfyUI workflows

| Command | Purpose |
| --- | --- |
| `gpu comfyui list [--json]` | Browse available workflows |
| `gpu comfyui info <WORKFLOW> [--json]` | Show workflow details |
| `gpu comfyui validate <WORKFLOW> [--json]` | Pre-flight checks |
| `gpu comfyui run <WORKFLOW>` | Run workflow on GPU |
| `gpu comfyui generate "<PROMPT>"` | Text-to-image generation |
| `gpu comfyui stop [WORKFLOW] [--all]` | Stop ComfyUI pod |

Notebooks

| Command | Purpose |
| --- | --- |
| `gpu notebook [FILE] [--run] [--new NAME]` | Run Marimo notebook on GPU |

Alias: `gpu nb`

Serverless endpoints

| Command | Purpose |
| --- | --- |
| `gpu serverless deploy [--template T] [--json]` | Deploy endpoint |
| `gpu serverless status [ENDPOINT] [--json]` | Endpoint status |
| `gpu serverless logs [ENDPOINT]` | View request logs |
| `gpu serverless list [--json]` | List all endpoints |
| `gpu serverless delete [ENDPOINT]` | Delete endpoint |
| `gpu serverless warm [--cpu\|--gpu]` | Pre-warm endpoint |

Templates

| Command | Purpose |
| --- | --- |
| `gpu template list [--json]` | Browse official templates |
| `gpu template clear-cache` | Clear cached templates |

Daemon control

| Command | Purpose |
| --- | --- |
| `gpu daemon status [--json]` | Show daemon health |
| `gpu daemon start` | Start daemon |
| `gpu daemon stop` | Stop daemon |
| `gpu daemon restart` | Restart daemon |
| `gpu daemon logs [-f] [-n N]` | View daemon logs |

Tools and utilities

| Command | Purpose |
| --- | --- |
| `gpu dashboard` | Interactive TUI for pods and jobs |
| `gpu doctor [--json]` | Diagnostic checks |
| `gpu agent-docs` | Print agent reference to stdout |
| `gpu update [--check]` | Update CLI |
| `gpu changelog [VERSION]` | View release notes |
| `gpu issue ["desc"]` | Report issue |
| `gpu desktop` | Desktop app management |
| `gpu support` | Open community Discord |

Common workflows

  1. Setup: gpu login then gpu init
  2. Run job: gpu run python train.py --epochs 10
  3. With specific GPU: gpu run --gpu-type "RTX 4090" python train.py
  4. Detached job: gpu run -d python long_training.py then gpu status --json
  5. Check status: gpu status --json
  6. View logs: gpu logs --json
  7. Stop pods: gpu stop -y
  8. LLM inference: gpu llm run --ollama --model llama3 -y
  9. ComfyUI: gpu comfyui run flux_schnell
  10. Diagnose issues: gpu doctor --json

gpu run is pod-reuse oriented: after a command completes, the next gpu run reuses the existing pod until you run gpu stop or the cooldown period ends.

JSON output

Most commands support --json for machine-readable output. Structured data goes to stdout; human-oriented status and progress messages go to stderr.

Commands with --json: status, logs, doctor, inventory, config show, daemon status, volume list, volume status, vault list, vault stats, comfyui list, comfyui info, comfyui validate, serverless deploy, serverless status, serverless list, template list, llm info.
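The stdout/stderr contract above is what makes `--json` pipeline-safe. A self-contained sketch of the split (the function and its messages are illustrative stand-ins for a gpu-cli command):

```shell
# Stand-in for a gpu-cli command that follows the documented contract:
emit_status() {
  echo "provisioning pod..." >&2   # human-oriented progress (stderr)
  echo '{"status": "running"}'     # machine-readable payload (stdout)
}
# Command substitution captures only stdout; stderr is discarded here:
payload=$(emit_status 2>/dev/null)
echo "$payload"
```

Because progress lines go to stderr, `gpu status --json | your-parser` sees only the JSON document.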

Exit codes

| Code | Meaning | Recovery |
| --- | --- | --- |
| 0 | Success | Proceed |
| 1 | General error | Read stderr |
| 2 | Usage error | Fix command syntax |
| 10 | Auth required | `gpu auth login` |
| 11 | Quota exceeded | `gpu upgrade` or wait |
| 12 | Not found | Check resource ID |
| 13 | Daemon unavailable | `gpu daemon start`, retry |
| 14 | Timeout | Retry |
| 15 | Cancelled | Re-run if needed |
| 130 | Interrupted | Re-run if needed |
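For unattended agents, the exit-code table can drive automated recovery. A minimal dispatch sketch mirroring the table (the recovery messages are illustrative; a real wrapper would invoke the suggested command instead of printing it):

```shell
# Map a gpu-cli exit code to a recovery action, per the table above.
recover() {
  case "$1" in
    0)      echo "proceed" ;;
    2)      echo "fix command syntax" ;;
    10)     echo "run: gpu auth login" ;;
    11)     echo "run: gpu upgrade (or wait)" ;;
    13)     echo "run: gpu daemon start, then retry" ;;
    14|130) echo "retry" ;;
    *)      echo "inspect stderr" ;;
  esac
}
recover 13
```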

Configuration

  • Project config: gpu.toml, gpu.jsonc, or pyproject.toml [tool.gpu]
  • Global config: ~/.gpu-cli/config.toml (via gpu config set/get)
  • Sync model: .gitignore controls upload; outputs patterns control download
  • Secrets and credentials must stay in the OS keychain, never in plaintext project files
  • CI env vars: GPU_RUNPOD_API_KEY, GPU_SSH_PRIVATE_KEY, GPU_SSH_PUBLIC_KEY
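In CI, the documented environment variables replace the keychain. A sketch of the setup step (the values below are placeholders; in a real pipeline they come from the CI secret store, never from files committed to the repository):

```shell
# Placeholder values; source real ones from your CI secret store.
export GPU_RUNPOD_API_KEY="placeholder-api-key"
export GPU_SSH_PRIVATE_KEY="placeholder-private-key"
export GPU_SSH_PUBLIC_KEY="placeholder-public-key"
echo "credentials exported"
```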

References

  • Project generation and task setup: references/create.md
  • Debugging and common failures: references/debug.md
  • Config schema and field examples: references/config.md
  • Cost and GPU selection guidance: references/optimize.md
  • Persistent storage and volumes: references/volumes.md
