inference-smoke-tests

v1.0.0

About this Skill

Ideal for AI agents like Cursor, Windsurf, and Claude Code that need streamlined inference smoke testing with GPT3 tooling. inference-smoke-tests is a Golang GPT3 tooling skill that runs smoke tests via a bundled script and supports OpenAI and Claude credentials.

Features

Runs the fast suite via the bundled script geppetto/.codex/skills/inference-smoke-tests/scripts/run_smoke.sh
Supports OpenAI API key and Claude credentials for authentication
Provides a manual checklist in playbook.md for test execution
Uses geppetto non-TUI examples and the pinocchio agent TUI for test runs
Requires OPENAI_API_KEY; Claude credentials (e.g. ANTHROPIC_API_KEY) are needed for the Claude tool-calling step

# Core Topics

go-go-golems
Updated: 3/6/2026

Installation
> npx killer-skills add go-go-golems/geppetto/inference-smoke-tests

Agent Capability Analysis

The inference-smoke-tests skill by go-go-golems is an open-source community integration for Claude and other AI agents.

Ideal Agent Persona

Ideal for AI Agents like Cursor, Windsurf, and Claude Code requiring streamlined inference smoke testing with GPT3 tooling

Core Value

Lets agents run smoke tests via a bundled bash script that exercises the geppetto non-TUI examples and the pinocchio agent TUI, using OpenAI and Claude credentials. A Markdown playbook covers the full manual checklist.

Capabilities Granted for inference-smoke-tests MCP Server

Automating inference smoke tests with Geppetto
Debugging Pinocchio agent TUI integrations
Validating OpenAI API key setups

Prerequisites & Limits

  • Requires an OpenAI API key (OPENAI_API_KEY)
  • Needs Claude credentials (e.g. ANTHROPIC_API_KEY) for the Claude tool-calling step
  • Depends on bash for the bundled script and on tmux for non-interactive TUI runs

Inference Smoke Tests

Quick Start (Recommended)

Run the fast suite (geppetto non-TUI + pinocchio agent TUI) via the bundled script:

```bash
bash geppetto/.codex/skills/inference-smoke-tests/scripts/run_smoke.sh --quick
```

If you need the full manual checklist, open:

geppetto/.codex/skills/inference-smoke-tests/references/playbook.md

Preconditions

  • Ensure OPENAI_API_KEY is set (for OpenAI Chat + OpenAI Responses).
  • Ensure Claude credentials are available (e.g. ANTHROPIC_API_KEY) if you want the Claude tool-calling smoke step to pass.
  • Ensure tmux is installed (required for non-interactive TUI runs).
  • Expect costs: these tests make real API calls.
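
The preconditions above can be checked before spending money on API calls. The following is a minimal sketch, not part of the skill: the function name `preflight` and its messages are hypothetical, and it assumes the Claude credential is supplied as ANTHROPIC_API_KEY.

```bash
#!/usr/bin/env bash
# Preflight sketch: report which smoke-test preconditions are unmet.
# `preflight` and its messages are illustrative, not shipped with the skill.
preflight() {
  local missing=0
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "missing: OPENAI_API_KEY (OpenAI Chat + OpenAI Responses)"
    missing=1
  fi
  if [ -z "${ANTHROPIC_API_KEY:-}" ]; then
    # Only the Claude tool-calling step needs this, so warn instead of failing.
    echo "warning: ANTHROPIC_API_KEY unset; the Claude tool-calling step will not pass"
  fi
  if ! command -v tmux >/dev/null 2>&1; then
    echo "missing: tmux (non-interactive TUI runs)"
    missing=1
  fi
  return "$missing"
}
```

A typical use would be `preflight && bash geppetto/.codex/skills/inference-smoke-tests/scripts/run_smoke.sh --quick`, so the suite only starts when the hard requirements are met.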

Workflow Decision Tree

  1. Validate provider “thinking” streaming (Responses)?
     • Run geppetto/cmd/examples/openai-tools in --mode thinking.
  2. Validate tool loop orchestration?
     • Run geppetto/cmd/examples/generic-tool-calling.
  3. Validate Bubble Tea TUI event flow (thinking deltas + final)?
     • Run pinocchio/cmd/agents/simple-chat-agent in tmux.
  4. Validate Claude tool calling?
     • Run geppetto/cmd/examples/claude-tools with --ai-api-type claude --ai-engine claude-haiku-4-5.
  5. Validate multi-turn chat state persistence?
     • Run pinocchio TUI chat in tmux (manual) and/or pinocchio webchat in browser (manual).
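
The decision tree can also be read as a lookup from validation goal to entry point. The sketch below only prints the target named in the tree and does not run anything. The function name `smoke_target` and the goal keywords (`thinking`, `tool-loop`, `tui-events`, `claude`, `multi-turn`) are hypothetical labels chosen for this example.

```bash
# Map a validation goal to the entry point listed in the decision tree.
# Goal keywords are hypothetical; paths and flags are copied from the tree.
smoke_target() {
  case "$1" in
    thinking)   echo "geppetto/cmd/examples/openai-tools --mode thinking" ;;
    tool-loop)  echo "geppetto/cmd/examples/generic-tool-calling" ;;
    tui-events) echo "pinocchio/cmd/agents/simple-chat-agent (in tmux)" ;;
    claude)     echo "geppetto/cmd/examples/claude-tools --ai-api-type claude --ai-engine claude-haiku-4-5" ;;
    multi-turn) echo "pinocchio TUI chat in tmux or pinocchio webchat in browser (manual)" ;;
    *)          echo "unknown goal: $1" >&2; return 1 ;;
  esac
}
```

How each target is actually launched (for example, which directory to run from) is covered by the playbook, not by this lookup.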

What “Benefits From InferenceState” (Rules of Thumb)

Already benefits (multi-turn, cancel-sensitive, tool-loop, strict provider validation):

  • pinocchio TUI chat (pinocchio/cmd/pinocchio … --chat)
  • pinocchio agent TUI (pinocchio/cmd/agents/simple-chat-agent …)
  • pinocchio webchat (pinocchio/cmd/web-chat)
  • geppetto example runners that execute via geppetto/pkg/inference/core.Session

Could benefit (optional; mainly consistency/cancel):

  • pinocchio/cmd/examples/simple-redis-streaming-inference (transport-focused; currently calls eng.RunInference directly)
  • pinocchio/cmd/examples/simple-chat (exercises PinocchioCommand runner; could benefit indirectly if that runner standardizes on InferenceState)

Does not apply (not an inference runner):

  • geppetto/cmd/examples/citations-event-stream

Troubleshooting (Common Failure Modes)

“OpenAI Responses 400” errors

  • Re-run with higher logging:
    • Add --log-level debug --with-caller where supported.
  • Confirm you’re using the correct provider mode:
    • --ai-api-type openai-responses
  • If the error mentions invalid parameter support (e.g., temperature unsupported), it’s model-dependent; reduce parameters and retry.

TUI doesn’t submit the prompt

  • Some TUIs submit on Tab (not Enter).
  • Always capture logs to a file and confirm inference actually ran (look for EventPartialCompletionStart, EventFinal).
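
Confirming that inference ran can be scripted once logs are captured to a file. This is a minimal sketch assuming the two event names above appear verbatim in the log; the function name `inference_ran` and the log path are hypothetical.

```bash
# Succeed only if the log shows that inference both started and finished.
# Event names come from the note above; the log location is up to you.
inference_ran() {
  local log="$1"
  grep -q "EventPartialCompletionStart" "$log" && grep -q "EventFinal" "$log"
}
```

For example, `inference_ran /tmp/smoke.log || echo "inference did not run; the prompt may not have been submitted"`. If the TUI is running in tmux and submits on Tab, `tmux send-keys -t <session> Tab` sends that key, where `<session>` is whatever session name you started.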

References

When you need copy/paste commands for the full sweep, read:

  • geppetto/.codex/skills/inference-smoke-tests/references/playbook.md

When you need to find new example entry points, search:

```bash
rg -n "cmd/examples" -S geppetto/cmd/examples pinocchio/cmd/examples
rg -n "cmd/agents" -S pinocchio/cmd/agents
```
