What is inference-smoke-tests?

Perfect for AI agents needing automated inference smoke testing capabilities with GPT3 tooling. Run repeatable inference smoke tests using geppetto/pinocchio example binaries (single-pass, streaming, tool-loop, OpenAI Responses thinking), including tmux-driven TUI tests. Use when refactors touch

How do I install inference-smoke-tests?

Run the command: npx killer-skills add go-go-golems/geppetto/inference-smoke-tests. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for inference-smoke-tests?

Key use cases include automating inference smoke tests for GPT3 models, generating test execution reports using the bundled script, and debugging inference issues with the manual checklist in playbook.md.

Which IDEs are compatible with inference-smoke-tests?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for inference-smoke-tests?

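Requires an OpenAI API key (OPENAI_API_KEY). Requires Claude credentials (e.g. ANTHROPIC_API_KEY) for the Claude tool-calling step. Requires tmux for the TUI runs and a Go environment to run the example binaries.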

inference-smoke-tests

Install inference-smoke-tests, an AI agent skill for AI agent workflows and automation. Review the use cases, limitations, and setup path before rollout.

SKILL.md

Upstream Repository Material

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

Supporting Evidence

Inference Smoke Tests

Name: inference-smoke-tests
Author: go-go-golems

Quick Start (Recommended)

Run the fast suite (geppetto non-TUI + pinocchio agent TUI) via the bundled script:

```bash
bash geppetto/.codex/skills/inference-smoke-tests/scripts/run_smoke.sh --quick
```

If you need the full manual checklist, open:

geppetto/.codex/skills/inference-smoke-tests/references/playbook.md

Preconditions

Ensure OPENAI_API_KEY is set (for OpenAI Chat + OpenAI Responses).
Ensure Claude credentials are available (e.g. ANTHROPIC_API_KEY) if you want the Claude tool-calling smoke step to pass.
Ensure tmux is installed (required for non-interactive TUI runs).
Expect costs: these tests make real API calls.
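
The preconditions above can be checked before spending money on real API calls. The following is a minimal sketch, not part of the bundled script: the function name `check_preconditions` is hypothetical, and it assumes Claude credentials are supplied through `ANTHROPIC_API_KEY`, which the list above gives only as an example.

```bash
#!/usr/bin/env bash
# Preflight sketch: report which smoke-test preconditions are missing.
# check_preconditions is a hypothetical helper, not part of run_smoke.sh.

check_preconditions() {
  local missing=0

  # The OpenAI Chat and OpenAI Responses steps need this key.
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "missing: OPENAI_API_KEY"
    missing=1
  fi

  # Only the Claude tool-calling step needs this, so warn instead of failing.
  # Assumption: Claude credentials are provided via ANTHROPIC_API_KEY.
  if [ -z "${ANTHROPIC_API_KEY:-}" ]; then
    echo "warning: ANTHROPIC_API_KEY not set (Claude step will not pass)"
  fi

  # tmux drives the TUI runs non-interactively.
  if ! command -v tmux >/dev/null 2>&1; then
    echo "missing: tmux"
    missing=1
  fi

  return "$missing"
}
```

Calling `check_preconditions && bash geppetto/.codex/skills/inference-smoke-tests/scripts/run_smoke.sh --quick` would then skip the paid run whenever a hard requirement is absent.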

Workflow Decision Tree

Validate provider “thinking” streaming (Responses)?

Run geppetto/cmd/examples/openai-tools in --mode thinking.

Validate tool loop orchestration?

Run geppetto/cmd/examples/generic-tool-calling.

Validate Bubble Tea TUI event flow (thinking deltas + final)?

Run pinocchio/cmd/agents/simple-chat-agent in tmux.

Validate Claude tool calling?

Run geppetto/cmd/examples/claude-tools with --ai-api-type claude --ai-engine claude-haiku-4-5.

Validate multi-turn chat state persistence?

Run pinocchio TUI chat in tmux (manual) and/or pinocchio webchat in browser (manual).
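
The decision tree above can be written as a simple lookup. This is an illustrative sketch only: the function name `smoke_target` and the goal keywords (`thinking`, `tool-loop`, `tui`, `claude`, `multi-turn`) are hypothetical labels, while the paths and flags are copied from the tree. It prints what to run and does not execute anything.

```bash
#!/usr/bin/env bash
# Map a validation goal to the entry point named in the decision tree.
# The goal keywords are hypothetical labels chosen for this sketch.

smoke_target() {
  case "$1" in
    thinking)   echo "geppetto/cmd/examples/openai-tools --mode thinking" ;;
    tool-loop)  echo "geppetto/cmd/examples/generic-tool-calling" ;;
    tui)        echo "pinocchio/cmd/agents/simple-chat-agent (run in tmux)" ;;
    claude)     echo "geppetto/cmd/examples/claude-tools --ai-api-type claude --ai-engine claude-haiku-4-5" ;;
    multi-turn) echo "pinocchio TUI chat in tmux and/or pinocchio webchat in browser (manual)" ;;
    *)          echo "unknown goal: $1" >&2; return 1 ;;
  esac
}
```

For example, `smoke_target claude` prints the Claude tool-calling entry point together with the two flags the tree lists for it.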

What “Benefits From InferenceState” (Rules of Thumb)

Already benefits (multi-turn, cancel-sensitive, tool-loop, strict provider validation):

pinocchio TUI chat (pinocchio/cmd/pinocchio … --chat)
pinocchio agent TUI (pinocchio/cmd/agents/simple-chat-agent …)
pinocchio webchat (pinocchio/cmd/web-chat)
geppetto example runners that execute via geppetto/pkg/inference/core.Session

Could benefit (optional; mainly consistency/cancel):

pinocchio/cmd/examples/simple-redis-streaming-inference (transport-focused; currently eng.RunInference direct)
pinocchio/cmd/examples/simple-chat (exercises PinocchioCommand runner; could benefit indirectly if that runner standardizes on InferenceState)

Does not apply (not an inference runner):

geppetto/cmd/examples/citations-event-stream

Troubleshooting (Common Failure Modes)

“OpenAI Responses 400” errors

Re-run with higher logging:
- Add --log-level debug --with-caller where supported.
Confirm you’re using the correct provider mode:
- --ai-api-type openai-responses
If the error mentions invalid parameter support (e.g., temperature unsupported), it’s model-dependent; reduce parameters and retry.
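
As a rough sketch of the re-run step, the helper below appends the provider mode and logging flags from the checklist above to whatever command you pass in, and prints the result instead of running it. The name `with_debug_flags` is hypothetical, and launching the example with `go run` from a workspace root is an assumption; the logging flags apply only "where supported", so check the binary's help output first.

```bash
#!/usr/bin/env bash
# Print (do not run) a re-run command with the flags from the checklist.
# with_debug_flags is a hypothetical helper for this sketch.

with_debug_flags() {
  echo "$* --ai-api-type openai-responses --log-level debug --with-caller"
}

# Assumption: the example is launched with `go run` from the workspace root.
with_debug_flags go run ./geppetto/cmd/examples/openai-tools --mode thinking
```

Printing the command first makes it easy to drop individual parameters when the 400 turns out to be a model-dependent parameter rejection.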

TUI doesn’t submit the prompt

Some TUIs submit on Tab (not Enter).
Always capture logs to a file and confirm inference actually ran (look for EventPartialCompletionStart, EventFinal).
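
Both checks above can be scripted. The sketch below is one possible approach, not the bundled script's method: `submit_prompt` and `inference_ran` are hypothetical helper names, and how a given TUI writes its log file depends on the binary, so the log path is left to the caller.

```bash
#!/usr/bin/env bash
# Sketch: submit a prompt to a TUI running in tmux, then confirm from the
# log that inference actually ran. Helper names are hypothetical.

# Type the prompt into the tmux session, then submit with Tab
# (some TUIs submit on Tab rather than Enter).
submit_prompt() {
  local session="$1" prompt="$2"
  tmux send-keys -t "$session" -l "$prompt"   # -l sends the text literally
  tmux send-keys -t "$session" Tab
}

# Succeed only if the log contains both the start and the final event.
inference_ran() {
  local log="$1"
  grep -q "EventPartialCompletionStart" "$log" &&
    grep -q "EventFinal" "$log"
}
```

A typical sequence would start the TUI in a detached session with `tmux new-session -d -s smoke "<tui command>"`, call `submit_prompt smoke "hello"`, wait for the response, and then run `inference_ran` against the captured log file.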

References

When you need copy/paste commands for the full sweep, read:

geppetto/.codex/skills/inference-smoke-tests/references/playbook.md

When you need to find new example entry points, search:

```bash
rg -n "cmd/examples" -S geppetto/cmd/examples pinocchio/cmd/examples
rg -n "cmd/agents" -S pinocchio/cmd/agents
```

