Killer-Skills

checkpoint-ambiguity-review — automated ambiguity review for checkpoint specs and tests

v1.0.0
GitHub

About this Skill

checkpoint-ambiguity-review is a skill that analyzes checkpoint specifications and tests to detect and resolve ambiguity issues. It is aimed at code review agents that need test analysis and ambiguity detection capabilities.

Features

  • Collects inputs: problem name, checkpoint number, and file paths
  • Scans test files and spec paths, including `problems/{problem}/tests/test_checkpoint_{N}.py`
  • Reports tests with non-explicit interpretations, with rationale and fixes
  • Infers spec and test file paths when they are not provided
  • Works with Markdown spec files such as `problems/{problem}/checkpoint_{N}.md`

SprocketLab
Updated: 3/6/2026

Quality Score: 39 (Excellent, Top 5%), based on code quality & docs
Installation

Universal install (auto-detect; works with Cursor, Windsurf, and VS Code):

```bash
npx killer-skills add SprocketLab/slop-code-bench/checkpoint-ambiguity-review
```

Agent Capability Analysis

The checkpoint-ambiguity-review MCP Server by SprocketLab is an open-source community integration for Claude and other AI agents, enabling them to review checkpoint specs and tests for ambiguity.

Ideal Agent Persona

Perfect for Code Review Agents needing advanced test analysis and ambiguity detection capabilities.

Core Value

Empowers agents to review checkpoint specs and tests, detecting non-explicit interpretations and providing rationale and fixes, utilizing Markdown and Python file analysis.

Capabilities Granted for checkpoint-ambiguity-review MCP Server

Reviewing checkpoint tests for ambiguity and implicit assumptions
Generating reports on non-explicit interpretations with rationale and fixes
Automating test accuracy validation for checkpoint specs

Prerequisites & Limits

  • Requires access to problem directories and files (e.g., .md and .py files)
  • Limited to reviewing checkpoint specs and tests in Markdown and Python formats
  • May require inference of spec and test file paths if not provided
Project files:

  • SKILL.md (2.3 KB)
  • .cursorrules (1.2 KB)
  • package.json (240 B)

SKILL.md

Checkpoint Ambiguity Review

Overview

Review a checkpoint's spec and tests to find tests that enforce a reasonable but non-explicit interpretation, and report those cases with rationale and fixes.

Workflow

1) Collect inputs

  • Problem name and checkpoint number (N).
  • Test file path(s) and checkpoint spec path (if not provided, infer):
    • Spec: problems/{problem}/checkpoint_{N}.md
    • Tests: problems/{problem}/tests/test_checkpoint_{N}.py
    • Also scan problems/{problem}/tests/conftest.py and problems/{problem}/tests/data/ if they influence expectations.
  • Optional: snapshot path for ambiguity verification.
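
The path-inference convention above can be sketched in Python. This is a minimal sketch of the documented layout; the `root` argument and the `infer_paths` helper name are assumptions, not part of the skill itself.

```python
from pathlib import Path


def infer_paths(problem: str, n: int, root: Path = Path(".")) -> dict:
    """Infer spec/test paths for a checkpoint when none are given.

    Follows the skill's documented layout; `root` is an assumed
    repository root.
    """
    base = root / "problems" / problem
    return {
        "spec": base / f"checkpoint_{n}.md",
        "tests": base / "tests" / f"test_checkpoint_{n}.py",
        # Optional files that can influence test expectations:
        "conftest": base / "tests" / "conftest.py",
        "data_dir": base / "tests" / "data",
    }


print(infer_paths("grid-game", 2)["spec"])
```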

2) Read the spec and tests

  • Extract explicit requirements from the spec.
  • Map each test assertion to a specific spec clause or an implied behavior.
  • Note any test assumptions that are not spelled out in the spec.
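
Mapping assertions to spec clauses starts with an inventory of the assertions themselves. The sketch below, an assumption about one possible workflow rather than part of the skill, uses Python's `ast` module to list each assert statement per test function so it can be matched to a spec clause by hand.

```python
import ast


def list_assertions(test_source: str) -> list[tuple[str, int]]:
    """Collect (test function name, assert line number) pairs."""
    tree = ast.parse(test_source)
    found = []
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and node.name.startswith("test_"):
            for sub in ast.walk(node):
                if isinstance(sub, ast.Assert):
                    found.append((node.name, sub.lineno))
    return found


# Hypothetical test source; `solve` is never executed, only parsed.
src = '''
def test_sorted_output():
    assert solve([3, 1]) == [1, 3]
'''
print(list_assertions(src))
```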

3) Flag ambiguous interpretations

Only report tests that enforce an interpretation that could reasonably differ given the spec wording. Do not report tests that are simply incorrect against explicit requirements.

Common ambiguity cues:

  • Output ordering when the spec does not mandate order.
  • Tie-breaking rules that are unstated.
  • Whitespace, casing, or formatting details not defined by the spec.
  • Rounding or precision requirements not defined.
  • Error handling for invalid inputs when not specified.
  • Boundary behavior (inclusive/exclusive) not stated.
  • Default values or optional fields not defined.
  • Determinism or randomness expectations not specified.
  • Multiple reasonable data structure representations (list vs set, map order).
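
As an illustration of the first cue (output ordering), here is a hypothetical checkpoint function and two tests. The names `unique_tags` and the spec wording are invented for this sketch; the point is that the strict test enforces insertion order the spec never promised, and the relaxed version removes that assumption.

```python
# Hypothetical spec: "return the unique tags" -- order is not mandated.
def unique_tags(items):
    # One reasonable implementation; preserves first-seen order.
    return list(dict.fromkeys(items))


# Ambiguous: enforces insertion order the spec never promised.
def test_unique_tags_strict():
    assert unique_tags(["b", "a", "b"]) == ["b", "a"]


# Relaxed fix: compare as sets (or sort both sides before comparing).
def test_unique_tags_relaxed():
    assert set(unique_tags(["b", "a", "b"])) == {"a", "b"}
```

The strict test happens to pass against this implementation, which is exactly why the ambiguity is easy to miss: a different but equally valid implementation (e.g. sorted output) would fail it.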

4) Optional snapshot verification

If a snapshot is provided, run:

bash
1slop-code --quiet eval-snapshot {snapshot} -p {problem} -o /tmp/eval -c {N} -e configs/environments/docker-python3.12-uv.yaml --json

Use failures to corroborate ambiguity, not to invent it. A failing test is ambiguous only if the spec supports multiple reasonable interpretations.
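
Triaging the `--json` output might look like the sketch below. The JSON shape shown is an assumption for illustration; the real schema of `slop-code` may differ, so treat this as a sketch of pulling out failing tests for manual review, not a parser for the tool.

```python
import json

# Hypothetical shape of the --json output (assumed, not documented here).
report = json.loads('{"results": [{"test": "test_order", "passed": false}]}')

# Failing tests are only *candidates*: each must still be checked against
# the spec to see whether multiple reasonable interpretations exist.
failures = [r["test"] for r in report["results"] if not r["passed"]]
print(failures)
```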

5) Report format

Use the following structure for each ambiguous test:

```
## {test name} ({path}::{node_id})

**Why:** {spec language + alternate interpretation that could be valid}
**Fix:** {proposed test relaxation or spec clarification}
```

Keep entries concise and actionable. If no ambiguity is found, state that clearly (e.g., "No ambiguity issues found.").
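
A small renderer can keep entries uniform. This is a sketch, not part of the skill: the helper name and the example test, path, and wording are all hypothetical.

```python
def render_entry(test_name: str, path: str, node_id: str,
                 why: str, fix: str) -> str:
    """Format one ambiguous-test entry in the report structure above."""
    return (
        f"## {test_name} ({path}::{node_id})\n\n"
        f"**Why:** {why}\n"
        f"**Fix:** {fix}\n"
    )


entry = render_entry(
    "test_unique_tags_strict",
    "problems/grid-game/tests/test_checkpoint_2.py",
    "test_unique_tags_strict",
    "Spec says 'return the unique tags' without mandating order; "
    "insertion order is only one reasonable reading.",
    "Compare as sets, or state the required order in the spec.",
)
print(entry.splitlines()[0])
```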
