babysit-pr — GitHub pull request monitoring babysit-pr, community, GitHub pull request monitoring, ide skills, CI check automation, workflow run polling, review comment handling, merge conflict detection, flaky failure retry, GitHub API integration

v1.0.0

About this Skill

Ideal for GitHub Integration Agents needing automated pull request monitoring and merge management. babysit-pr is a GitHub pull request monitoring skill that continuously polls CI checks, workflow runs, and review comments until the PR is merged or closed.

Features

Polls CI checks and workflow runs using GitHub API
Handles review comments and diagnoses failures
Auto-fixes branch-related issues and retries flaky failures
Monitors mergeability state and detects potential conflicts
Stops when user help is required, such as CI infrastructure issues

# Core Topics

Agents2AgentsAI Agents2AgentsAI
[23]
[0]
Updated: 2/26/2026

Killer-Skills Review

Decision support comes first. Repository text comes second.

Reviewed Landing Page Review Score: 10/11

Killer-Skills keeps this page indexable because it adds recommendation, limitations, and review signals beyond the upstream repository text.

Original recommendation layer Concrete use-case guidance Explicit limitations and caution Quality floor passed for review Locale and body language aligned
Review Score
10/11
Quality Score
65
Canonical Locale
en
Detected Body Locale
en

Ideal for GitHub Integration Agents needing automated pull request monitoring and merge management. babysit-pr is a GitHub pull request monitoring skill that continuously polls CI checks, workflow runs, and review comments until the PR is merged or closed.

Core Value

Empowers agents to continuously poll CI checks and review comments, ensuring seamless merge readiness by leveraging GitHub APIs and CI infrastructure, while handling potential merge conflicts and required review approvals.

Ideal Agent Persona

Ideal for GitHub Integration Agents needing automated pull request monitoring and merge management.

Capabilities Granted for babysit-pr

Automating GitHub pull request monitoring for merge readiness
Debugging CI infrastructure issues and flaky test failures
Generating notifications for user intervention on blocked merges

! Prerequisites & Limits

  • Requires GitHub API access and CI infrastructure setup
  • Limited to GitHub pull requests only
  • Dependent on CI check and review comment availability

Source Boundary

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

After The Review

Decide The Next Action Before You Keep Reading Repository Material

Killer-Skills should not stop at opening repository instructions. It should help you decide whether to install this skill, when to cross-check against trusted collections, and when to move into workflow rollout.

Labs Demo

Browser Sandbox Environment

⚡️ Ready to unleash?

Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.

Boot Container Sandbox

FAQ & Installation Steps

These questions and steps mirror the structured data on this page for better search understanding.

? Frequently Asked Questions

What is babysit-pr?

Ideal for GitHub Integration Agents needing automated pull request monitoring and merge management. babysit-pr is a GitHub pull request monitoring skill that continuously polls CI checks, workflow runs, and review comments until the PR is merged or closed.

How do I install babysit-pr?

Run the command: npx killer-skills add Agents2AgentsAI/ata/babysit-pr. It works with Cursor, Windsurf, VS Code, Claude Code, and 19+ other IDEs.

What are the use cases for babysit-pr?

Key use cases include: Automating GitHub pull request monitoring for merge readiness, Debugging CI infrastructure issues and flaky test failures, Generating notifications for user intervention on blocked merges.

Which IDEs are compatible with babysit-pr?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for babysit-pr?

Requires GitHub API access and CI infrastructure setup. Limited to GitHub pull requests only. Dependent on CI check and review comment availability.

How To Install

  1. 1. Open your terminal

    Open the terminal or command line in your project directory.

  2. 2. Run the install command

    Run: npx killer-skills add Agents2AgentsAI/ata/babysit-pr. The CLI will automatically detect your IDE or AI agent and configure the skill.

  3. 3. Start using the skill

    The skill is now active. Your AI agent can use babysit-pr immediately in the current project.

Upstream Repository Material

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

Upstream Source

babysit-pr

Streamline your GitHub workflow with babysit-pr, an AI agent skill that continuously monitors PRs, diagnoses failures, and auto-fixes issues - try it now!

SKILL.md
Readonly
Upstream Repository Material
The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.
Supporting Evidence

PR Babysitter

Objective

Babysit a PR persistently until one of these terminal outcomes occurs:

  • The PR is merged or closed.
  • CI is successful, there are no unaddressed review comments surfaced by the watcher, required review approval is not blocking merge, and there are no potential merge conflicts (PR is mergeable / not reporting conflict risk).
  • A situation requires user help (for example CI infrastructure issues, repeated flaky failures after retry budget is exhausted, permission problems, or ambiguity that cannot be resolved safely).

Do not stop merely because a single snapshot returns idle while checks are still pending.

Inputs

Accept any of the following:

  • No PR argument: infer the PR from the current branch (--pr auto)
  • PR number
  • PR URL

Core Workflow

  1. When the user asks to "monitor"/"watch"/"babysit" a PR, start with the watcher's continuous mode (--watch) unless you are intentionally doing a one-shot diagnostic snapshot.
  2. Run the watcher script to snapshot PR/CI/review state (or consume each streamed snapshot from --watch).
  3. Inspect the actions list in the JSON response.
  4. If diagnose_ci_failure is present, inspect failed run logs and classify the failure.
  5. If the failure is likely caused by the current branch, patch code locally, commit, and push.
  6. If process_review_comment is present, inspect surfaced review items and decide whether to address them.
  7. If a review item is actionable and correct, patch code locally, commit, and push.
  8. If the failure is likely flaky/unrelated and retry_failed_checks is present, rerun failed jobs with --retry-failed-now.
  9. If both actionable review feedback and retry_failed_checks are present, prioritize review feedback first; a new commit will retrigger CI, so avoid rerunning flaky checks on the old SHA unless you intentionally defer the review change.
  10. On every loop, verify mergeability / merge-conflict status (for example via gh pr view) in addition to CI and review state.
  11. After any push or rerun action, immediately return to step 1 and continue polling on the updated SHA/state.
  12. If you had been using --watch before pausing to patch/commit/push, relaunch --watch yourself in the same turn immediately after the push (do not wait for the user to re-invoke the skill).
  13. Repeat polling until the PR is green + review-clean + mergeable, stop_pr_closed appears, or a user-help-required blocker is reached.
  14. Maintain terminal/session ownership: while babysitting is active, keep consuming watcher output in the same turn; do not leave a detached --watch process running and then end the turn as if monitoring were complete.

Commands

One-shot snapshot

bash
1python3 .codex/skills/babysit-pr/scripts/gh_pr_watch.py --pr auto --once

Continuous watch (JSONL)

bash
1python3 .codex/skills/babysit-pr/scripts/gh_pr_watch.py --pr auto --watch

Trigger flaky retry cycle (only when watcher indicates)

bash
1python3 .codex/skills/babysit-pr/scripts/gh_pr_watch.py --pr auto --retry-failed-now

Explicit PR target

bash
1python3 .codex/skills/babysit-pr/scripts/gh_pr_watch.py --pr <number-or-url> --once

CI Failure Classification

Use gh commands to inspect failed runs before deciding to rerun.

  • gh run view <run-id> --json jobs,name,workflowName,conclusion,status,url,headSha
  • gh run view <run-id> --log-failed

Prefer treating failures as branch-related when logs point to changed code (compile/test/lint/typecheck/snapshots/static analysis in touched areas).

Prefer treating failures as flaky/unrelated when logs show transient infra/external issues (timeouts, runner provisioning failures, registry/network outages, GitHub Actions infra errors).

If classification is ambiguous, perform one manual diagnosis attempt before choosing rerun.

Read .codex/skills/babysit-pr/references/heuristics.md for a concise checklist.

Review Comment Handling

The watcher surfaces review items from:

  • PR issue comments
  • Inline review comments
  • Review submissions (COMMENT / APPROVED / CHANGES_REQUESTED)

It intentionally surfaces Codex reviewer bot feedback (for example comments/reviews from chatgpt-codex-connector[bot]) in addition to human reviewer feedback. Most unrelated bot noise should still be ignored. For safety, the watcher only auto-surfaces trusted human review authors (for example repo OWNER/MEMBER/COLLABORATOR, plus the authenticated operator) and approved review bots such as Codex. On a fresh watcher state file, existing pending review feedback may be surfaced immediately (not only comments that arrive after monitoring starts). This is intentional so already-open review comments are not missed.

When you agree with a comment and it is actionable:

  1. Patch code locally.
  2. Commit with codex: address PR review feedback (#<n>).
  3. Push to the PR head branch.
  4. Resume watching on the new SHA immediately (do not stop after reporting the push).
  5. If monitoring was running in --watch mode, restart --watch immediately after the push in the same turn; do not wait for the user to ask again.

If you disagree or the comment is non-actionable/already addressed, record it as handled by continuing the watcher loop (the script de-duplicates surfaced items via state after surfacing them). If a code review comment/thread is already marked as resolved in GitHub, treat it as non-actionable and safely ignore it unless new unresolved follow-up feedback appears.

Git Safety Rules

  • Work only on the PR head branch.
  • Avoid destructive git commands.
  • Do not switch branches unless necessary to recover context.
  • Before editing, check for unrelated uncommitted changes. If present, stop and ask the user.
  • After each successful fix, commit and git push, then re-run the watcher.
  • If you interrupted a live --watch session to make the fix, restart --watch immediately after the push in the same turn.
  • Do not run multiple concurrent --watch processes for the same PR/state file; keep one watcher session active and reuse it until it stops or you intentionally restart it.
  • A push is not a terminal outcome; continue the monitoring loop unless a strict stop condition is met.

Commit message defaults:

  • codex: fix CI failure on PR #<n>
  • codex: address PR review feedback (#<n>)

Monitoring Loop Pattern

Use this loop in a live Codex session:

  1. Run --once.
  2. Read actions.
  3. First check whether the PR is now merged or otherwise closed; if so, report that terminal state and stop polling immediately.
  4. Check CI summary, new review items, and mergeability/conflict status.
  5. Diagnose CI failures and classify branch-related vs flaky/unrelated.
  6. Process actionable review comments before flaky reruns when both are present; if a review fix requires a commit, push it and skip rerunning failed checks on the old SHA.
  7. Retry failed checks only when retry_failed_checks is present and you are not about to replace the current SHA with a review/CI fix commit.
  8. If you pushed a commit or triggered a rerun, report the action briefly and continue polling (do not stop).
  9. After a review-fix push, proactively restart continuous monitoring (--watch) in the same turn unless a strict stop condition has already been reached.
  10. If everything is passing, mergeable, not blocked on required review approval, and there are no unaddressed review items, report success and stop.
  11. If blocked on a user-help-required issue (infra outage, exhausted flaky retries, unclear reviewer request, permissions), report the blocker and stop.
  12. Otherwise sleep according to the polling cadence below and repeat.

When the user explicitly asks to monitor/watch/babysit a PR, prefer --watch so polling continues autonomously in one command. Use repeated --once snapshots only for debugging, local testing, or when the user explicitly asks for a one-shot check. Do not stop to ask the user whether to continue polling; continue autonomously until a strict stop condition is met or the user explicitly interrupts. Do not hand control back to the user after a review-fix push just because a new SHA was created; restarting the watcher and re-entering the poll loop is part of the same babysitting task. If a --watch process is still running and no strict stop condition has been reached, the babysitting task is still in progress; keep streaming/consuming watcher output instead of ending the turn.

Polling Cadence

Use adaptive polling and continue monitoring even after CI turns green:

  • While CI is not green (pending/running/queued or failing): poll every 1 minute.
  • After CI turns green: start at every 1 minute, then back off exponentially when there is no change (for example 1m, 2m, 4m, 8m, 16m, 32m), capping at every 1 hour.
  • Reset the green-state polling interval back to 1 minute whenever anything changes (new commit/SHA, check status changes, new review comments, mergeability changes, review decision changes).
  • If CI stops being green again (new commit, rerun, or regression): return to 1-minute polling.
  • If any poll shows the PR is merged or otherwise closed: stop polling immediately and report the terminal state.

Stop Conditions (Strict)

Stop only when one of the following is true:

  • PR merged or closed (stop as soon as a poll/snapshot confirms this).
  • PR is ready to merge: CI succeeded, no surfaced unaddressed review comments, not blocked on required review approval, and no merge conflict risk.
  • User intervention is required and Codex cannot safely proceed alone.

Keep polling when:

  • actions contains only idle but checks are still pending.
  • CI is still running/queued.
  • Review state is quiet but CI is not terminal.
  • CI is green but mergeability is unknown/pending.
  • CI is green and mergeable, but the PR is still open and you are waiting for possible new review comments or merge-conflict changes per the green-state cadence.
  • The PR is green but blocked on review approval (REVIEW_REQUIRED / similar); continue polling on the green-state cadence and surface any new review comments without asking for confirmation to keep watching.

Output Expectations

Provide concise progress updates while monitoring and a final summary that includes:

  • During long unchanged monitoring periods, avoid emitting a full update on every poll; summarize only status changes plus occasional heartbeat updates.

  • Treat push confirmations, intermediate CI snapshots, and review-action updates as progress updates only; do not emit the final summary or end the babysitting session unless a strict stop condition is met.

  • A user request to "monitor" is not satisfied by a couple of sample polls; remain in the loop until a strict stop condition or an explicit user interruption.

  • A review-fix commit + push is not a completion event; immediately resume live monitoring (--watch) in the same turn and continue reporting progress updates.

  • When CI first transitions to all green for the current SHA, emit a one-time celebratory progress update (do not repeat it on every green poll). Preferred style: 🚀 CI is all green! 33/33 passed. Still on watch for review approval.

  • Do not send the final summary while a watcher terminal is still running unless the watcher has emitted/confirmed a strict stop condition; otherwise continue with progress updates.

  • Final PR SHA

  • CI status summary

  • Mergeability / conflict status

  • Fixes pushed

  • Flaky retry cycles used

  • Remaining unresolved failures or review comments

References

  • Heuristics and decision tree: .codex/skills/babysit-pr/references/heuristics.md
  • GitHub CLI/API details used by the watcher: .codex/skills/babysit-pr/references/github-api-notes.md

Related Skills

Looking for an alternative to babysit-pr or another community skill for your workflow? Explore these related open-source skills.

View All

openclaw-release-maintainer

Logo of openclaw
openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

333.8k
0
AI

widget-generator

Logo of f
f

Generate customizable widget plugins for the prompts.chat feed system

149.6k
0
AI

flags

Logo of vercel
vercel

The React Framework

138.4k
0
Browser

pr-review

Logo of pytorch
pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

98.6k
0
Developer