Healthcare AI Evaluation
Guide evaluation of healthcare AI systems with domain-specific safety criteria, clinical accuracy rubrics, and score interpretation. Use when building or reviewing health/medical AI evaluations.
浏览和安装 Killer-Skills 目录中的数千个 AI Agent 技能。支持 Claude Code、Windsurf、Cursor 等。
Guide evaluation of healthcare AI systems with domain-specific safety criteria, clinical accuracy rubrics, and score interpretation. Use when building or reviewing health/medical AI evaluations.
TypeScript SDK patterns for Opik. Use when working in sdks/typescript.
Fully Autonomous AI Research System with Self-Evolution, built natively on Claude Code
Fully Autonomous AI Research System with Self-Evolution, built natively on Claude Code
Your agents forget. Neotoma makes them remember.
Generate and verify E2E tests for a feature. Explores live app, creates test plan, generates tests, runs and fixes until passing.
Debug stuck Hawk/Inspect AI evaluations. Use when user mentions stuck eval, eval not progressing, eval hanging, samples not completing, eval set frozen, runner stuck, 500 errors in eval, retry loop, eval timeout, or asks why an evaluation isnt finishing.
Manages project progress tracking and maintains a chronological log of completed tasks, decisions, and updates. Use when completing milestones, making architectural decisions, or documenting project evolution. Creates and updates LOG.md file based on CLAUDE.md context.
Bud AI Foundry - A comprehensive inference stack for compound AI deployment, optimization and scaling. Bud Stack provides intelligent infrastructure automation, performance optimization, and seamless model deployment across multi-cloud/multi-hardware environments.
The github-ops AI agent skill automates GitHub operations, including PRs, issues, and releases, to boost developer productivity. It integrates with GitHub API for seamless workflow automation.
Hawk Job监控是一种用于监视运行或完成的作业的功能,包括查看日志和状态
Padrões de teste para Node.js/TypeScript backend com Jest e Supertest. Inclui unit tests, integration tests, mocking e boas práticas. Use ao escrever testes de API, services ou revisar estratégia de testes backend.