Exploring:

Explore and install thousands of skills for AI agents in the Killer-Skills directory. Compatible with Claude Code, Windsurf, Cursor, and more.

6 skills available

sprint-review

rhesis-ai

Localized summary: Bring engineers, PMs, and domain experts together to generate tests, simulate (adversarial) conversations, and trace every failure to its root cause. It covers generative-ai, llm-evaluation, and llm-evaluation-framework workflows. This AI agent skill supports Claude Code, Cursor

pr-review

gonzoblasco

Localized summary: 🔬 Advanced LLM evaluation framework for testing and comparing prompt variants with an AI Judge. It covers ai-testing, anthropic, and llm-evaluation workflows. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.

eval-harness

[ Featured ]
affaan-m

Eval Harness is an evaluation framework for Claude Code sessions that measures the reliability and performance of AI agents.

171.1k
0
Developer

eval-harness

j7-dev

Localized summary: A rewrite of everything-claude-code for GitHub Copilot. Eval Harness Skill: a formal evaluation framework for Copilot CLI sessions, implementing eval-driven development (EDD) principles. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.

8
0
Developer

prompt-engineer

Jeffallan

Localized summary: Use when designing prompts for LLMs, optimizing model performance, building evaluation frameworks, or implementing advanced prompting techniques like chain-of-thought, few-shot learning, or structured outputs. This AI agent skill supports Claude Code, Cursor, and Windsurf

0
0
Developer

agent-evaluation

oimiragieo

Localized summary: Agents evaluate outputs, compute a weighted composite score, and emit a structured verdict with evidence citations. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.

14
0
Developer