Benchmark Manager
Benchmark Manager는 AI 언어 평가 기준 관리 도구입니다
Browse AI and ML workflow skills for model integration, prompt engineering, evaluations, and LLM automation across major IDEs.
This directory brings installable AI Agent skills into one place so you can filter by search, category, topic, and official source, then install them directly into Claude Code, Cursor, Windsurf, and other supported environments.
Benchmark Manager는 AI 언어 평가 기준 관리 도구입니다
Debug stuck Hawk/Inspect AI evaluations. Use when user mentions stuck eval, eval not progressing, eval hanging, samples not completing, eval set frozen, runner stuck, 500 errors in eval, retry loop, eval timeout, or asks why an evaluation isnt finishing.
AI 에이전트 스킬 팀은 임시 팀을 자동으로 구성하는 능력
Collection of OpenSpec Schema for Workflows other than standard spec-driven schema that is included in OpenSpec.
One prompt. A full AI engineering team. Go lie on the couch. 🧠
AI Agent Skills Repo
🤖 Team onboarding kit for Claude Code AI coding assistant. Pre-configured with agents, skills, slash commands, and MCP integrations for Java 21/Spring Boot WebFlux, Angular, Flutter, PostgreSQL, and Firebase. Clone → install → start building.
Taskery is an AI agent skill that enables end-to-end task management, allowing developers to automate workflows, prioritize tasks, and streamline productivity. It provides a reusable instruction set for teaching other models how to operate Taskery.
Multi-agent system for software development
Official Cognite Data Fusion Toolkit CLI
Personal OS framework for orchestrating AI workers. Built on Ralph methodology.
setup [dev]는 AI 에이전트 스킬의 하나로 Pedro 저장소 설정과 Linux에서 가벼운 EDR 구현에 사용