evaluation
AIエージェント评估框架是用于评估AIエージェント系统性能的结构
浏览和安装 Killer-Skills 目录中的数千个 AI Agent 技能。支持 Claude Code、Windsurf、Cursor 等。
这个技能目录把可安装的 AI Agent 技能集中在一起,方便你按搜索、分类、主题和官方来源快速筛选,并直接安装到 Claude Code、Cursor、Windsurf 等环境。
AIエージェント评估框架是用于评估AIエージェント系统性能的结构
本地化技能摘要: Create new scientific tools for ToolUniverse framework with proper structure, validation, and testing. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
本地化技能摘要: Build slide decks and presentations for research talks using Nano Banana Pro AI. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
本地化技能摘要: Apply relevant best practices and validate outcomes. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
本地化技能摘要: Evaluate LLM systems using automated metrics, LLM-as-judge, and benchmarks. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
MCP Builder是一种用于构建MCP服务器的工具,实现AI系统与外部工具和数据源的连接
本地化技能摘要: Analyzes web performance using Chrome DevTools MCP. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
本地化技能摘要: Overcome LLM knowledge cutoffs with real-time developer content. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
本地化技能摘要: Control Chrome browser via CLI for testing, automation, and debugging. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
FastMCP开发是使用FastMCP框架创建或修改Model Context Protocol(MCP)服务器的过程
本地化技能摘要: Use telnet to interact with IoT device shells for pentesting operations including device enumeration, vulnerability discovery, credential testing, and post-exploitation. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.
本地化技能摘要: # Project Development Methodology This skill covers the principles for identifying tasks suited to LLM processing, designing effective project architectures, and iterating rapidly using agent-assisted development. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.