agent-browser — community agent-browser, momentum-cms, community, ide skills, Claude Code, Cursor, Windsurf

v1.0.0

About this skill

Well suited to AI agents that need advanced browser automation, especially when integrated with an Angular SSR CMS and Claude Code. Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to open a website, fill out a form, click a button, take a screenshot, scrape data from a page, test this web app, log in to a site, automate browser actions, or any task requiring programmatic web interaction.

DonaldMurillo
[5]
[1]
Updated: 3/10/2026

Killer-Skills Review

Decision support comes first. Repository text comes second.

Reference-Only Page Review Score: 9/11

This page remains useful for operators, but Killer-Skills treats it as reference material instead of a primary organic landing page.

  • Original recommendation layer
  • Concrete use-case guidance
  • Explicit limitations and caution
  • Quality floor passed for review

Review Score: 9/11
Quality Score: 60
Canonical Locale: en
Detected Body Locale: en


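Core value

Gives agents the ability to automate browser interactions through navigation, snapshots, and element interaction, drawing on browser automation, Angular SSR CMS, and Claude Code capabilities for efficient workflow management.

Suitable agent types

Well suited to AI agents that need advanced browser automation, especially when integrated with an Angular SSR CMS and Claude Code.

Key capabilities · agent-browser

Automatically submit web forms
Debug browser-based applications using snapshot analysis
Generate test scenarios for web application interactions

! Limitations and prerequisites

  • Requires an Angular SSR CMS setup
  • Limited to browser automation integrated with Claude Code
  • Requires specific element references for interaction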

Why this page is reference-only

  • Current locale does not satisfy the locale-governance contract.

Source Boundary

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

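Next steps after review

Decide on an action first, then continue to the upstream repository material

The main value of Killer-Skills should not stop at "opening the repository README for you". It should first help you judge whether this skill is worth installing, whether you should go back to the trusted collection for a second check, and whether the skill has already reached the workflow rollout stage.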


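FAQ and installation steps

The questions and steps below match the page's structured data, which helps search engines understand the page content.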

? FAQ

What is agent-browser?

Well suited to AI agents that need advanced browser automation, especially when integrated with an Angular SSR CMS and Claude Code. Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to open a website, fill out a form, click a button, take a screenshot, scrape data from a page, test this web app, log in to a site, automate browser actions, or any task requiring programmatic web interaction.

How do I install agent-browser?

Run: npx killer-skills add DonaldMurillo/momentum-cms/agent-browser. Supports 19+ IDEs and agents, including Cursor, Windsurf, VS Code, and Claude Code.

What scenarios is agent-browser suited to?

Typical scenarios include automatically submitting web forms, debugging browser-based applications using snapshot analysis, and generating test scenarios for web application interactions.

Which IDEs or agents does agent-browser support?

This skill is compatible with Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer. A single Killer-Skills CLI command installs it for all of them.

What are the limitations of agent-browser?

Requires an Angular SSR CMS setup; limited to browser automation integrated with Claude Code; requires specific element references for interaction.

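Installation steps

  1. Open a terminal

    Open a terminal or command line in your project directory.

  2. Run the install command

    Run: npx killer-skills add DonaldMurillo/momentum-cms/agent-browser. The CLI automatically detects your IDE or AI agent and completes the configuration.

  3. Start using the skill

    agent-browser is now enabled and can be invoked in the current project right away.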

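! Reference page mode

This page can still be used as an installation and lookup reference, but Killer-Skills no longer treats it as a primary indexable landing page. Read the review conclusions above first, then decide whether to continue to the upstream repository README.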

Upstream Repository Material

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

Upstream Source

agent-browser

Install agent-browser, an AI Agent Skill for AI agent workflows and automation. Supports Claude Code, Cursor, and Windsurf with one-command installation.

SKILL.md
Supporting Evidence

Browser Automation with agent-browser

Core Workflow

Every browser automation follows this pattern:

  1. Navigate: agent-browser open <url>
  2. Snapshot: agent-browser snapshot -i (get element refs like @e1, @e2)
  3. Interact: Use refs to click, fill, select
  4. Re-snapshot: After navigation or DOM changes, get fresh refs

```bash
agent-browser open https://example.com/form
agent-browser snapshot -i
# Output: @e1 [input type="email"], @e2 [input type="password"], @e3 [button] "Submit"

agent-browser fill @e1 "user@example.com"
agent-browser fill @e2 "password123"
agent-browser click @e3
agent-browser wait --load networkidle
agent-browser snapshot -i  # Check result
```
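The first two steps of the workflow can be wrapped in a small shell function so that a failed navigation never leads to a snapshot of the wrong page. This is a minimal sketch: `open_and_snapshot` is a hypothetical helper name, and it assumes `agent-browser` exits with a non-zero status when a command fails, which the source does not state.

```bash
# Hypothetical helper (not part of agent-browser): navigate, then snapshot.
# Assumption: agent-browser returns a non-zero exit status on failure.
open_and_snapshot() {
  url="$1"
  if [ -z "$url" ]; then
    echo "usage: open_and_snapshot <url>" >&2
    return 2
  fi
  # Step 1: navigate. Stop here if navigation fails.
  agent-browser open "$url" || return 1
  # Step 2: print interactive elements with their @refs.
  agent-browser snapshot -i
}
```

Calling `open_and_snapshot https://example.com/form` prints the snapshot, from which the refs for the following `fill` and `click` commands are read.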

Essential Commands

```bash
# Navigation
agent-browser open <url>               # Navigate (aliases: goto, navigate)
agent-browser close                    # Close browser

# Snapshot
agent-browser snapshot -i              # Interactive elements with refs (recommended)
agent-browser snapshot -s "#selector"  # Scope to CSS selector

# Interaction (use @refs from snapshot)
agent-browser click @e1                # Click element
agent-browser fill @e2 "text"          # Clear and type text
agent-browser type @e2 "text"          # Type without clearing
agent-browser select @e1 "option"      # Select dropdown option
agent-browser check @e1                # Check checkbox
agent-browser press Enter              # Press key
agent-browser scroll down 500          # Scroll page

# Get information
agent-browser get text @e1             # Get element text
agent-browser get url                  # Get current URL
agent-browser get title                # Get page title

# Wait
agent-browser wait @e1                 # Wait for element
agent-browser wait --load networkidle  # Wait for network idle
agent-browser wait --url "**/page"     # Wait for URL pattern
agent-browser wait 2000                # Wait milliseconds

# Capture
agent-browser screenshot               # Screenshot to temp dir
agent-browser screenshot --full        # Full page screenshot
agent-browser pdf output.pdf           # Save as PDF
```

Common Patterns

Form Submission

```bash
agent-browser open https://example.com/signup
agent-browser snapshot -i
agent-browser fill @e1 "Jane Doe"
agent-browser fill @e2 "jane@example.com"
agent-browser select @e3 "California"
agent-browser check @e4
agent-browser click @e5
agent-browser wait --load networkidle
```

Authentication with State Persistence

```bash
# Login once and save state
agent-browser open https://app.example.com/login
agent-browser snapshot -i
agent-browser fill @e1 "$USERNAME"
agent-browser fill @e2 "$PASSWORD"
agent-browser click @e3
agent-browser wait --url "**/dashboard"
agent-browser state save auth.json

# Reuse in future sessions
agent-browser state load auth.json
agent-browser open https://app.example.com/dashboard
```
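The "reuse in future sessions" half of this pattern only works when the state file already exists, so a script may want to check for it before loading. The sketch below does that. `open_with_saved_state` is a hypothetical helper name, and the exit code 3 is an arbitrary choice for "no saved state yet".

```bash
# Hypothetical helper (not part of agent-browser): reuse saved auth state if present.
open_with_saved_state() {
  state_file="$1"
  url="$2"
  if [ ! -f "$state_file" ]; then
    # No saved state: the login flow and `agent-browser state save` must run first.
    echo "no saved state at $state_file; log in and save state first" >&2
    return 3
  fi
  agent-browser state load "$state_file" || return 1
  agent-browser open "$url"
}
```

For example, `open_with_saved_state auth.json https://app.example.com/dashboard` loads `auth.json` and opens the dashboard, or returns 3 so the caller can fall back to the login steps.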

Data Extraction

```bash
agent-browser open https://example.com/products
agent-browser snapshot -i
agent-browser get text @e5              # Get specific element text
agent-browser get text body > page.txt  # Get all page text

# JSON output for parsing
agent-browser snapshot -i --json
agent-browser get text @e1 --json
```
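When scripting against the plain-text snapshot, it can help to pull out just the refs. The filter below is a sketch that assumes the text output resembles the sample shown under Core Workflow (`@e1 [input type="email"], ...`); the exact snapshot format is not specified here, and `list_refs` is a hypothetical helper name.

```bash
# Hypothetical helper (not part of agent-browser): list unique @refs from stdin.
# Assumption: refs appear as "@e" followed by digits, as in the sample output.
list_refs() {
  # grep -o prints each match on its own line; awk drops repeats, keeping first-seen order.
  grep -o '@e[0-9][0-9]*' | awk '!seen[$0]++'
}
```

A typical use would be `agent-browser snapshot -i | list_refs`. Email addresses such as `user@example.com` are not matched, because the pattern requires a digit right after `@e`.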

Parallel Sessions

```bash
agent-browser --session site1 open https://site-a.com
agent-browser --session site2 open https://site-b.com

agent-browser --session site1 snapshot -i
agent-browser --session site2 snapshot -i

agent-browser session list
```
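For more than a couple of sites, the same pattern can be driven by a loop. This sketch opens each URL in its own named session and then lists the sessions. `open_sessions` and the `site1`, `site2`, ... naming scheme are illustrative choices; only `--session` and `session list` come from the commands above.

```bash
# Hypothetical helper (not part of agent-browser): one named session per URL.
open_sessions() {
  n=0
  for url in "$@"; do
    n=$((n + 1))
    # Session names site1, site2, ... follow the order of the arguments.
    agent-browser --session "site$n" open "$url" || return 1
  done
  agent-browser session list
}
```

For example, `open_sessions https://site-a.com https://site-b.com` issues the same two `open` commands as the block above, followed by `session list`.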

Visual Browser (Debugging)

```bash
agent-browser --headed open https://example.com
agent-browser highlight @e1           # Highlight element
agent-browser record start demo.webm  # Record session
```

iOS Simulator (Mobile Safari)

```bash
# List available iOS simulators
agent-browser device list

# Launch Safari on a specific device
agent-browser -p ios --device "iPhone 16 Pro" open https://example.com

# Same workflow as desktop - snapshot, interact, re-snapshot
agent-browser -p ios snapshot -i
agent-browser -p ios tap @e1          # Tap (alias for click)
agent-browser -p ios fill @e2 "text"
agent-browser -p ios swipe up         # Mobile-specific gesture

# Take screenshot
agent-browser -p ios screenshot mobile.png

# Close session (shuts down simulator)
agent-browser -p ios close
```

Requirements: macOS with Xcode, Appium (npm install -g appium && appium driver install xcuitest)

Real devices: Works with physical iOS devices if pre-configured. Use --device "<UDID>" where UDID is from xcrun xctrace list devices.
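Before running the iOS commands, a script can check that the tools named in the requirements are on the PATH. The sketch below only checks for the `xcrun` and `appium` executables; it does not verify that the xcuitest driver is installed or that a simulator is available. `check_ios_prereqs` is a hypothetical helper name.

```bash
# Hypothetical helper (not part of agent-browser): check iOS tooling is on PATH.
check_ios_prereqs() {
  missing=0
  for tool in xcrun appium; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool" >&2
      missing=1
    fi
  done
  # Returns 0 when both tools were found, 1 otherwise.
  return "$missing"
}
```

A guard such as `check_ios_prereqs && agent-browser device list` then skips the iOS workflow on machines without the tooling.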

Ref Lifecycle (Important)

Refs (@e1, @e2, etc.) are invalidated when the page changes. Always re-snapshot after:

  • Clicking links or buttons that navigate
  • Form submissions
  • Dynamic content loading (dropdowns, modals)

```bash
agent-browser click @e5     # Navigates to new page
agent-browser snapshot -i   # MUST re-snapshot
agent-browser click @e1     # Use new refs
```
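One way to make the re-snapshot rule hard to forget is to bundle the click, the wait, and the fresh snapshot into one function. This is a sketch under assumptions: `click_and_resnapshot` is a hypothetical helper name, and waiting for `networkidle` is taken from the Core Workflow example rather than being required for every page.

```bash
# Hypothetical helper (not part of agent-browser): click, wait, then get fresh refs.
click_and_resnapshot() {
  ref="$1"
  # Reject anything that does not look like a ref (@e followed by a digit).
  case "$ref" in
    @e[0-9]*) ;;
    *) echo "expected a ref like @e1, got: $ref" >&2; return 2 ;;
  esac
  agent-browser click "$ref" || return 1
  agent-browser wait --load networkidle || return 1
  # Old refs are invalid after the click; print the new ones.
  agent-browser snapshot -i
}
```

After `click_and_resnapshot @e5`, the refs printed by the snapshot are the only ones that should be used for the next interaction.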

Semantic Locators (Alternative to Refs)

When refs are unavailable or unreliable, use semantic locators:

```bash
agent-browser find text "Sign In" click
agent-browser find label "Email" fill "user@test.com"
agent-browser find role button click --name "Submit"
agent-browser find placeholder "Search" type "query"
agent-browser find testid "submit-btn" click
```

Deep-Dive Documentation

| Reference | When to Use |
| --- | --- |
| references/commands.md | Full command reference with all options |
| references/snapshot-refs.md | Ref lifecycle, invalidation rules, troubleshooting |
| references/session-management.md | Parallel sessions, state persistence, concurrent scraping |
| references/authentication.md | Login flows, OAuth, 2FA handling, state reuse |
| references/video-recording.md | Recording workflows for debugging and documentation |
| references/proxy-support.md | Proxy configuration, geo-testing, rotating proxies |

Ready-to-Use Templates

| Template | Description |
| --- | --- |
| templates/form-automation.sh | Form filling with validation |
| templates/authenticated-session.sh | Login once, reuse state |
| templates/capture-workflow.sh | Content extraction with screenshots |

```bash
./templates/form-automation.sh https://example.com/form
./templates/authenticated-session.sh https://app.example.com/login
./templates/capture-workflow.sh https://example.com ./output
```
