split-pdf — for Claude Code split-pdf, jaehyun-claude-workflow, community, for Claude Code, ide skills, articles, **Directory convention:**, The original PDF remains in, Deep-Read, Academic

v1.0.0

关于此技能

适用场景: Ideal for AI agents that need split-pdf: download, split, and deep-read academic papers. 本地化技能摘要: jaehyun-claude-workflow # Split-PDF: Download, Split, and Deep-Read Academic Papers CRITICAL RULE: Never read a full PDF. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.

功能特性

Split-PDF: Download, Split, and Deep-Read Academic Papers
When This Skill Is Invoked
The user wants you to read, review, or summarize an academic paper. The input is either:
A file path to a local PDF (e.g., ./articles/smith 2024.pdf)
A search query or paper title (e.g., "Gentzkow Shapiro Sinkinson 2014 competition newspapers")

# 核心主题

choi-jaehyun choi-jaehyun
[0]
[0]
更新于: 3/26/2026

Killer-Skills Review

Decision support comes first. Repository text comes second.

Reference-Only Page Review Score: 10/11

This page remains useful for teams, but Killer-Skills treats it as reference material instead of a primary organic landing page.

Original recommendation layer Concrete use-case guidance Explicit limitations and caution Quality floor passed for review
Review Score
10/11
Quality Score
55
Canonical Locale
en
Detected Body Locale
en

适用场景: Ideal for AI agents that need split-pdf: download, split, and deep-read academic papers. 本地化技能摘要: jaehyun-claude-workflow # Split-PDF: Download, Split, and Deep-Read Academic Papers CRITICAL RULE: Never read a full PDF. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.

核心价值

推荐说明: split-pdf helps agents split-pdf: download, split, and deep-read academic papers. jaehyun-claude-workflow # Split-PDF: Download, Split, and Deep-Read Academic Papers CRITICAL RULE: Never read a full PDF. This AI

适用 Agent 类型

适用场景: Ideal for AI agents that need split-pdf: download, split, and deep-read academic papers.

赋予的主要能力 · split-pdf

适用任务: Applying Split-PDF: Download, Split, and Deep-Read Academic Papers
适用任务: Applying When This Skill Is Invoked
适用任务: Applying The user wants you to read, review, or summarize an academic paper. The input is either:

! 使用限制与门槛

  • 限制说明: If the file is NOT already inside ./articles/, copy it there (do not move — preserve the original location)
  • 限制说明: Only read the 4-page split files, and only 3 splits at a time ( 12 pages)
  • 限制说明: The downloaded or provided PDF in ./articles/ must NEVER be deleted, moved, or overwritten at any point in this workflow

Why this page is reference-only

  • - Current locale does not satisfy the locale-governance contract.

Source Boundary

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

评审后的下一步

先决定动作,再继续看上游仓库材料

Killer-Skills 的主价值不应该停在“帮你打开仓库说明”,而是先帮你判断这项技能是否值得安装、是否应该回到可信集合复核,以及是否已经进入工作流落地阶段。

实验室 Demo

Browser Sandbox Environment

⚡️ Ready to unleash?

Experience this Agent in a zero-setup browser environment powered by WebContainers. No installation required.

Boot Container Sandbox

常见问题与安装步骤

以下问题与步骤与页面结构化数据保持一致,便于搜索引擎理解页面内容。

? FAQ

split-pdf 是什么?

适用场景: Ideal for AI agents that need split-pdf: download, split, and deep-read academic papers. 本地化技能摘要: jaehyun-claude-workflow # Split-PDF: Download, Split, and Deep-Read Academic Papers CRITICAL RULE: Never read a full PDF. This AI agent skill supports Claude Code, Cursor, and Windsurf workflows.

如何安装 split-pdf?

运行命令:npx killer-skills add choi-jaehyun/jaehyun-claude-workflow/split-pdf。支持 Cursor、Windsurf、VS Code、Claude Code 等 19+ IDE/Agent。

split-pdf 适用于哪些场景?

典型场景包括:适用任务: Applying Split-PDF: Download, Split, and Deep-Read Academic Papers、适用任务: Applying When This Skill Is Invoked、适用任务: Applying The user wants you to read, review, or summarize an academic paper. The input is either:。

split-pdf 支持哪些 IDE 或 Agent?

该技能兼容 Cursor, Windsurf, VS Code, Trae, Claude Code, OpenClaw, Aider, Codex, OpenCode, Goose, Cline, Roo Code, Kiro, Augment Code, Continue, GitHub Copilot, Sourcegraph Cody, and Amazon Q Developer。可使用 Killer-Skills CLI 一条命令通用安装。

split-pdf 有哪些限制?

限制说明: If the file is NOT already inside ./articles/, copy it there (do not move — preserve the original location);限制说明: Only read the 4-page split files, and only 3 splits at a time ( 12 pages);限制说明: The downloaded or provided PDF in ./articles/ must NEVER be deleted, moved, or overwritten at any point in this workflow。

安装步骤

  1. 1. 打开终端

    在你的项目目录中打开终端或命令行。

  2. 2. 执行安装命令

    运行:npx killer-skills add choi-jaehyun/jaehyun-claude-workflow/split-pdf。CLI 会自动识别 IDE 或 AI Agent 并完成配置。

  3. 3. 开始使用技能

    split-pdf 已启用,可立即在当前项目中调用。

! 参考页模式

此页面仍可作为安装与查阅参考,但 Killer-Skills 不再把它视为主要可索引落地页。请优先阅读上方评审结论,再决定是否继续查看上游仓库说明。

Upstream Repository Material

The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.

Upstream Source

split-pdf

jaehyun-claude-workflow # Split-PDF: Download, Split, and Deep-Read Academic Papers CRITICAL RULE: Never read a full PDF. This AI agent skill supports Claude

SKILL.md
Readonly
Upstream Repository Material
The section below is imported from the upstream repository and should be treated as secondary evidence. Use the Killer-Skills review above as the primary layer for fit, risk, and installation decisions.
Supporting Evidence

Split-PDF: Download, Split, and Deep-Read Academic Papers

CRITICAL RULE: Never read a full PDF. Never. Only read the 4-page split files, and only 3 splits at a time (~12 pages). Reading a full PDF will either crash the session with an unrecoverable "prompt too long" error — destroying all context — or produce shallow, hallucinated output. There are no exceptions.

When This Skill Is Invoked

The user wants you to read, review, or summarize an academic paper. The input is either:

  • A file path to a local PDF (e.g., ./articles/smith_2024.pdf)
  • A search query or paper title (e.g., "Gentzkow Shapiro Sinkinson 2014 competition newspapers")

Important: You cannot search for a paper you don't know exists. The user MUST provide either a file path or a specific search query — an author name, a title, keywords, a year, or some combination that identifies the paper. If the user invokes this skill without specifying what paper to read, ask them. Do not guess.

Step 1: Acquire the PDF

If a local file path is provided:

  • Verify the file exists
  • If the file is NOT already inside ./articles/, copy it there (do not move — preserve the original location)
  • Proceed to Step 2

If a search query or paper title is provided:

  1. Use WebSearch to find the paper
  2. Use WebFetch or Bash (curl/wget) to download the PDF
  3. Save it to ./articles/ in the project directory (create the directory if needed)
  4. Proceed to Step 2

CRITICAL: Always preserve the original PDF. The downloaded or provided PDF in ./articles/ must NEVER be deleted, moved, or overwritten at any point in this workflow. The split files are derivatives — the original is the permanent artifact. Do not clean up, do not remove, do not tidy. The original stays.

Step 2: Split the PDF

Create a subdirectory for the splits and run the splitting script:

python
1from PyPDF2 import PdfReader, PdfWriter 2import os, sys 3 4def split_pdf(input_path, output_dir, pages_per_chunk=4): 5 os.makedirs(output_dir, exist_ok=True) 6 reader = PdfReader(input_path) 7 total = len(reader.pages) 8 prefix = os.path.splitext(os.path.basename(input_path))[0] 9 10 for start in range(0, total, pages_per_chunk): 11 end = min(start + pages_per_chunk, total) 12 writer = PdfWriter() 13 for i in range(start, end): 14 writer.add_page(reader.pages[i]) 15 16 out_name = f"{prefix}_pp{start+1}-{end}.pdf" 17 out_path = os.path.join(output_dir, out_name) 18 with open(out_path, "wb") as f: 19 writer.write(f) 20 21 print(f"Split {total} pages into {-(-total // pages_per_chunk)} chunks in {output_dir}")

Directory convention:

articles/
├── smith_2024.pdf                    # original PDF — NEVER DELETE THIS
└── split_smith_2024/                 # split subdirectory
    ├── smith_2024_pp1-4.pdf
    ├── smith_2024_pp5-8.pdf
    ├── smith_2024_pp9-12.pdf
    └── ...

The original PDF remains in articles/ permanently. The splits are working copies. If anything goes wrong, you can always re-split from the original.

If PyPDF2 is not installed, install it: pip install PyPDF2

Step 3: Read in Batches of 3 Splits

Read exactly 3 split files at a time (~12 pages). After each batch:

  1. Read the 3 split PDFs using the Read tool
  2. Update the running notes file (notes.md in the split subdirectory)
  3. Pause and tell the user:

"I have finished reading splits [X-Y] and updated the notes. I have [N] more splits remaining. Would you like me to continue with the next 3?"

  1. Wait for the user to confirm before reading the next batch

Do NOT read ahead. Do NOT read all splits at once. The pause-and-confirm protocol is mandatory.

Step 4: Structured Extraction

As you read, collect information along these dimensions and write them into notes.md:

  1. Research question — What is the paper asking and why does it matter?
  2. Audience — Which sub-community of researchers cares about this?
  3. Method — How do they answer the question? What is the identification strategy?
  4. Data — What data do they use? Where precisely did they find it? What is the unit of observation? Sample size? Time period?
  5. Statistical methods — What econometric or statistical techniques do they use? What are the key specifications?
  6. Findings — What are the main results? Key coefficient estimates and standard errors?
  7. Contributions — What is learned from this exercise that we didn't know before?
  8. Replication feasibility — Is the data publicly available? Is there a replication archive? A data appendix? URLs for the underlying data?

These questions extract what a researcher needs to build on or replicate the work — a structured extraction more detailed and specific than a typical summary.

The Notes File

The output is notes.md in the split subdirectory:

articles/split_smith_2024/notes.md

This file is updated incrementally after each batch. Structure it with clear headers for each of the 8 dimensions. After each batch, update whichever dimensions have new information — do not rewrite from scratch.

By the time all splits are read, the notes should contain specific data sources, variable names, equation references, sample sizes, coefficient estimates, and standard errors. Not a summary — a structured extraction.

When NOT to Split

  • Papers shorter than ~15 pages: read directly (still use the Read tool, not Bash)
  • Policy briefs or non-technical documents: a rough summary is fine
  • Triage only: read just the first split (pages 1-4) for abstract and introduction

Quick Reference

StepAction
AcquireDownload to ./articles/ or use existing local file
Split4-page chunks into ./articles/split_<name>/
Read3 splits at a time, pause after each batch
WriteUpdate notes.md with structured extraction
ConfirmAsk user before continuing to next batch

For detailed explanation of why this method works, see methodology.md.

相关技能

寻找 split-pdf 的替代方案 (Alternative) 或可搭配使用的同类 community Skill?探索以下相关开源技能。

查看全部

openclaw-release-maintainer

Logo of openclaw
openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

333.8k
0
AI

widget-generator

Logo of f
f

为prompts.chat的信息反馈系统生成可定制的插件小部件

149.6k
0
AI

flags

Logo of vercel
vercel

React 框架

138.4k
0
浏览器

pr-review

Logo of pytorch
pytorch

Python中具有强大GPU加速的张量和动态神经网络

98.6k
0
开发者工具