What is reference-indexer?

Perfect for Knowledge Retrieval Agents that need advanced document indexing and search over reference files such as PDF and Word documents. Real-time alert system for volatility of B3 assets.

How do I install reference-indexer?

Run the command: npx killer-skills add arbgjr/satra/reference-indexer. It works with Cursor, Windsurf, VS Code, Claude Code, and 15+ other IDEs.

What are the use cases for reference-indexer?

Key use cases include indexing legal documents for quick reference, searching external knowledge bases for specific queries, and automating the addition of new reference materials to the RAG corpus.

Which IDEs are compatible with reference-indexer?

This skill is compatible with Cursor, Windsurf, VS Code, Claude Code, GitHub Copilot, JetBrains, Cline, Roo Code, and many more. Use the Killer-Skills CLI for universal one-command installation.

Are there any limitations for reference-indexer?

Requires filesystem access to read and process documents. Text extraction is limited to the supported formats: PDF, Word (.docx), Markdown, plain text, and HTML. Depends on the RAG corpus for indexing and search functionality.

Reference Indexer Skill

Name: reference-indexer
Author: arbgjr

Purpose

This skill manages external reference documents, indexing them for use in RAG.

Commands

/ref-add {path}

Adds a document to the reference index:

bash
/ref-add .agentic_sdlc/references/legal/lei-13775-2018.pdf

Actions:

Validates the file
Extracts text (if PDF/Word)
Creates an automatic summary
Adds it to the RAG corpus
Updates the index

/ref-search {query}

Searches the reference documents:

bash
/ref-search "prazo de aceite duplicata"

Returns:

Relevant documents
Excerpts with context
Relevance score
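
To make the three return values concrete, here is a minimal sketch of a search that yields a document id, an excerpt with context, and a relevance score. It is not the skill's actual retrieval method: a real RAG search would normally rank by embedding similarity, while this stand-in scores by keyword overlap. The names SearchHit, tokenize, and search, and the context width, are assumptions made for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class SearchHit:
    doc_id: str
    snippet: str   # excerpt with surrounding context
    score: float   # fraction of query terms found in the chunk


def tokenize(text: str) -> list[str]:
    """Lower-case the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def search(query: str, chunks: dict[str, list[str]], context: int = 40) -> list[SearchHit]:
    """Rank chunks by the share of query terms they contain (hypothetical scoring)."""
    terms = set(tokenize(query))
    hits = []
    for doc_id, doc_chunks in chunks.items():
        for chunk in doc_chunks:
            matched = terms & set(tokenize(chunk))
            if not matched:
                continue
            # Centre the snippet on the earliest matched term.
            lowered = chunk.lower()
            pos = min(lowered.find(term) for term in matched)
            start = max(0, pos - context)
            snippet = chunk[start:pos + context].strip()
            hits.append(SearchHit(doc_id, snippet, len(matched) / len(terms)))
    # Highest score first; ties keep their original order.
    return sorted(hits, key=lambda hit: hit.score, reverse=True)
```

Calling search("prazo de aceite duplicata", chunks) with a mapping of document ids to text chunks returns the hits ordered from most to least relevant.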

/ref-list

Lists all indexed documents:

bash
/ref-list

Shows:

Documents by category
Indexing status
Date added
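
The listing can be pictured as a grouping over the index entries. The sketch below assumes each entry is a dict with the path, category, indexed, and added_at fields used in _index.yml; the function name and the output line format are hypothetical, not the skill's real output.

```python
from collections import defaultdict


def list_by_category(documents: list[dict]) -> dict[str, list[str]]:
    """Group index entries by category and format one line per document."""
    grouped = defaultdict(list)
    for doc in documents:
        status = "indexed" if doc.get("indexed") else "pending"
        line = f'{doc["path"]} [{status}] added {doc.get("added_at", "unknown")}'
        grouped[doc.get("category", "uncategorized")].append(line)
    return dict(grouped)
```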

/ref-remove {path}

Removes a document from the index:

bash
/ref-remove .agentic_sdlc/references/legal/documento-antigo.pdf

Supported Formats

Format	Extension	Extraction Method
PDF	.pdf	pdftotext / PyPDF2
Word	.docx	python-docx
Markdown	.md	Direct
Text	.txt	Direct
HTML	.html	BeautifulSoup
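
One way to read the table is as a lookup from file extension to extraction method. The following sketch shows that dispatch, plus what "direct" extraction could look like for Markdown and plain text. The names EXTRACTORS, extraction_method, and read_direct are illustrative assumptions, as is reading direct files as UTF-8.

```python
from pathlib import Path

# Extension -> extraction method, mirroring the table above.
EXTRACTORS = {
    ".pdf": "pdftotext / PyPDF2",
    ".docx": "python-docx",
    ".md": "direct",
    ".txt": "direct",
    ".html": "BeautifulSoup",
}


def extraction_method(path: str) -> str:
    """Return the extraction method for a file, or raise if the format is unsupported."""
    suffix = Path(path).suffix.lower()
    try:
        return EXTRACTORS[suffix]
    except KeyError:
        raise ValueError(f"Unsupported format: {suffix or '(no extension)'}") from None


def read_direct(path: str) -> str:
    """Direct extraction for Markdown and plain text: read the file as UTF-8 (assumed)."""
    return Path(path).read_text(encoding="utf-8")
```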

Reference Structure

.agentic_sdlc/references/
├── legal/              # Laws, regulations, standards
├── technical/          # RFCs, technical specifications
├── business/           # Business rules, manuals
├── internal/           # Internal documents
└── _index.yml          # Document index
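
Because the category is encoded in the folder name, it can be derived from a document's path. This is a small sketch of that idea, assuming paths are given relative to the project root as in the command examples; category_of and the two constants are hypothetical names.

```python
from pathlib import PurePosixPath

REFERENCES_ROOT = PurePosixPath(".agentic_sdlc/references")
CATEGORIES = {"legal", "technical", "business", "internal"}


def category_of(path: str) -> str:
    """Return the category folder a reference file lives in."""
    # relative_to raises ValueError when the path is outside the references root.
    relative = PurePosixPath(path).relative_to(REFERENCES_ROOT)
    if len(relative.parts) < 2 or relative.parts[0] not in CATEGORIES:
        raise ValueError(f"{path} is not inside a known category folder")
    return relative.parts[0]
```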

Document Index

The _index.yml file:

yaml
index:
  version: 1
  updated_at: "2026-01-12T..."

documents:
  - id: "ref-001"
    path: "legal/lei-13775-2018.pdf"
    title: "Lei 13.775/2018 - Duplicatas Eletrônicas"
    category: legal
    added_at: "2026-01-12T..."
    indexed: true
    summary: "Lei que regulamenta as duplicatas escriturais..."
    keywords:
      - duplicata
      - escritural
      - eletronica
    page_count: 5

  - id: "ref-002"
    path: "technical/icp-brasil.pdf"
    title: "Padrões ICP-Brasil"
    category: technical
    added_at: "2026-01-12T..."
    indexed: true
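
When a document is added, the index needs a new entry with the next id in the ref-NNN sequence. The sketch below shows one way this could work on the already-parsed documents list. Parsing and writing the YAML file is left out, and next_ref_id, new_entry, and the choice to start entries with indexed set to false are assumptions rather than the skill's documented behavior.

```python
import re
from datetime import datetime, timezone


def next_ref_id(documents: list[dict]) -> str:
    """Return the next id in the ref-NNN sequence used by the index."""
    numbers = [
        int(match.group(1))
        for doc in documents
        if (match := re.fullmatch(r"ref-(\d+)", doc.get("id", "")))
    ]
    return f"ref-{max(numbers, default=0) + 1:03d}"


def new_entry(documents: list[dict], path: str, title: str, category: str) -> dict:
    """Build an index entry; it stays unindexed until the workflow completes (assumed)."""
    return {
        "id": next_ref_id(documents),
        "path": path,
        "title": title,
        "category": category,
        "added_at": datetime.now(timezone.utc).isoformat(timespec="seconds"),
        "indexed": False,
    }
```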

Text Extraction

PDF

bash
# Using pdftotext (poppler-utils)
pdftotext -layout input.pdf output.txt

# Using Python
python3 << 'EOF'
import PyPDF2

with open('input.pdf', 'rb') as f:
    reader = PyPDF2.PdfReader(f)
    text = ''
    for page in reader.pages:
        # extract_text() may yield nothing for image-only pages
        text += (page.extract_text() or '') + '\n'
    print(text)
EOF

Word (docx)

python
from docx import Document

doc = Document('input.docx')
# doc.paragraphs covers body paragraphs only; text inside tables is not included
text = '\n'.join(p.text for p in doc.paragraphs)
print(text)
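
The format table also lists HTML, for which no example is given. The README names BeautifulSoup as the extraction method. The sketch below deliberately swaps in the standard library's html.parser so it runs without extra dependencies, and it is only an approximation of what a BeautifulSoup-based extractor would return. TextExtractor and html_to_text are hypothetical names.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self._parts = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script/style elements.
        if not self._skip_depth and data.strip():
            self._parts.append(data.strip())

    def text(self) -> str:
        return "\n".join(self._parts)


def html_to_text(html: str) -> str:
    """Return the visible text of an HTML string, one text node per line."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any buffered text
    return parser.text()
```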

RAG Integration

Indexed documents are added to the RAG corpus:

yaml
corpus_entry:
  id: "ref-001"
  source: "references/legal/lei-13775-2018.pdf"
  type: "reference"
  category: "legal"
  content: "{extracted text}"
  embeddings: [...]  # Generated by the RAG
  metadata:
    title: "Lei 13.775/2018"
    page: 1
    section: "Art. 1"
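
As a rough illustration of how an index entry and its extracted text could be turned into corpus entries, the sketch below builds one entry per text chunk using the field names shown above. The per-chunk id suffix and the chunk field in metadata are assumptions introduced here; the README does not say how ids, pages, or sections are assigned.

```python
def build_corpus_entries(doc: dict, chunks: list[str]) -> list[dict]:
    """Create one corpus entry per chunk, following the field layout above."""
    entries = []
    for number, chunk in enumerate(chunks, start=1):
        entries.append({
            "id": f'{doc["id"]}-{number:04d}',   # hypothetical per-chunk id
            "source": f'references/{doc["path"]}',
            "type": "reference",
            "category": doc["category"],
            "content": chunk,
            "embeddings": None,                  # left for the RAG to generate
            "metadata": {"title": doc["title"], "chunk": number},
        })
    return entries
```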

Indexing Workflow

yaml
indexing_workflow:
  1_validate:
    - Check that the format is supported
    - Check size (max 50MB)
    - Check permissions

  2_extract:
    - Extract text from the document
    - Clean up formatting
    - Split into chunks

  3_analyze:
    - Generate automatic summary
    - Extract keywords
    - Classify category

  4_index:
    - Add to the RAG corpus
    - Generate embeddings
    - Update index

  5_verify:
    - Test search
    - Check quality
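
The first workflow step is the easiest to pin down in code. Here is a minimal sketch of the three 1_validate checks (supported format, 50 MB size limit, read permission). The function name validate, the SUPPORTED set, and the decision to return a list of problems instead of raising are illustrative choices, not the skill's confirmed behavior.

```python
import os
from pathlib import Path

SUPPORTED = {".pdf", ".docx", ".md", ".txt", ".html"}
MAX_SIZE_MB = 50


def validate(path: str, max_size_mb: int = MAX_SIZE_MB) -> list[str]:
    """Return a list of problems; an empty list means the file passed 1_validate."""
    file = Path(path)
    if not file.is_file():
        return [f"not a file: {path}"]
    problems = []
    if file.suffix.lower() not in SUPPORTED:
        problems.append(f"unsupported format: {file.suffix or '(none)'}")
    if file.stat().st_size > max_size_mb * 1024 * 1024:
        problems.append(f"larger than {max_size_mb} MB")
    if not os.access(file, os.R_OK):
        problems.append("file is not readable")
    return problems
```

Returning every problem at once lets the caller report all failed checks in a single pass.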

Configuration

In settings.json:

json
{
  "memory": {
    "rag_corpus": ".agentic_sdlc/corpus",
    "max_document_size_mb": 50,
    "chunk_size": 1000,
    "chunk_overlap": 200
  }
}
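
To show how chunk_size and chunk_overlap interact, the sketch below reads the memory block and splits text into overlapping windows. The README does not say whether the sizes are measured in characters or tokens; this example assumes characters. load_memory_settings and split_into_chunks are hypothetical helper names.

```python
import json


def load_memory_settings(raw_json: str) -> dict:
    """Read the "memory" block, falling back to the values shown above."""
    defaults = {"max_document_size_mb": 50, "chunk_size": 1000, "chunk_overlap": 200}
    return {**defaults, **json.loads(raw_json).get("memory", {})}


def split_into_chunks(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    """Split text into character windows; consecutive chunks share chunk_overlap characters."""
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - chunk_overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the last window already reaches the end of the text
    return chunks
```

With the default settings each new chunk starts 800 characters after the previous one, so the final 200 characters of one chunk reappear at the start of the next.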

Best Practices

Name files descriptively: lei-13775-2018-duplicatas.pdf
Organize by category: legal, technical, business
Keep versions: do not overwrite, version instead
Document the source: record where the document came from
Summarize long docs: create summaries for large PDFs

Troubleshooting

PDF does not extract text

Some PDFs are scanned images. Use OCR:

bash
ocrmypdf input.pdf output.pdf
pdftotext output.pdf -
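
Deciding when to fall back to OCR can be automated with a simple heuristic: if most pages produce almost no text, the PDF is probably a scan. The sketch below takes the per-page strings from whatever extractor was used. The function name, the 20-character threshold, and the "more than half the pages" rule are assumptions chosen for illustration.

```python
def needs_ocr(page_texts: list[str], min_chars_per_page: int = 20) -> bool:
    """Guess whether a PDF is a scan: most pages yield almost no extracted text."""
    if not page_texts:
        return False
    sparse = sum(
        1 for text in page_texts
        if len((text or "").strip()) < min_chars_per_page
    )
    return sparse / len(page_texts) > 0.5
```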

Document too large

Split it into smaller parts or increase max_document_size_mb.

Incorrect encoding

Force UTF-8 during extraction:

bash
pdftotext -enc UTF-8 input.pdf output.txt
