KS
Killer-Skills

dfdl_ref — how to use dfdl_ref how to use dfdl_ref, dfdl_ref vs DataFusion, dfdl_ref setup guide, what is dfdl_ref, dfdl_ref alternative to PyArrow, DataFusion API reference, DeltaLake API guide, PyArrow API tutorial, dfdl_ref install, dfdl_ref documentation

v1.0.0
GitHub

About this Skill

Perfect for Data Analysis Agents needing comprehensive references for DataFusion, DeltaLake, and PyArrow APIs dfdl_ref is a technical reference skill for DataFusion, PyArrow, and DeltaLake APIs, providing operating rules and guidelines for efficient development.

Features

Provides reference maps for Core DataFusion Python surfaces (IO, catalog, SQL, DataFrame API)
Includes guidelines for probing local environments and searching repositories for existing API usage
Offers best-in-class deployment strategies for DataFusion and DeltaLake
Supports implementation using existing local patterns for PyArrow and UDF APIs
Features a comprehensive reference file (reference/datafusion.md) for DataFusion API details

# Core Topics

paul-heyse paul-heyse
[0]
[0]
Updated: 3/7/2026

Quality Score

Top 5%
26
Excellent
Based on code quality & docs
Installation
SYS Universal Install (Auto-Detect)
Cursor IDE Windsurf IDE VS Code IDE
> npx killer-skills add paul-heyse/CodeAnatomy/dfdl_ref

Agent Capability Analysis

The dfdl_ref MCP Server by paul-heyse is an open-source Categories.community integration for Claude and other AI agents, enabling seamless task automation and capability expansion. Optimized for how to use dfdl_ref, dfdl_ref vs DataFusion, dfdl_ref setup guide.

Ideal Agent Persona

Perfect for Data Analysis Agents needing comprehensive references for DataFusion, DeltaLake, and PyArrow APIs

Core Value

Empowers agents to efficiently develop with DataFusion, DeltaLake, and PyArrow by providing guidelines and references for their APIs, including Core DataFusion Python surfaces and best-in-class deployment patterns, ensuring accurate and reliable interactions with these technologies

Capabilities Granted for dfdl_ref MCP Server

Debugging DataFusion and DeltaLake integrations
Optimizing PyArrow data processing workflows
Implementing efficient DataFusion and DeltaLake APIs using existing local patterns

! Prerequisites & Limits

  • Requires access to local environment for version and method probing
  • Limited to DataFusion, DeltaLake, and PyArrow APIs
Project
SKILL.md
1.8 KB
.cursorrules
1.2 KB
package.json
240 B
Ready
UTF-8

# Tags

[No tags]
SKILL.md
Readonly

Operating rule: never guess DataFusion/DeltaLake/PyArrow/UDF APIs

When uncertain:

  1. Probe local environment (versions + available methods).
  2. Search the repo for how we already use it.
  3. Open the relevant reference file below (only the section you need).
  4. Implement using existing local patterns unless the plan says otherwise.

Reference map (open these files as needed)

  • Core DataFusion Python surfaces (IO, catalog, SQL, DataFrame API): reference/datafusion.md
  • "Best-in-class deployment gaps" (caching, stats, observability, planning knobs): reference/datafusion_addendum.md
  • Planning deep dive (logical/physical plan pipeline, introspection, optimization rules): reference/datafusion_planning.md
  • Rust UDF contracts (Scalar/UDAF/UDWF/Async/named args): reference/datafusion_rust_UDFs.md
  • Schema management + schema pitfalls: reference/datafusion_schema.md
  • DeltaLake ↔ DataFusion integration details: reference/deltalake_datafusion_integration.md
  • Advanced Rust integration (PyO3 packaging, wheels, CI, native module distribution): reference/datafusion_deltalake_advanced_rust_integration.md
  • DataFusionMixins trait (Delta snapshot schema + predicate parsing helpers): reference/deltalake_datafusionmixins.md
  • Plan combination (composing DataFusion plans via joins/unions/CTEs, Delta integration, parameterized queries, plan serialization): reference/datafusion_plan_combination.md
  • Rust LogicalPlan programmatic construction (LogicalPlanBuilder, Expr, schema/DFSchema, plan rewriting via TreeNode, extensibility, serialization): reference/Datafusion_logicplan_rust.md
  • DataFusion tracing (Rust community extension: execution spans, metrics capture, partial-result previews, rule-phase instrumentation, OpenTelemetry export): reference/datafusion-tracing.md
  • DeltaLake core (format/protocol, client APIs, 3-layer model): reference/deltalake.md

Related Skills

Looking for an alternative to dfdl_ref or building a Categories.community AI Agent? Explore these related open-source MCP Servers.

View All

widget-generator

Logo of f
f

widget-generator is an open-source AI agent skill for creating widget plugins that are injected into prompt feeds on prompts.chat. It supports two rendering modes: standard prompt widgets using default PromptCard styling and custom render widgets built as full React components.

149.6k
0
Design

chat-sdk

Logo of lobehub
lobehub

chat-sdk is a unified TypeScript SDK for building chat bots across multiple platforms, providing a single interface for deploying bot logic.

73.0k
0
Communication

zustand

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
Communication

data-fetching

Logo of lobehub
lobehub

The ultimate space for work and life — to find, build, and collaborate with agent teammates that grow with you. We are taking agent harness to the next level — enabling multi-agent collaboration, effortless agent team design, and introducing agents as the unit of work interaction.

72.8k
0
Communication