Skip to content

Update pipeio specs and CLAUDE.md docs for v2 architectural decisions

Goal

Update pipeio's documentation to reflect the v2 architectural decisions: - One flow = one snakebids app = one derivative directory - pipeio is an agent-facing authoring + discovery layer (not an execution engine) - Execution delegated to snakebids/snakemake, provenance to DataLad - pipe is a category label, not a hierarchy level

Context

See roadmap: docs/log/idea/idea-arash-20260330-174518-164647.md See research: docs/log/idea/deep-research-pipeio-scope.md

Key docs to update: - packages/pipeio/CLAUDE.md — package-level guidance - packages/pipeio/docs/specs/pipeio/overview.md — architecture overview - packages/pipeio/docs/specs/pipeio/registry.md — registry spec - packages/pipeio/docs/specs/pipeio/flow-config.md — config spec - packages/pipeio/docs/specs/pipeio/mcp-tools.md — tool documentation - CLAUDE.md (projio root) — ecosystem description

Prompt

Update pipeio documentation to reflect v2 architectural decisions. This is a docs-only task — do NOT change any Python code.

Key decisions to document:

  1. North star: pipeio is an agent-facing authoring + discovery layer. It does not compete with execution engines (Snakemake), provenance systems (DataLad), app lifecycle (snakebids), or path resolution (snakebids bids()). It makes pipeline knowledge queryable and actionable for AI agents.

  2. One flow = one derivative: Each flow is a self-contained snakebids app producing one derivative directory. The pipe/flow hierarchy is being flattened — pipe becomes a category tag.

  3. Delegation model:

  4. Execution: snakebids run.py → Snakemake
  5. Provenance: DataLad run records
  6. Path resolution: snakebids bids() + generate_inputs()
  7. App lifecycle: snakebids deployment modes

  8. pipeio's unique value: registry/discovery, AI-safe authoring (rule_insert, config_patch, mod_create), contract semantics (I/O validation, cross-flow wiring), notebook lifecycle, documentation.

  9. Tool categories going forward: keep (27 unique), thin-to-adapter (4: dag, completion, log_parse, config_init), stop/replace (4: run, run_status, run_dashboard, run_kill).

  10. New features to document:

  11. pipeio_dag_export — thin adapter over snakemake --rulegraph/--dag/--d3dag, supports dot/mermaid/svg/json output
  12. pipeio_report — thin adapter over snakemake --report, supports targeting a specific rule (e.g. report) for flows with partial outputs
  13. pipeio_nb_update — update notebook metadata (kind, description, status) in notebook.yml
  14. pipeio_mod_context — bundled read tool returning rules, scripts, doc, config params, bids signatures for a mod in one call
  15. pf shell helper (bin/pf.sh) — flow navigator modeled on wg (project navigator). Source from bashrc. Commands: pf (list flows), pf <flow> (cd into flow dir), pf <flow> smk [args] (snakemake with auto-resolved --snakefile/--directory/conda env), pf <flow> deriv (cd into derivative dir), pf <flow> config (print config path). Tab completion from registry.
  16. CLI subcommands: pipeio flow ids, pipeio flow path <flow>, pipeio flow config <flow>, pipeio flow deriv <flow>, pipeio flow smk <flow> [args]
  17. pipeio_run now resolves snakemake via conda env wrapping (cogpy) and passes --directory for config resolution. use_conda flag passes --use-conda to snakemake CLI.
  18. NotebookEntry now has kind, description, status fields for lifecycle tracking

Read each doc file listed above. Update them to reflect these decisions and new features. Be concise — don't bloat the docs. Remove or mark as deprecated any sections describing pipeio as an execution engine. Add a clear "Delegation" or "Ecosystem" section explaining what pipeio owns vs delegates.

For CLAUDE.md files, update the architecture description, tool routing tables, and CLI surface section.

Acceptance Criteria

  • [ ] pipeio CLAUDE.md updated with v2 architecture and new features
  • [ ] Overview spec reflects delegation model
  • [ ] Registry spec notes pipe→category evolution
  • [ ] MCP tools spec notes which tools are being thinned/replaced and lists new tools
  • [ ] CLI surface documented (pf shell helper, flow subcommands)
  • [ ] projio root CLAUDE.md updated

Result

(Filled in after execution)

Batch Result

  • status: done
  • batch queue_id: d52d4b497700
  • session: dd9c6c81-6b88-4a7d-87b4-d0280355ac87
  • batch duration: 379.5s