References¶
Inspirations (handbook-wide)¶
The handbook owes its shape to several solo-author works and traditions. Primary inspirations:
- goodresearch.dev — Patrick Mineault. The canonical solo-author research-workflow handbook; this handbook's most direct influence.
- xcorr.net — Patrick Mineault (blog companion to goodresearch.dev).
- cartesian.app — Elias Yilma (interactive DSA handbook; explorable-essay pattern).
Additional inspirations (organised by tradition):
- Solo handbooks: Jenny Bryan (Happy Git with R); Hadley Wickham (R Packages, R for Data Science); Karpathy ("Recipe for Training Neural Networks"); Stas Bekman (ml-engineering); Google DL Tuning Playbook; Vince Buffalo (Bioinformatics Data Skills); The Turing Way.
- Note-to-blog essayists: Simon Willison; Julia Evans; Lilian Weng; Jay Alammar; Chris Olah; Andy Matuschak; Eugene Yan; Chip Huyen; Maggie Appleton; Dan Luu.
- Interactive / explorable: Bartosz Ciechanowski; Amit Patel (Red Blob Games); Nicky Case; Bret Victor; Distill.pub; Setosa.io; Seeing Theory; Immersive Linear Algebra.
- Neuroscience-specific: Mike X Cohen (Analyzing Neural Time Series Data); Russell Poldrack (Statistical Thinking for the 21st Century); Neuromatch Academy.
Source documents¶
- Deep Research synthesis (2026-05-07): Interactive Mathematics Beyond the Static Page — see
reference/research/Interactive Mathematics Beyond the Static Page.pdf. Source of the 7-paradigm taxonomy and the gap argument in00-frame/why-interactivity.md.
Per-chapter further reading¶
§00 Framing¶
- goodresearch.dev (Patrick Mineault) — the closest companion handbook: solo-author research workflows from question to figure.
- The Turing Way — community handbook for reproducible, ethical, collaborative research; especially the Reproducible Research guide.
- ml-engineering (Stas Bekman) — large-scale engineering handbook for ML practitioners; model for the opinionated practitioner-guide format.
- Interactive Mathematics Beyond the Static Page — Deep Research synthesis; 7-paradigm taxonomy and gap argument that frames this chapter.
- Explorable Explanations (Bret Victor) — the foundational manifesto for reactive, manipulable documents.
- Distill.pub — archive of interactive ML essays; exemplar of the explorable-essay pattern at publication quality.
- Bartosz Ciechanowski — physics and engineering simulations as interactive essays; the gold standard for manipulable embedded figures.
- The Turing Way — patterns for transitioning from solo to collaborative research practice; bus-factor and FAIR principles.
- goodresearch.dev — Patrick Mineault; team and continuity considerations alongside the solo-author workflow.
- ml-engineering (Stas Bekman) — large single-author effort that grew into a community resource; case study in sustainability.
§10 BIDS¶
- BIDS specification — canonical source for all entity names, sidecar requirements, and
dataset_description.jsonfields. - bids-validator — run
bids-validator raw/to catch layout violations; JavaScript and Python variants. - PyBIDS — Python library for querying BIDS datasets; complement to snakebids for non-Snakemake code.
- MNE-BIDS — BIDS-aware I/O for electrophysiology; handles sidecar creation from raw EEG/iEEG recordings.
- BIDS derivatives specification — formal rules for derivative dataset layout,
dataset_description.jsoninderivatives/, andGeneratedByprovenance fields. - PyBIDS derivatives —
BIDSLayout(derivatives=True)for querying processed outputs alongside raw.
- BIDS Extension Proposals — active proposals extending BIDS to video (BEP 024), microscopy, MEG, and other modalities.
- BIDS starter kit — annotated examples and templates for adopting BIDS in a new modality.
§20 DataLad¶
- DataLad handbook — comprehensive reference covering
datalad install, nested datasets, provenance recording, and the YODA principles. - git-annex — underlying binary-tracking layer; useful when DataLad's abstraction is insufficient or when working with non-DataLad repositories.
- DataLad handbook §Publishing —
datalad push, sibling setup, SSH and GitHub/GitLab configurations; RIA store creation and usage. - git-annex special remotes — the protocol layer underlying DataLad siblings, including
ria+file://andria+ssh://transports.
- DataLad handbook §YODA principles — the layout principle that keeps code pinned at a commit inside the superdataset; rationale and workflow.
- DataLad run —
datalad runrecords a command's provenance; the complement to pinning code versions.
§30 Snakemake¶
- Snakemake documentation — reference for rule syntax,
input/output,run,shell, andscriptdirectives; cluster execution profiles. - Mölder et al. 2021 — "Sustainable data analysis with Snakemake," F1000Research; cite this when describing the pipeline engine in a methods section.
- Snakemake tutorial — hands-on walkthrough; fastest path from zero to a running first rule.
- snakebids documentation —
generate_inputs(),BidsComponent, and the snakebids YAML config format. - BIDS specification §entities — entity definitions (
sub,ses,run,task) that map directly to snakebids wildcard names.
- Snakemake §Configuration —
configfile:, theconfigdict, and profile-based configuration for reproducible parameter sweeps. - snakebids documentation — how snakebids config extends Snakemake's own config with BIDS-aware input specifications.
- snakebids documentation — full reference for idioms 1 and 2 (snakebids-only and snakebids + BidsPaths).
- Snakemake documentation — idiom 3 baseline; plain Snakemake without BIDS-aware parameterisation.
§40 Marimo¶
- Marimo documentation — installation, the reactive execution model, UI element API (
mo.ui.*), and the.pyfile format. - Marimo GitHub — source and issues; the blog posts in the repository explain core design decisions.
- xarray —
DataArray,Dataset,.sel()/.isel()coordinate selection, and groupby operations on labelled N-D arrays. - HoloViews — declarative multi-dimensional plotting; the
.hvplotaccessor that bridges xarray and interactive bokeh/panel renderers. - MNE-Python — EEG/iEEG processing;
read_raw_*, epochs, and time-frequency representations.
- Marimo §WASM export —
marimo export html-wasm; bundle size limits, supported PyPI packages, and embedding options. - Pyodide — Python in WebAssembly; the runtime that powers Marimo's browser-side execution.
§50 Publication¶
- Material for MkDocs — theme reference; navigation, admonitions, search, social cards, and the full plugin list.
- MkDocs documentation —
nav:structure,mkdocs.yml, custom hooks, and deployment to GitHub Pages.
- Quarto documentation — formats (
html,pdf,revealjs,docx), YAML front-matter,_quarto.ymlproject files, and theincludeshortcode. - Quarto revealjs guide — slide transitions, incremental lists, fragment animations, and code-block highlighting options.
Two surfaces, one cross-link protocol
- Material for MkDocs — cross-page links and the
mkdocs-ezlinksplugin that resolves bare filenames. - Quarto projects —
_quarto.ymland{{< include >}}for cross-document transclusion within a Quarto project.
§60 Projio¶
- Model Context Protocol specification — the JSON-RPC wire format that projio's MCP server implements; tool and resource schemas.
- FastMCP — Python library used to register projio's MCP tools; decorator-based tool definition.
- The Turing Way §Research Data Management — structured note-keeping, provenance recording, and metadata conventions in team research.
- Snakemake documentation — underlying execution engine; pipeio wraps its scheduling and wildcard resolution.
- snakebids documentation — BIDS-aware input generation used inside pipeio-managed flows.
- Docling — PDF text extraction library; table, figure, and structured reference extraction.
- GROBID — ML tool for structured reference and header extraction from PDFs; powers
biblio_grobid. - OpenAlex API — open scholarly metadata API; powers DOI resolution and citation-graph expansion in biblio.
- Pandoc user manual —
--citeproc,--bibliography, Lua filter interface, and all output format options. - Citation Style Language — CSL spec; the APA, IEEE, Chicago, and Vancouver styles bundled by projio are drawn from this repository.
- uv — fast Python package manager;
uv tool install --editableis used to share editable core libraries across environments without per-project installs.
§70 Agentic workflows¶
- Claude Code documentation — installation,
.mcp.jsonconfiguration, CLAUDE.md memory hierarchy, and the tool-permission model. - Model Context Protocol — the JSON-RPC wire format; reference for writing a new MCP server from scratch.
Permissions and bounded context
- Claude Code settings reference —
permissions.allow,allowedTools,additionalDirectories,defaultMode, and hook configuration.
- Claude Code §Memory and context — the CLAUDE.md memory hierarchy that skills plug into; project-level vs user-level instructions.
- Claude Code documentation — the execution model underpinning
execute_task()andrun_prompt(); session and subagent lifecycle. - Anthropic model overview — haiku / sonnet / opus capability tiers; the basis for model selection in dispatch calls.
§80 Orchestration¶
- Claude Code §Sub-agents — the subagent model that worklog's queue taps into; how sessions are isolated and parallelised.
- Anthropic model overview — the model-tier ladder (haiku → sonnet → opus) that worklog uses for cost-sensitive dispatch.
- Claude Code documentation — the agent primitives (tasks, sessions, captures) that goals decompose into.
- Anthropic model overview — haiku / sonnet / opus capability tiers; guidance for matching model to task complexity in dispatch calls.
- Claude Code §Sub-agents — how the
Agent(...)tool spawns isolated subagent contexts; the mechanismrun_prompt()drives.
§90 Future directions¶
- Claude Code §Sub-agents — the current
Agent(subagent_type=...)primitive that two-tier hierarchies are built on today. - Model Context Protocol — the shared communication layer that makes tool-access portable across agent tiers.
- Claude Code documentation — the current session model; context for understanding what a "persistent multi-agent session" would extend.
§99 Honest gaps¶
- BIDS specification — the authoritative source for derivative validation requirements described in gap 1.
- The Turing Way §Reproducibility checklist — community-assembled checklist of common gaps in reproducible research practice.
- goodresearch.dev — the companion handbook against which this cohort's gaps were calibrated.