
Citekey generation mangles diacritics in author names

biblio_ingest generated the following citekeys:

  • mlle_2009_InfluenceLearning for Mölle et al. 2009 (should be molle_2009_...)
  • anda_2020_BidirectionalInteraction for Sanda et al. 2021 (should be sanda_2020_...; the leading 'S' was dropped)
  • latchoumane_2017_ThalamicSpindles for Latchoumane/Ngo et al. 2017. The paper is commonly cited as Ngo, Martinetz, Born, Mölle 2013; OpenAlex may have returned a different version.

Impact

Citekeys are permanent identifiers, so mangled names make papers harder to find and cite correctly. The ö→ø→dropped pattern suggests that the Unicode normalization step is stripping non-ASCII characters rather than transliterating them to their base letters (for example, ö should become o).
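
A minimal sketch of the difference between stripping and transliterating, using only Python's standard unicodedata module. This is not the actual biblio_ingest code: the function names transliterate_surname and make_citekey, the _FALLBACK table, and the surname_year_ShortTitle layout are assumptions made for illustration, inferred from the citekeys listed above.

```python
import unicodedata

# Hypothetical fallback table: these letters have no Unicode decomposition,
# so NFKD alone leaves them unchanged and an ASCII filter would drop them.
_FALLBACK = {
    "\u00f8": "o", "\u00d8": "O",    # ø, Ø
    "\u00df": "ss",                  # ß
    "\u00e6": "ae", "\u00c6": "AE",  # æ, Æ
    "\u0142": "l", "\u0141": "L",    # ł, Ł
    "\u0111": "d", "\u0110": "D",    # đ, Đ
}


def transliterate_surname(name: str) -> str:
    """Return a lowercase ASCII form of a surname, keeping base letters."""
    pieces = []
    for ch in name:
        ch = _FALLBACK.get(ch, ch)
        # NFKD splits an accented letter into base letter + combining marks.
        decomposed = unicodedata.normalize("NFKD", ch)
        # Drop only the combining marks, not the base letter.
        pieces.append(
            "".join(c for c in decomposed if not unicodedata.combining(c))
        )
    ascii_name = "".join(pieces)
    # Remove anything still outside ASCII letters and digits.
    return "".join(c for c in ascii_name if c.isascii() and c.isalnum()).lower()


def make_citekey(surname: str, year: int, short_title: str) -> str:
    """Assemble a citekey in the assumed surname_year_ShortTitle layout."""
    return f"{transliterate_surname(surname)}_{year}_{short_title}"


# Stripping, which reproduces the symptom reported above:
naive = "M\u00f6lle".encode("ascii", "ignore").decode().lower()
print(naive)  # mlle

# Transliterating, which keeps the base letter:
print(make_citekey("M\u00f6lle", 2009, "InfluenceLearning"))
# molle_2009_InfluenceLearning
```

The order matters: the fallback mapping and the decomposition run before any ASCII filtering, so a letter is only ever dropped when no base form is known for it.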


Source context: pixecog

PixEcog (pixecog): Neuropixels and ECoG dataset and analysis

Recent commits:

8dc0d9d Pipeline docs: gitignore docs/pipelines/, relocate hand-authored files
96cd1ec Refactor sharpwaveripple/contracts: extract generic helpers to utils/io, remove pipelines __init__.py
36f9326 Add result note directory and sample note

README:




Quick Start for Collaborators

Follow this checklist to get started with Pixecog documentation and workflows.

🐀 Pixecog Project — Compact Overview

Core principles

  • One immutable BIDS raw dataset (raw/) as the canonical baseline
  • Each analysis pipeline ha
  • issue-arash-20260409-231546-838942.md — Missing biblio_openalex_resolve tool — directly relevant since the current note suspects OpenAlex returned a wrong/different paper for one of the mangled citekeys
  • issue-arash-20260409-231618-516346.md — biblio_enrich returning 0 when resolution is missing — same biblio ingest/enrichment pipeline, likely triggered in the same session
  • issue-arash-20260409-231641-242830.md — biblio_pdf_fetch_oa html_fallback issue — co-occurring biblio toolchain issue from the same session
  • issue-arash-20260404-021642-474901.md — biblio-glutton study for high-performance bibliographic matching — improving author name normalization and reference resolution is directly relevant to the diacritic/citekey problem