Paper Revisits
Created 2026-05-20. Registry of pre-committed paper V2/V3 revisits with explicit trigger conditions. Companion to
paper-coverage-gaps.md,dream-papers.md(legendary-tier long-horizon pitches), andresearch-arcs-and-idle-projects.md.
Why this exists
A V2 paper is a strong honesty signal only if the trigger condition was committed before the V1 result was known. This file tracks those commitments. When a trigger fires (N threshold met, calendar date reached, infra landed, event observed), the V2 paper is queued with the original methodology — no methodology shopping, no post-hoc justification.
Each entry must contain enough to re-run V2 mechanically: original paper, trigger condition, pre-registered method, and what would change the conclusion.
Trigger flavors
- N-gated — wait until sample size crosses a stated power threshold. Trigger is a SQL query against
observations. - Coverage-gated — wait until a new data source, sensor, or constituent comes online. Trigger is the presence of a new datasource_id or metric.
- Event-gated — wait for the next instance of an external event (Forbush, X-class flare, named storm, large earthquake near sensor). Trigger is event detection in the relevant feed.
- Infra-gated — wait for fetcher / pipeline work to land. Trigger is a closed platform issue.
- Calendar-gated — wait for a specific date, typically an anniversary or seasonal window. Trigger is just the clock.
A single revisit can be multi-gated (e.g., AQI V2 is N-gated AND calendar-gated; CNEOS V2 is N-gated with a calendar estimate).
Per-entry fields
Every entry below uses the same skeleton:
- Original — workspace slug + commit(s) + explainer pointer
- Flavor(s) — one or more of the five above
- Trigger condition — specific, falsifiable, machine-checkable when possible
- Earliest expected fire — date or conditional estimate
- Pre-registered method — pointer to V1 §M.x or supplemental; methodology is locked
- What would change the conclusion — one sentence per possible outcome
- Status —
pending/fired/superseded/cancelled
Active revisits
V2 of #203 — AQI anomaly characterization
- Original: workspace
aqi-anomaly-characterization, paper commitsf0efe77→e4bcb3a, explainerdocs/paper-explainers/203-aqi-anomalies.md - Flavor(s): N-gated + Calendar-gated
- Trigger condition: ≥60 ≥3σ anomalies in full-constituent (post-2026-03-16) ingest. Check via
SELECT COUNT(*)of anomaly-flagged rows wheretimestamp_utc >= '2026-03-16'. - Earliest expected fire: ~2026-09-01 (~5.5 months of full-constituent data at current cadence projects to 60–80 anomalies).
- Pre-registered method: 6-D KMeans clustering across the 6 AQI constituents (originally pre-registered in #203 V1). V1 reported structural finding instead because N=28 was insufficient for stable cluster recovery.
- What would change the conclusion: Stable cluster recovery at k=2 or k=3 with silhouette ≥0.4 would upgrade V1's structural finding to a real anomaly typology. Failure to recover stable clusters confirms the modest bimodality (V1 ΔBIC=4.64) is noise, not a population structure.
- Status: pending
V2 of #205 — CNEOS fireballs × meteor showers
- Original: workspace
pma-cneos-fireball-day-of-year-vs-meteor, paper commits1518211→9406b37, explainerdocs/paper-explainers/205-cneos-fireball-showers.md - Flavor(s): N-gated (with calendar estimate)
- Trigger condition: N ≥ 600 unique CNEOS fireballs, deduped on minute-resolution timestamp + 0.1° rounded coordinates. At N=600 the Bonferroni-12 detection floor drops to rate ratio ~1.5, matching the optical-bolide Geminid literature range.
- Earliest expected fire: ~2034 at current ingest rate (~22 deduped events/year post-2014). Re-check yearly; could fire earlier if CNEOS sensor program expands.
- Pre-registered method: Schuster test + per-shower Poisson rate-ratio at h ∈ {3,5,7,10}d with Bonferroni across 12 named showers, identical to V1 §3. Pred. column (A=asteroidal, C=cometary) carries forward.
- What would change the conclusion: Geminids (or any asteroidal-parent shower) surviving Bonferroni at any h with rate ratio in 1.3–1.5 resolves V1's ambiguity toward Story B (real but previously sub-floor signature). Continued Bonferroni-null at the lower floor resolves toward Story A (genuinely sporadic kt-energy population, distinct from optical meteors).
- Status: pending
V2 of #202 — Safecast × Forbush radiation modulation
- Original: workspace
safecast-forbush-radiation-modulation, paper commitsf6619db→6a2ec46, explainerdocs/paper-explainers/202-safecast-forbush.md - Flavor(s): Infra-gated (partially satisfied; see below)
- Trigger condition: NMDB fetcher merged to
mainAND producing rows for ≥3 stations spanning a useful latitude range (must include ≥1 equatorial |φ|<30° AND ≥1 southern-hemisphere monitor). V1 was bounded by Safecast CPM detection physics — gamma-only, ground-level, no polar/equatorial coverage. NMDB neutron-monitor data is the correct instrument, but inherits the same global-coverage requirement. - Status of trigger (verified 2026-05-20): Partially met. NMDB fetcher is live (
nmdb_cosmic_rays.py, 4,469 rows, 2026-04-02 → 2026-05-20). Station selection is 4 NH-only sites — Jungfraujoch (46.55°N), Oulu (65.05°N), Apatity (67.57°N), Thule (76.5°N) — polar/mid only, no equatorial, no southern. The spirit of the original condition (replicate Safecast's latitude-coverage gap with the correct instrument) is NOT met. Two paths forward: (a) expand NMDB station selection to include equatorial + southern monitors (Tsumeb, Princess Sirindhorn, Kerguelen, etc. are public NMDB stations); (b) honestly scope V2 as a NH-only paired-SEA with explicit acknowledgement that polar/equatorial decomposition isn't possible at current station selection. - Earliest expected fire: Depends on chosen path. (a) requires a fetcher-config change + ~few weeks of new station data accumulating; (b) is runnable today with the caveat documented up front.
- Pre-registered method: Same paired SEA design as V1 §4, swapping Safecast CPM for NMDB neutron count. Bonferroni across the same 5 candidate drivers. Latitude stratification using the NMDB station network as configured at fire time.
- What would change the conclusion: A Bonferroni-significant Forbush signal in NMDB data closes out the "CPM channel was the wrong tool" hypothesis with a positive result and supersedes V1. Continued null in NMDB points at driver selection or coupling mechanism, not the sensor, and bounds the search differently.
- Status: pending — awaiting station-network decision (see "Status of trigger" above)
Workflow when a trigger fires
- Verify the trigger condition still holds (re-run the N query; confirm fetcher is live and stable; etc.).
- Open a GitHub issue for V2 (numbered separately, e.g., "V2 of #203 — ...").
- Run extraction with the V1 script. Only date-range or metric updates allowed.
- Run analysis with the pre-registered V1 method. Do not add new tests. A new test = V3, not a methodology change.
- Write V2 paper as "what V1 said → what changed in the data → what V2 says now." Cite V1 prominently and link the explainer.
- If V2 contradicts V1, that's more interesting, not less. Lead with the contradiction.
Maintenance
- Add a new entry whenever a published paper has a pre-registered V2 condition. Do this at ship time, not later.
- Update
Statuswhen a trigger fires; never delete fired/superseded entries — keep the chain so V2 can cite the registry. - Calendar-gated and near-N-threshold entries: re-check at the start of each session if within 30 days of expected fire.
- Yearly review: re-estimate N-gated fire dates from current ingest rates.
- If a V1 paper should have had a V2 condition pre-registered but didn't, note it here under a
## Retroactive candidatessection — but treat retroactive entries as weaker honesty signal than pre-registered ones.