Appendix: reproducibility steps and the ATL
Appendix: reproducibility steps and the ATL DataPack schema — This appendix pins down “who runs what where, and which log/output must appear.” In a real release, paths/file names/hashes below must be finalized in the reproducibility bundle. Recommended required outputs: (1) UNLOCK_checklist.txt, (2) omega_nogo.json, (3) an RSL summary table (CSV), and (4) a QA report (MD/PDF).
This appendix pins down “who runs what where, and which log/output must appear.” In a real release, paths/file names/hashes below must be finalized in the reproducibility bundle.
This appendix pins down “who runs what where, and which log/output must appear.” In a real release, paths/file names/hashes below must be finalized in the reproducibility bundle.
(Example) execution steps
# 0) Environment setup (example): Python 3.11 + requirements.txt
python -V
pip install -r requirements.txt
# 1) Verify data integrity
sha256sum -c checksums/sha256sum.txt
# 2) Verify prereg thresholds (Omega-NoGo) are fixed
cat config/constraints.yml
# 3) Run hard gates PASS/FAIL
python tests/tests_hardgate.py --constraints config/constraints.yml --results_dir results/
# 4) Check outputs
cat results/UNLOCK_checklist.txt
Recommended required outputs: (1) UNLOCK_checklist.txt, (2) omega_nogo.json, (3) an RSL summary table (CSV), and (4) a QA report (MD/PDF). Without these, the release is “narrative” rather than “reproducible.”
ATL DataPack v0.9 (schema stub): file/field/codebook schema
This appendix defines the minimum data modules (files/fields) required to make predictions P1–P5 executable. Important: file names/fields/units here must be fixed by preregistration before release and checksum-sealed after release.
{
Module File (example) Key fields (minimum; defined in codebook) Linked prediction ATL-PB data/atlantic/atl_pb_boundaries.geojsonseg_id, type(ridge/transform/subduction), length_km, basinP1 ATL-DEF data/atlantic/atl_deformation_sites.csvsite_id, lat, lon, proxy_type, proxy_value, sigma, r_kmP2 ATL-RSL data/rsl/rsl_timeseries.csvbasin, basin_type, site, Age_BP, RSL_m, sigma, correction_tagP3 ATL-LUB data/geo/atl_lubrication_markers.csvsample_id, L1..L4, evidence_ref, qa_flagP4 ATL-ONSET data/strat/atl_opening_onset.csvsegment_id, onset_age, sigma_age, method, refP5
}
Recommended use of an additional field basin_type. For the P3 contrast (including the Indian Ocean), assign each RSL point a basin_type. Recommended values: PACIFIC, ATLANTIC, INDIAN. (basin is the geographic basin name, while basin_type is the mechanism-class label.)
Extension modules (optional): supporting P6/P8 (ATL DataPack v0.2 draft)
P6 (spatiotemporal propagation) and P8 (magnetic stripes) can require more complex data composition than P1–P5. Therefore, minimal schemas are separated as optional extension modules (the core compile/hard-gate can run without them).
{
Module File (example) Key fields (minimum; defined in codebook) Linked prediction ATL-PROP data/strat/atl_opening_points.csvpoint_id, lat, lon, onset_age, sigma_age, segment_id, refP6 ATL-MAG data/mag/mag_anomaly_profiles.csvprofile_id, ridge_seg, x_km, anomaly_nT, sampling_km, proc_tag, refP8
}
Cross-validation extension modules (optional): supporting P9/P10/P11/P12 (ATL DataPack v0.3 draft)
The modules below are not mandatory hard gates for C1–C3, but are included for independent cross-validation of “event-like” claims.
{
Module File (example) Key fields (minimum; defined in codebook) Linked prediction ORO-UHP data/orogeny/uhp_exhumation_cases.csvcase_id, belt, P_max_GPa, t_peak_Ma, t_exhum_Ma, dt_Myr, refP9 ATL-SED data/sediment/atl_sed_thickness_transects.csvtransect_id, distance_km, crust_age_Ma, sed_thick_m, ref, qa_flagP10 ATL-DRILL data/sediment/atl_drill_sites.csvsite_id, lat, lon, basement_age_Ma, sed_rate_m_Myr, method, refP10 MN-NOD data/sediment/mn_nodule_samples.csvsample_id, lat, lon, size_mm, age_est_Myr, method, refP11 VOLC-AGE data/volc/volcano_construction_ages.csvvolcano_id, name, region, T80_ka, volume_km3, method, refP12
}
Minimum codebook/provenance requirements (common to all modules).
docs/codebook.md: field definitions, units, missing-value rules, QC rules (PASS/FAIL).docs/provenance.md: raw-data sources/access date, cleaning rules, exclusion criteria (pre-registered).checksums/sha256sum.txt: SHA256 hashes of all files.
ATL DataPack v0.8 draft (v1.34): hydro/climate/friction/isotope extension modules
This extension schema is not for the base mechanism (C1–C3), but for locking mandatory gates of additional variants (V-SLIP / V-HOLO / V-COUPLED) into data.
{
(A) Mechanism / hydro / climate / kinematics modules
Module ID File (example) Key fields (minimum) Linked prediction FRIC-PAR data/friction/plate_friction_params.csvregion, mu_s, alpha, Pf_MPa, tau_drive_MPa, refP13 KIN-CON data/kinematics/plate_velocity_constraints.csvplate, v_now_m_yr, S_total_km, T_event_yr, refP14 THERM-BUD data/thermal/thermal_budget_params.csvtau_MPa, A_km2, S_km, h_eff_km, eta_h, refP14 HYD-VOL data/hydro/basin_volume_scenarios.csvscenario, deltaV_km3, T_yr, basin, refP15 RSL-CON data/rsl/holocene_sea_level_constraints.csvsite, age_ka, rsl_m, sigma_m, method, refP15 CLIM-FW data/climate/freshwater_proxy_timeseries.csvrecord, proxy, region, age_ka, value, unit, refP16 BIO-SPL data/bio/atlantic_split_candidates.csvtaxon, pair, t_div_Ma, method, refP17 ISO-CASE data/isotopes/open_system_casebook.csvcase, known_age_yr, method, apparent_age_Ma, mechanism, refP18 SLB-COMP data/hydro/sea_level_budget_components.csvdate, sl_obs_mm, steric_mm, mass_mm, land_mm, sigma_mm, refP19 BASIN-PRX data/hydro/basin_volume_change_proxy.csvproxy, basin, date, dV_km3, sigma_km3, refP19 RIV-MIS data/geomorph/misfit_rivers.csvriver, basin, W_valley_m, W_channel_m, Q_m3s, controls, refP20 DELTA-AGE data/geomorph/mega_delta_ages.csvdelta, onset_ka, sigma_ka, method, criterion, refP20 KIN-MOD data/kinematics/plate_velocity_models.csvplate, model, epoch, frame, vx_mm_yr, vy_mm_yr, refP21
(B) Resource / sediment / discovery-bias modules
Module ID File (example) Key fields (minimum) Linked prediction PET-OIL data/petroleum/oil_provinces.csvprovince, lat, lon, type, basin, refP22 ICE-PROX data/glacial/ice_extent_proxies.csvregion, proxy, age_ka, value, unit, refP22/P23 REFUGIA data/glacial/refugia_catalog.csvregion, glaciated, heatflow_mWm2, volc_index, clim_controls, refP23 LAKE-END data/hydro/endorheic_lakes.csvBMN, site, lat, lon, age_kyr, status, trend, num14CP24 SHELF-W data/geomorph/shelf_width_profiles.csvprofile, margin_type, shelf_width_km, canyon_density, drainage_area_km2, refP25 UNCON-S data/strat/unconformity_sites.csvsite, age_min_Ma, age_max_Ma, weathering, hiE_marker, refP26 POLY-CASE data/strat/polystrate_cases.csvcase, env, unit, age, notes, refP27 COAL-CASE data/strat/coal_marine_cases.csvcase, coal_type, marine_fossils, rootlets, sed_struct, refP28 EVT-WIN data/meta/event_window_estimates.csvmodule, proxy_class, t_center_ka, sigma_ka, sign, weight, method, ref, includeP29 CTRL-REG data/meta/controls_registry.csvmodule, control_type, control_id, selector, notes, refP30 COAL-RANK data/geo/coal_rank_stress_cases.csvbasin, country, tectonic_setting, rank, Ro, stress_proxy, burial_proxy, refP31 PET-BUDG data/petroleum/oil_source_budget_cases.csvcase_id, province, oil_in_place_bbl, recoverable_bbl, basin_area_km2, source_thickness_m, toc_frac, eta_conv, eta_exp, refP32 PET-THERM data/petroleum/oil_thermal_context.csvprovince, geothermal_gradient_C_km, heatflow_mWm2, intrusion_distance_km, Ro, tmax_C, phase, refP33 PET-AGE data/petroleum/petroleum_chronometers.csvprovince, age_Ma, sigma_Ma, method, material, interpretation, qa_flag, refP34 PET-DISC data/petroleum/oil_discovery_bias_cases.csvbasin, sed_flux_proxy, shelf_sed_thickness_m, wells, seismic_km2, exploration_years, discovered_bboe, giant_fields, refP35
}