Appendix: reproducibility steps and the ATL

Appendix: reproducibility steps and the ATL DataPack schema — This appendix pins down “who runs what where, and which log/output must appear.” In a real release, paths/file names/hashes below must be finalized in the reproducibility bundle. Recommended required outputs: (1) UNLOCK_checklist.txt, (2) omega_nogo.json, (3) an RSL summary table (CSV), and (4) a QA report (MD/PDF).

This appendix pins down “who runs what where, and which log/output must appear.” In a real release, paths/file names/hashes below must be finalized in the reproducibility bundle.

This appendix pins down “who runs what where, and which log/output must appear.” In a real release, paths/file names/hashes below must be finalized in the reproducibility bundle.

(Example) execution steps

# 0) Environment setup (example): Python 3.11 + requirements.txt
python -V
pip install -r requirements.txt

# 1) Verify data integrity
sha256sum -c checksums/sha256sum.txt

# 2) Verify prereg thresholds (Omega-NoGo) are fixed
cat config/constraints.yml

# 3) Run hard gates PASS/FAIL
python tests/tests_hardgate.py --constraints config/constraints.yml --results_dir results/

# 4) Check outputs
cat results/UNLOCK_checklist.txt

Recommended required outputs: (1) UNLOCK_checklist.txt, (2) omega_nogo.json, (3) an RSL summary table (CSV), and (4) a QA report (MD/PDF). Without these, the release is “narrative” rather than “reproducible.”

ATL DataPack v0.9 (schema stub): file/field/codebook schema

This appendix defines the minimum data modules (files/fields) required to make predictions P1–P5 executable. Important: file names/fields/units here must be fixed by preregistration before release and checksum-sealed after release.

{

ModuleFile (example)Key fields (minimum; defined in codebook)Linked prediction
ATL-PBdata/atlantic/atl_pb_boundaries.geojsonseg_id, type(ridge/transform/subduction), length_km, basinP1
ATL-DEFdata/atlantic/atl_deformation_sites.csvsite_id, lat, lon, proxy_type, proxy_value, sigma, r_kmP2
ATL-RSLdata/rsl/rsl_timeseries.csvbasin, basin_type, site, Age_BP, RSL_m, sigma, correction_tagP3
ATL-LUBdata/geo/atl_lubrication_markers.csvsample_id, L1..L4, evidence_ref, qa_flagP4
ATL-ONSETdata/strat/atl_opening_onset.csvsegment_id, onset_age, sigma_age, method, refP5

}

Recommended use of an additional field basin_type. For the P3 contrast (including the Indian Ocean), assign each RSL point a basin_type. Recommended values: PACIFIC, ATLANTIC, INDIAN. (basin is the geographic basin name, while basin_type is the mechanism-class label.)

Extension modules (optional): supporting P6/P8 (ATL DataPack v0.2 draft)

P6 (spatiotemporal propagation) and P8 (magnetic stripes) can require more complex data composition than P1–P5. Therefore, minimal schemas are separated as optional extension modules (the core compile/hard-gate can run without them).

{

ModuleFile (example)Key fields (minimum; defined in codebook)Linked prediction
ATL-PROPdata/strat/atl_opening_points.csvpoint_id, lat, lon, onset_age, sigma_age, segment_id, refP6
ATL-MAGdata/mag/mag_anomaly_profiles.csvprofile_id, ridge_seg, x_km, anomaly_nT, sampling_km, proc_tag, refP8

}

Cross-validation extension modules (optional): supporting P9/P10/P11/P12 (ATL DataPack v0.3 draft)

The modules below are not mandatory hard gates for C1–C3, but are included for independent cross-validation of “event-like” claims.

{

ModuleFile (example)Key fields (minimum; defined in codebook)Linked prediction
ORO-UHPdata/orogeny/uhp_exhumation_cases.csvcase_id, belt, P_max_GPa, t_peak_Ma, t_exhum_Ma, dt_Myr, refP9
ATL-SEDdata/sediment/atl_sed_thickness_transects.csvtransect_id, distance_km, crust_age_Ma, sed_thick_m, ref, qa_flagP10
ATL-DRILLdata/sediment/atl_drill_sites.csvsite_id, lat, lon, basement_age_Ma, sed_rate_m_Myr, method, refP10
MN-NODdata/sediment/mn_nodule_samples.csvsample_id, lat, lon, size_mm, age_est_Myr, method, refP11
VOLC-AGEdata/volc/volcano_construction_ages.csvvolcano_id, name, region, T80_ka, volume_km3, method, refP12

}

Minimum codebook/provenance requirements (common to all modules).

ATL DataPack v0.8 draft (v1.34): hydro/climate/friction/isotope extension modules

This extension schema is not for the base mechanism (C1–C3), but for locking mandatory gates of additional variants (V-SLIP / V-HOLO / V-COUPLED) into data.

{

(A) Mechanism / hydro / climate / kinematics modules

Module IDFile (example)Key fields (minimum)Linked prediction
FRIC-PARdata/friction/plate_friction_params.csvregion, mu_s, alpha, Pf_MPa, tau_drive_MPa, refP13
KIN-CONdata/kinematics/plate_velocity_constraints.csvplate, v_now_m_yr, S_total_km, T_event_yr, refP14
THERM-BUDdata/thermal/thermal_budget_params.csvtau_MPa, A_km2, S_km, h_eff_km, eta_h, refP14
HYD-VOLdata/hydro/basin_volume_scenarios.csvscenario, deltaV_km3, T_yr, basin, refP15
RSL-CONdata/rsl/holocene_sea_level_constraints.csvsite, age_ka, rsl_m, sigma_m, method, refP15
CLIM-FWdata/climate/freshwater_proxy_timeseries.csvrecord, proxy, region, age_ka, value, unit, refP16
BIO-SPLdata/bio/atlantic_split_candidates.csvtaxon, pair, t_div_Ma, method, refP17
ISO-CASEdata/isotopes/open_system_casebook.csvcase, known_age_yr, method, apparent_age_Ma, mechanism, refP18
SLB-COMPdata/hydro/sea_level_budget_components.csvdate, sl_obs_mm, steric_mm, mass_mm, land_mm, sigma_mm, refP19
BASIN-PRXdata/hydro/basin_volume_change_proxy.csvproxy, basin, date, dV_km3, sigma_km3, refP19
RIV-MISdata/geomorph/misfit_rivers.csvriver, basin, W_valley_m, W_channel_m, Q_m3s, controls, refP20
DELTA-AGEdata/geomorph/mega_delta_ages.csvdelta, onset_ka, sigma_ka, method, criterion, refP20
KIN-MODdata/kinematics/plate_velocity_models.csvplate, model, epoch, frame, vx_mm_yr, vy_mm_yr, refP21

(B) Resource / sediment / discovery-bias modules

Module IDFile (example)Key fields (minimum)Linked prediction
PET-OILdata/petroleum/oil_provinces.csvprovince, lat, lon, type, basin, refP22
ICE-PROXdata/glacial/ice_extent_proxies.csvregion, proxy, age_ka, value, unit, refP22/P23
REFUGIAdata/glacial/refugia_catalog.csvregion, glaciated, heatflow_mWm2, volc_index, clim_controls, refP23
LAKE-ENDdata/hydro/endorheic_lakes.csvBMN, site, lat, lon, age_kyr, status, trend, num14CP24
SHELF-Wdata/geomorph/shelf_width_profiles.csvprofile, margin_type, shelf_width_km, canyon_density, drainage_area_km2, refP25
UNCON-Sdata/strat/unconformity_sites.csvsite, age_min_Ma, age_max_Ma, weathering, hiE_marker, refP26
POLY-CASEdata/strat/polystrate_cases.csvcase, env, unit, age, notes, refP27
COAL-CASEdata/strat/coal_marine_cases.csvcase, coal_type, marine_fossils, rootlets, sed_struct, refP28
EVT-WINdata/meta/event_window_estimates.csvmodule, proxy_class, t_center_ka, sigma_ka, sign, weight, method, ref, includeP29
CTRL-REGdata/meta/controls_registry.csvmodule, control_type, control_id, selector, notes, refP30
COAL-RANKdata/geo/coal_rank_stress_cases.csvbasin, country, tectonic_setting, rank, Ro, stress_proxy, burial_proxy, refP31
PET-BUDGdata/petroleum/oil_source_budget_cases.csvcase_id, province, oil_in_place_bbl, recoverable_bbl, basin_area_km2, source_thickness_m, toc_frac, eta_conv, eta_exp, refP32
PET-THERMdata/petroleum/oil_thermal_context.csvprovince, geothermal_gradient_C_km, heatflow_mWm2, intrusion_distance_km, Ro, tmax_C, phase, refP33
PET-AGEdata/petroleum/petroleum_chronometers.csvprovince, age_Ma, sigma_Ma, method, material, interpretation, qa_flag, refP34
PET-DISCdata/petroleum/oil_discovery_bias_cases.csvbasin, sed_flux_proxy, shelf_sed_thickness_m, wells, seismic_km2, exploration_years, discovered_bboe, giant_fields, refP35

}