Clade-specific methylation readers
Methylation is read in clade-specific ways because it has three architectures, not one. Plants are read at the bulk level across CG, CHG, and CHH contexts; insects need per-gene resolution, since their methylation, where it exists, is targeted to gene bodies and invisible to a bulk reading. An auto-detector routes each genome.
The methylation substrate of §9 is vertebrate-shaped, reading global CpG depletion, but methylation has three architectures that each need a different reading. Plants are read at the bulk level across CG, CHG, and CHH — six genomes including the basal moss show depletion in CG (0.52–0.77) and CHG (0.68–0.94), the RdDM signature — while insects show no bulk depletion at all (even the methylating honeybee sits near 1.4), so their targeted gene-body methylation is legible only per gene, where honeybee carries a low-CpG gene class that Diptera lack. An auto-detector reads bulk first and escalates to per-gene only when no global signal is present, correctly routing vertebrate, plant, targeted-insect, and non-methylating-insect genomes.
Methylation is an architecture, not just a context
The methylation substrate is not only which context is marked but how the marking is laid down, and that organising scheme differs by clade. Vertebrates lay a global CG blanket whose depletion shows in any window; plants lay a global CG, CHG, and CHH pattern; insects, where they methylate at all, target only a subset of gene bodies.
Reading the wrong way gives a false negative. A bulk-CpG reading, the natural instrument for the global vertebrate blanket, sees nothing in the honeybee because its mark is targeted, so the clade reader must first ask which architecture is present.
Plants: read CG, CHG, and CHH at the bulk level
The plant reader measures all three methylation contexts in bulk, and the non-CG depletion is the plant signature. Across six genomes — Arabidopsis, rice, maize, soybean, tomato, and the basal moss Physcomitrium — CG O/E falls to 0.52–0.77 and CHG O/E to 0.68–0.94, both depleted, while CHH stays near 1.1 as the sparse asymmetric context.
That a vertebrate-style CpG-only reader would miss the entire CHG axis is the point. The depletion in CHG is written by the RdDM pathway and is absent from animals, so it is the clade-distinguishing mark a general reader must capture.
Insects: targeted gene-body methylation, read per gene
No insect shows bulk methylation depletion, so the insect reader works at per-gene resolution. Even the honeybee, which has functional DNA methylation, sits at bulk CG O/E near 1.4 — its mark is targeted, not global — while the non-methylating fly and mosquito sit near 1.0.
The signal appears only when genes are read one by one. The honeybee carries a low-CpG, body-methylated gene class — per-gene CpG O/E spread of 0.31 with 12% of genes below 0.6, the established bimodal signature — that Diptera lack entirely, marking their substrate as inert and their regulation as methylation-independent.
One auto-detector routes every regime
A single detector reads the bulk contexts first and escalates to per-gene resolution only when no global signal is present. With that order it routes all four regimes correctly: vertebrate global CG, plant global CG with CHG and CHH, targeted insect gene-body, and non-methylating insect.
The principle is to read the substrate the way the clade writes it. Bulk reading suits the global blankets of vertebrates and plants; per-gene reading recovers the targeted insect mark that bulk reading would discard as absent.
What no gray zone means here
No gray zone means nothing is silent, not that there are zero unknowns. Of sixteen quantities, fourteen are positively evidenced, and the two open items name their closing datasets — per-context bisulfite levels, and the larger gene panels that would resolve the weak-signal insects such as wasp, silkmoth, and beetle that sit below the per-gene detection threshold at this sample size.