873 Accuracy on structures containing low homology RNA-only or DNA-only complexes from the recent PDB
874 evaluation set. Comparison between AlphaFold 3 and RoseTTAFold2NA (RF2NA) (RNA: N=29 structures, paired
Wilcoxon signed-rank test, p=1.6 * 10−7 875 ; DNA: N=63 structures, paired two-sided Wilcoxon signed-rank test, p=5.2
* 10−12 876 ). Note RF2NA was only trained and evaluated on duplexes (chains forming at least 10 hydrogen bonds), but
877 some DNA structures in this set may not be duplexes. Box, centerline, and whiskers boundaries are at (25%, 75%)
878 intervals, median, and (5%, 95%) intervals. c Predicted structure of a mycobacteriophage immunity repressor
879 protein bound to double stranded DNA (PDB ID 7R6R), coloured by pLDDT (left; orange: 0-50, yellow: 50-70,
880 cyan 70-90, and blue 90-100) and chain id (right). Note the disordered N-terminus not entirely shown. d, Predicted
881 aligned error (PAE) per token-pair for the prediction in c with rows and columns labelled by chain id and green
882 gradient indicating PAE.
883
884 Extended Data Figure 6 | Analysis and examples for modified proteins and nucleic acids. a, Accuracy on
885 structures
886 containing common phosphorylation residues (SEP, TPO, PTR, NEP, HIP) from the recent PDB evaluation set.
887 Comparison between AlphaFold 3 with phosphorylation modelled, and AlphaFold 3 without modelling
phosphorylation (N=76 clusters, paired two-sided Wilcoxon signed-rank test, p=1.6 * 10−4 888 ). Note, to predict a
889 structure without modelling phosphorylation, we predict the parent (standard) residue in place of the modification.
890 AlphaFold 3 generally achieves better backbone accuracy when modelling phosphorylation. Error bars indicate
891 exact binomial distribution 95% confidence intervals. b, SPOC domain of human SHARP in complex with
892 phosphorylated RNA polymerase II C-terminal domain (PDB ID 7Z1K), predictions coloured by pLDDT (orange:
893 0-50, yellow: 50-70, cyan 70-90, and blue 90-100). Left: Phosphorylation modelled (mean pocket-aligned RMSDCα
894 2.104 Å). Right: Without modelling phosphorylation (mean pocketaligned RMSDCα 10.261 Å). When excluding
895 phosphorylation, AlphaFold 3 provides lower pLDDT confidence on the phosphopeptide. c, Structure of parkin
896 bound to two phospho-ubiquitin molecules (PDB ID 7US1), predictions similarly coloured by pLDDT. Left:
897 Phosphorylation modelled (mean pocket-aligned RMSDCα 0.424 Å). Right: Without modelling phosphorylation
898 (mean pocket-aligned RMSDCα 9.706 Å). When excluding phosphorylation, AlphaFold 3 provides lower pLDDT
899 confidence on the interface residues of the incorrectly predicted ubiquitin. d, Example structures with modified
900 nucleic acids. Left: Guanosine monophosphate in RNA (PDB ID 7TNZ, mean pocket-aligned modified residue
901 RMSD 0.840 Å). Right: Methylated DNA cytosines (PDB ID 7SDW, mean pocket-aligned modified residue RMSD
902 0.502 Å). Welabel residues of the predicted structure for reference. Ground truth structure in grey; predicted protein
903 in blue, predicted RNA in purple, predicted DNA in magenta, predicted ions in orange, with predicted modifications
904 highlighted via spheres
905
906 Extended Data Figure 7 | Model accuracy with MSA size and number of seeds. a, Effect of MSA depth on
907 protein prediction accuracy. Accuracy is given as single chain LDDT score and MSA depth is computed by counting
908 the number of non-gap residues for each position in the MSA using the Neff weighting scheme and taking the median
909 across residues (see Methods for details on Neff). MSA used for AF-M 2.3 differs slightly from AF3; the data uses
910 the AF3 MSA depth for both to make the comparison clearer. The analysis uses every protein chain in the low
911 homology Recent PDB set, restricted to chains in complexes with fewer than 20 protein chains and fewer than 2,560
912 tokens (see Methods for details on Recent PDB set and comparisons to AF-M 2.3). The curves are obtained through
913 Gaussian kernel average smoothing (window size is 0.2 units in log10(Neff)); the shaded area is the 95% confidence
914 interval estimated using bootstrap of 10,000 samples. b, Increase in ranked accuracy with number of seeds for
915 different molecule types. Predictions are ranked by confidence, and only the most confident per interface is scored.
916 Evaluated on the low homology recent PDB set, filtered to less than 1,536 tokens. Number of clusters evaluated:
917 dna-intra=386, protein-intra=875, rnaintra=78, protein-dna=307, protein-rna=102, protein-protein
918 (antibody=False)=697, protein-protein (antibody=True)=58. Confidence intervals are 95% bootstraps over 1,000
919 samples.
920
921 Extended Data Figure 8 | Relationship between confidence and accuracy for protein interactions with ions,
922 bonded ligands and bonded glycans. Accuracy is given as the percentage of interface clusters under various
923 pocket-aligned RMSD thresholds, as a function of the chain pair ipTM of the interface. The ions group includes both
ACCELERATED ARTICLE PREVIEW