Documentation Index
Fetch the complete documentation index at: https://mintlify.com/google-deepmind/alphafold3/llms.txt
Use this file to discover all available pages before exploring further.
Overview
TheStructure class is the primary data structure for representing molecular structures in AlphaFold 3. It provides a comprehensive interface for working with protein, DNA, RNA, ligand, and other molecular entities.
Class Definition
Structure class represents molecular structures with a hierarchical organization:
- Chains: Individual molecular chains (proteins, nucleic acids, ligands)
- Residues: Amino acids, nucleotides, or chemical components
- Atoms: Individual atomic coordinates and properties
- Bonds: Chemical bonds between atoms
Constructor
Name or identifier for the structure (e.g., PDB ID)
Release date of the structure
Resolution of the structure in Angstroms (for experimental structures)
Experimental method used (e.g., ‘x-ray diffraction’, ‘electron microscopy’)
Biological assembly information
Chemical component dictionary data for non-standard residues
Table containing chain-level data (IDs, types, entity information)
Table containing residue-level data (names, IDs, author naming)
Table containing atomic data (names, elements, coordinates, b-factors)
Table containing bond connectivity information
Skip validation of foreign keys and table ordering (use with caution)
Properties
Structure Metadata
Structure name or identifier
Structure release date
Structure resolution in Angstroms
Experimental method used to determine the structure
Total number of atoms in the structure
Total number of chains in the structure
Number of models (for NMR structures or ensembles)
Tables
Access to the chains table with chain-level data
Access to the residues table with residue-level data
Access to the atoms table with atomic data
Access to the bonds table with connectivity information
Table of chains that have at least one resolved atom
Table of residues that have at least one resolved atom
Table of residues with no resolved atoms (missing from structure)
Atom-Level Arrays
Unique integer keys for each atom
Atom names (e.g., ‘CA’, ‘N’, ‘C’, ‘O’)
Element symbols (e.g., ‘C’, ‘N’, ‘O’, ‘S’)
X coordinates in Angstroms
Y coordinates in Angstroms
Z coordinates in Angstroms
B-factors (temperature factors) for each atom
Occupancy values for each atom (typically 1.0)
Chain ID for each atom (internal label_asym_id)
Chain type for each atom (e.g., ‘polypeptide(L)’, ‘polyribonucleotide’)
Author chain ID for each atom (from original PDB file)
Entity ID for each atom
Entity description for each atom
Residue sequence ID for each atom (1-indexed)
Residue names for each atom (e.g., ‘ALA’, ‘DA’, ‘U’, ‘HOH’)
Author residue sequence ID for each atom
Insertion codes for each atom
Masks
Boolean mask indicating protein atoms
Boolean mask indicating DNA atoms
Boolean mask indicating RNA atoms
Boolean mask indicating all nucleic acid atoms (DNA or RNA)
Boolean mask indicating ligand atoms
Boolean mask indicating water molecules
Methods
num_residues
If True, includes unresolved residues (missing from coordinates). If False, only counts residues with at least one resolved atom.
filter
Optional boolean array with length equal to num_atoms for direct masking
Whether to apply predicates to each element individually or to the whole column array
If True, removes entities matching the predicates instead of retaining them
Controls deletion of unresolved residues/chains:
FULL: Delete all unresolved residues and empty chainsCHAINS: Delete only chains with no resolved residues (default)NONE: Keep all unresolved residues and chains
Field-based filters using pattern
<table>_<column> (e.g., chain_id='A', atom_name=('CA', 'N'))filter_out
filter).
Same parameters as filter() but with invert=True applied automatically.
Example:
filter_to_entity_type
Include protein chains
Include DNA chains
Include RNA chains
Include ligand molecules
to_mmcif
Number of decimal places for atomic coordinates
to_mmcif_dict
iter_atoms
iter_residues
If True, includes residues with no resolved atoms
copy_and_update
**updates
Keyword arguments for fields to update (e.g.,
name='new_name', atom_x=new_coords)copy_and_update_coords
New coordinates array with shape
(..., num_atoms, 3) containing [x, y, z] coordinatesFactory Functions
from_mmcif
Contents of an mmCIF file
Optional structure name (defaults to mmCIF data_ field)
Convert selenomethionine (MSE) SE atoms to SD (sulphur)
Swap NH1/NH2 in arginine to ensure NH1 is closer to CD
Replace ‘N’ residue names in DNA chains with ‘DN’
Include water (HOH) molecules
Include non-standard entity types
Parse and include bond connectivity information
Which model to parse (integer ID,
ModelID.FIRST, or ModelID.ALL)from_res_arrays
Shape
(num_res, num_atom) indicating which atoms are present (nonzero = present)**kwargs
Field name to values mapping. Arrays should have shape
(num_res,) for chain/residue fields or (num_res, num_atom) for atom fieldsRelated Types
CascadeDelete
Bond
See Also
- mmCIF Parsing Utilities - Functions for parsing mmCIF files
- Structure Tables - Chains, Residues, Atoms, and Bonds tables