Data Exploration Services

RCSB.org is visited by millions of users each year. Traffic is tracked using internally-developed tools and filtered to remove robotic access.


Month Unique Visitors Visits Bandwidth
July 2022 560,727 2,883,781 47.5 TB
August 2022 558,639 2,714,138 33.63 TB
September 2022 632,435 2,795,598 53.48 TB
Decorative icon

RCSB PDB has recently introduced DNS names for programmatic access to PDB archive downloads:

The File Download Services documentation has detailed information.

Starting September 2023, RCSB PDB will start enforcing use of these updated DNS names. URLs in which the DNS name doesn’t match the protocol (e.g., https://ftp.rcsb.org, ftp://files.rcsb.org) will no longer work at that time.

Users who download PDB archive data programmatically are encouraged to switch to the new DNS names as soon as possible. HTTPS protocol is preferred (over FTP) for individual file downloads.

Please contact info@rcsb.org with any questions.

Mol* view of PDB structure 1dtj. To select this coloring scheme, select the “Experimental Support Confidence” option under Quality Assessment.

Mol* view of PDB structure1dtj. To select this coloring scheme, select the “Experimental Support Confidence” option under Quality Assessment.

A new Mol* 3D option displays PDB structures determined with X-ray crystallography using a coloring scheme similar to what is used to show Computed Structure Models confidence levels. This feature is available for most crystal structures in the PDB with structure factors.

This display option was developed using individual-residue Real Space Correlation Coefficient (RSCC) values that indicate how well the atomic coordinates for an individual residue are supported by X-ray crystallographic experimental data. Individual RSCC values vary with both residue chemical structure and structure resolution. Therefore, the RSCC values for each type of amino acid in PDB X-ray structures at different resolutions were ranked from lowest to highest, indicating worse to better experimental data support for residue atomic coordinates.

Amino acid residues in any PDB structure are colored by their RSCC value ordinal ranking:

  • Blue (25-100%): very well resolved residues, very high confidence
  • Cyan (5-25%): well resolved residues, high confidence
  • Yellow (1-5%): outlier residues, low confidence
  • Orange (<1%): extreme outlier residues, very low confidence

With the RSCC-based confidence score and color scheme, RSCC outliers are readily apparent in ribbon representation 3D graphical displays, in a color scheme consistent with the pLDDT (predicted lDDT-Cα)-based confidence coloring scheme used for Computed Structure Models.

For a description of how the quality measures were developed, see
Assessing PDB macromolecular crystal structure confidence at the individual amino acid residue level, Structure (2022) doi: 10.1016/j.str.2022.08.004

This study also demonstrates that while RSCC values in a crystal structure are correlated with pLDDT scores in the corresponding AlphaFold2 model, crystal structures in the PDB are generally more reliable than AlphaFold2 models.

Mol* also supports a coloring scheme based on wwPDB Validation Reports that indicates the number of geometry related problems for a specific residue (e.g., geometry outliers, and clashes).

A new Mol* 3D option displays PDB structures determined with X-ray crystallography using a coloring scheme similar to what is used to show Computed Structure Models confidence levels. This feature is available for most crystal structures in the PDB with structure factors.

This display option was developed using individual-residue Real Space Correlation Coefficient (RSCC) values that indicate how well the atomic coordinates for an individual residue are supported by X-ray crystallographic experimental data. Individual RSCC values vary with both residue chemical structure and structure resolution. Therefore, the RSCC values for each type of amino acid in PDB X-ray structures at different resolutions were ranked from lowest to highest, indicating worse to better experimental data support for residue atomic coordinates.
Amino acid residues in any PDB structure are colored by their RSCC value ordinal ranking:

  • Blue (25-100%): very well resolved residues, very high confidence
  • Cyan (5-25%): well resolved residues, high confidence
  • Yellow (1-5%): outlier residues, low confidence
  • Orange (<1%): extreme outlier residues, very low confidence

With the RSCC-based confidence score and color scheme, RSCC outliers are readily apparent in ribbon representation 3D graphical displays, in a color scheme consistent with the pLDDT (predicted lDDT-Cα)-based confidence coloring scheme used for Computed Structure Models.

For a description of how the quality measures were developed, see

Assessing PDB macromolecular crystal structure confidence at the individual amino acid residue level, Structure (2022) doi: 10.1016/j.str.2022.08.004
This study also demonstrates that while RSCC values in a crystal structure are correlated with pLDDT scores in the corresponding AlphaFold2 model, crystal structures in the PDB are generally more reliable than AlphaFold2 models.
Mol* also supports a coloring scheme based on wwPDB Validation Reports that indicates the number of geometry related problems for a specific residue (e.g., geometry outliers, and clashes).

Use the Chemical Sketch Tool to draw or edit a molecule, then use the resulting SMILES or InChI string to search for matching molecular definitions in the PDB Chemical Component Dictionary.

This tool uses the web-based chemical drawing tool Marvin JS from ChemAxon to compose chemical sketches for chemical similar and substructure searches. Documentation is available.

Access chemical sketch tool

Access the Chemical Sketch Tool from the top Search menu or from Advanced Search in the Chemical Search options.

Activate CSM search on rcsb.org

Basic Sequence Search. Use the slider button to include Computer Structure Models in the Search Results.

Enter a sequence in the top search bar to search for matching PDB entities.

Sequence Searches at RCSB.org use MMseqs2 to find similar protein and nucleic acid sequences in the archive.

For example, a search for the sequence FVNQHLCGSHLVEALYLVCGERGFFYTPKT will currently return >400 entities for exploration. Including the option for CSMs will add ~100 more examples.

Filtering the search results on rcsb.org

Search Results including PDB experimental structures and CSMs. A) Use the Refinements panel on the left to limit the results based on characteristics including Structure Determination Method and Source Organism. B) Results can be sorted using the pulldown feature on the right.

References

RCSB PDB Team Photo

RCSB PDB team pictured at the January 2020 Cloud Technologies Best Practices hosted by the Institute for Quantitative Biomedicine at Rutgers, The State University of New Jersey.

Join RCSB PDB to design, develop, & deploy modern web and data applications & complex user interfaces. Help accelerate research and training in biology, medicine, & related disciplines. Positions are available at Rutgers and SDSC/UCSD.

For more, visit our Careers page or Contact Us with questions.