Published quarterly by the Research Collaboratory for Structural Bioinformatics Protein Data Bank

Message from RCSB PDB


Biocuration

The first SARS-CoV-2 structure, main protease (PDB ID 6lu7), was deposited on January 26, 2020 and was immediately biocurated so that it could be included in the earliest possible weekly update of the PDB.
Since then, more than 200 COVID-19-related structures have been submitted to the PDB archive. Each entry has been quickly reviewed and annotated by wwPDB biocurators following these “guiding principles”:

  1. Biocuration of COVID-19 structures is prioritized over that of other structures, including post-release revisions such as citation updates
  2. Authors are encouraged to release their structures immediately
  3. Consistent taxonomy name and ID (Severe acute respiratory syndrome coronavirus 2; 2697049) are applied to all COVID-19 structures
  4. Consistent UniProt referencing is incorporated: P0DTD1, P0DTC1, P0DTC2, P0DTC9

All released SARS-CoV-2 structures and related resources are highlighted at http://RCSB.org/covid19.


Data Access and Exploration

A major focus of the development team during the past year has been improvement of the RCSB.org website architecture to significantly revise and improve tools for searching and exploring PDB data. Working remotely, the team has been able to release the next generation of RCSB PDB tools, including search functionality that combines text, sequence, and coordinate queries. Other new and improved features include

  1. Basic Search available from the search box at the top of every RCSB PDB website page
  2. Advanced Search Boolean options
  3. Display of Advanced Search results for PDB structures, assemblies, or unique polymeric molecular entities (unique sequences)
  4. Left sidebar menus for refining Search Results
  5. Stored Search History
  6. New MyPDB features, including registration using Google, Facebook, or ORCID authentication, and email notifications of stored searches
  7. Improved Download Tool for downloading data files in batches
  8. New GraphQL and REST APIs for accessing and searching PDB data
  9. Sequence motif search

These tools can be used to search and analyze SARS-CoV-2 structures in the context of other structures in the PDB, including those from SARS and MERS.


Outreach and Education

To coincide with the release of the first SARS-CoV-2 structure (6lu7), the February Molecule of the Month focused on the Coronavirus Main Protease.

Since then, materials including molecular landscape paintings, downloadable illustrations, and a video demonstrating the effects of hand washing at the molecular level have been released into a Coronavirus Collection at PDB-101.

Newest features include the June Molecule of the Month article on the Coronavirus Spike Protein.


Snapshot: July 1, 2020
165,956 Released atomic coordinate entries
Molecule Type
153862 Proteins, peptides, and viruses
3405 Nucleic acids
8041 Protein/nucleic acid complexes
648 Other
Experimental Technique
147,393 X-ray
13,017 NMR
5,271 Electron Microscopy
169 Multi Method
69 Neutron Diffraction
37 Other