DATA QUERY, REPORTING AND ACCESS

FASTA SEQUENCE FILES ON RCSB PDB FTP ARCHIVES

The RCSB PDB maintains several FASTA-formatted sequence files on the FTP archives. The sequences for all currently released experimental structures are contained in pdb_seqres.txt, available in uncompressed form at ftp://ftp.rcsb.org/pub/pdb/derived_data/pdb_seqres.txt and in Unix-compressed (".Z") format at ftp://ftp.rcsb.org/pub/pdb/derived_data/pdb_seqres.txt.Z. These two files contain all sequences for structures queried on the Home Page, QuickSearch, SearchLite, and SearchFields.

PDB depositors are given the opportunity to prerelease the sequences of their structures before releasing the coordinate data. Prereleased sequences for unreleased structures are contained in the separate file pre-released.seq, available in uncompressed form at ftp://ftp.rcsb.org/pub/pdb/derived_data/index/pre-released.seq. Unreleased structures can be queried on the Status Query page.
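
For readers who want to work with these files locally, the following is a minimal Python sketch of reading FASTA-formatted text into a lookup table. It relies only on the general FASTA convention (a header line beginning with ">" followed by one or more sequence lines). The function names, the sample records, and the choice to key each entry on the first word of its header are illustrative assumptions, not a description of the exact header layout used in pdb_seqres.txt or pre-released.seq.

```python
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    chunks = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence data may be wrapped across several lines.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def index_by_first_word(records: Iterable[Tuple[str, str]]) -> Dict[str, str]:
    """Map the first word of each header to its sequence (assumed key choice)."""
    index = {}
    for header, sequence in records:
        words = header.split()
        key = words[0] if words else header
        index[key] = sequence
    return index


# Hypothetical sample records, not taken from the archive.
SAMPLE = """\
>1abc_A example protein chain
MKTAYIAKQR
QISFVKSHFS
>1abc_B example peptide
GSHM
"""

sequences = index_by_first_word(read_fasta(SAMPLE.splitlines()))
print(sequences["1abc_A"])
```

Because an open file handle yields lines just as the sample list does, passing one to read_fasta should work the same way on a locally downloaded copy of either file.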


BATCH FILE DOWNLOAD SCRIPT NOW AVAILABLE

A script to download large numbers of files from the PDB FTP site is now available at ftp://ftp.rcsb.org/pub/pdb/software/getPdbStructures.pl. This simple Perl script can be run locally to download files from a user's list of PDB IDs. Options are available to download coordinate files in either PDB or mmCIF format, as well as experimental data files. The script creates a directory structure for the downloaded files. Further details regarding usage of this script can be found at ftp://ftp.rcsb.org/pub/pdb/software/getPdbStructures.html.
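
The general idea of working from an ID list can be sketched in a few lines of Python. This is not the getPdbStructures.pl script and does not reproduce its options or its directory layout. The function names, the file extensions, the rule that an ID is four alphanumeric characters, and the format/middle-two-characters folder scheme are all assumptions made for illustration. The sketch only cleans a list of IDs and plans where each file would be stored; it performs no network access.

```python
from pathlib import Path
from typing import Iterable, List

# Assumed extensions for the two coordinate formats (illustrative only).
EXTENSIONS = {"pdb": ".pdb", "mmcif": ".cif"}


def clean_ids(lines: Iterable[str]) -> List[str]:
    """Return unique, lower-cased IDs in their original order.

    Blank lines and text after '#' are ignored. IDs are assumed to be
    four ASCII alphanumeric characters.
    """
    seen = set()
    ids = []
    for raw in lines:
        pdb_id = raw.split("#", 1)[0].strip().lower()
        if not pdb_id:
            continue
        if len(pdb_id) != 4 or not (pdb_id.isascii() and pdb_id.isalnum()):
            raise ValueError(f"not a valid ID: {raw!r}")
        if pdb_id not in seen:
            seen.add(pdb_id)
            ids.append(pdb_id)
    return ids


def local_path(root: str, pdb_id: str, fmt: str) -> Path:
    """Plan a local path: root/<format>/<middle two characters>/<id><ext>."""
    if fmt not in EXTENSIONS:
        raise ValueError(f"unknown format: {fmt!r}")
    return Path(root) / fmt / pdb_id[1:3] / (pdb_id + EXTENSIONS[fmt])


# Hypothetical ID list, as it might appear in a user's text file.
wanted = clean_ids(["1ABC", "", "2xyz  # second entry", "1abc"])
for pdb_id in wanted:
    print(local_path("downloads", pdb_id, "mmcif").as_posix())
```

Grouping files into subfolders keeps any single directory from growing too large when thousands of entries are fetched, which is the main reason to plan a directory structure before downloading.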


WEBSITE STATISTICS

The RCSB PDB is available from several Web and FTP sites located around the world. Users are also invited to preview the newly reengineered RCSB web site at www.rcsb.org/pdb.

Access statistics for the primary RCSB PDB website at www.pdb.org are given below.