ECCC Paper 25 -- Annotating PDB Files with Scene Information

A poster by Gregory S. Couch, Eric F. Pettersen, Conrad C. Huang, and Thomas E. Ferrin

Computer Graphics Laboratory
University of California
San Francisco, California 94143-0446

Abstract

We have implemented extensions to the Brookhaven Protein Data Bank (PDB) file format for incorporating scene information such as viewing parameters, additional molecular information (e.g., van der Waals radii and atom colors) and user-defined graphics. These extensions were made in conformance with the PDB standard and provide sufficient information to render the scene in various styles such as space-filling images and ribbon diagrams. For the past five years these extensions have been used in the MidasPlus molecular modeling system and have proved both powerful and sufficient for generating complex molecular images. We propose that the extensions to the PDB presented here be adopted by the molecular modeling community for incorporation into visualization programs.

Introduction

The following four images, generated from the same annotated PDB file, show flavodoxin (colored red at the N terminus to blue at the C terminus) and a flavin mononucleotide (FMN) prosthetic group (colored magenta). Just the backbone atoms of flavodoxin are shown as well as the residues that are 3.5 Ångstroms or less away from FMN. Follow the links below to view a full resolution version of the thumbnail images (i.e., a 24-bit 640x512 TIFF image). The flavodoxin complex is the 3fxn entry from the PDB and the reference for the coordinates is: W. W. Smith, R. M. Burnett, G. D. Darling, M. L. Ludwig, "Structure of the Semiquinone Form of Flavodoxin From Clostridium MP. Extension of 1.8 Angstroms Resolution and some Comparisons with the Oxidized State," J. Mol. Biol., 117(?):195, 1977.

Original A PDB file being annotated in an interactive molecule viewer

Ribbon The annotated PDB file in a ribbon representation

Tube The same annotated PDB file in a tube representation

CPK The same annotated PDB file in a CPK representation

Advantages and Limitations

The advantages of our approach to annotating PDB files are:

  1. A scientist starts with any PDB file that conforms to the PDB standard.
  2. An annotated PDB file is produced that still conforms to the standard and that the scientist can still easily understand. There are no auxiliary files to worry about.
  3. The annotated file is editable with a standard text editor or other text processing software tools. In particular, if a interactive molecular viewer doesn't know how to add an annotation that a particular renderer needs, the scientist may do so manually.
  4. Scientists with only limited programming experience should still be able to write programs that manipulate annotations, since they consist of simple lines of text.
  5. Rendering is decoupled from annotating, so it is easy to incorporate new scene renderers.

Notwithstanding the advantages noted above, there are limitations to our method:

  1. The annotations are verbose and may consume a lot of space. However, this situation is mitigated by the omission of the most common scene annotations (i.e., color and radius records) when they duplicate a preceding annotation.
  2. The annotations listed are limited to a single scene -- they do not allow for multiple viewpoints to specified.
  3. More types of annotations could be defined: for instance, bond order (double, triple, resonance), bond color (for ball & stick representations), per residue representation styles (ribbon, ball & stick, CPK), extensible properties (e.g., backbone rms deviation in NMR ensembles, sequence homology classification, etc.), and other OpenGL features (e.g., material properties, textures, light sources). Of course, future extensions to the annotation specification can incorporate these attributes as need arises.

Other Methods

Other popular molecular visualization programs use different approaches to combining scene information with PDB files. We briefly discuss a few of these programs below.

Raster3D version 2.0 (with its companion interactive viewer) also separates interactive annotation and rendering into two phases, but abandons the PDB format in favor of a more space-efficient one, thus losing compatibility with existing programs which understand PDB format.

RasMol v2.4 combines an interactive viewer with a renderer. This technique has the advantage that one program is used for both functions, but loses the flexibility of easily incorporating different renderers or being able use filter programs to modify the annotations.

MolScript v1.4 is a PostScript renderer which can embed PDB files into its own input file format. This file format is very flexible, but has the disadvantage of being unintelligible to PDB-conformant programs.

Kinemage is both a file format and an interactive viewer for personal computers. Its chief advantage is that it supports multiple views and bond rotations, but its data files are not in PDB format and are also difficult to edit.

Acknowledgements

Parts of the preceding poster have been excerpted and paraphrased from a paper published in the Journal of Molecular Graphics 13, 3 (June 1995). A hard-copy reprint of this paper, with color images, is available by sending e-mail to tef@cgl.ucsf.edu.

This work was supported by the NIH National Center for Research Resources, grant P41-RR01081, and by the National Science Foundation, grant IRI-9116999.


25 October 1994 <gregc@cgl.ucsf.edu>