Changes between Version 2 and Version 3 of mmCIFWriting


Ignore:
Timestamp:
Feb 5, 2018, 4:20:00 PM (8 years ago)
Author:
Greg Couch
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • mmCIFWriting

    v2 v3  
    55* Information needed to reconstruct some tables is discarded after being used (//e.g.//, the sheet information for strands)
    66* Whether or not the structure has been edited incompatibly is unknown, so associated metadata may or may not still be valid
    7 * intra-residue metal coordination bonds
    87
    9 == First Pass ==
     8== Milestones ==
    109
    11 The [[http://www.rbvi.ucsf.edu/chimerax/docs/devel/mmcif.html|ChimeraX Fast mmCIF Guidelines]] give ChimeraX's requirements for reading.
     10The [[http://www.rbvi.ucsf.edu/chimerax/docs/devel/mmcif.html|ChimeraX Fast mmCIF Guidelines]] gives ChimeraX's requirements for reading.
    1211
    13 The minimal set of tables needed to properly reconstruct the connectivity are the **entity_poly_seq**, **atom_site**, and **struct_conn** tables.  Minimal work for the **struct_conn** table, for example, hydrogen bonds would be skipped.
     12 v1::
     13  The minimal set of tables needed to properly reconstruct the connectivity are the **entity_poly_seq**, **atom_site**, and **struct_conn** tables.  Minimal work for the **struct_conn** table, just disufide and inter-residue metal coordination bonds
    1414
    15 After that, for completeness, we should add the **entity**, **chem_comp**, and **atom_site_anisotrop** tables.  And the **audit_conform** table with our annotation that all keywords are in lowercase.
     15 v2::
     16  For completeness (define stuff that **id**'s refer to), add the **entity** and **chem_comp** tables.  And the **audit_conform** table with our annotation that all keywords are in lowercase.  Reuse entity descriptions from metadata if possible.
    1617
    17 Next, we should add support for writing the **atom_site** and **atom_site_anisotrop** tables with fixed width columns and add that annotation to the **audit_conform** table.
     18 v3::
     19  Add support for writing the **atom_site** and **atom_site_anisotrop** tables with fixed width columns and add that annotation to the **audit_conform** table.
    1820
    19 == Completing Connectivity ==
     21 v4::
     22  Add secondary structure annotations.  Just output helices and each strand in a separate sheet.  Fix someday when sheet information becomes available.
    2023
    21 All of the issues for the **struct_conn** table should be addressed:
     24 v5::
     25  Add symmetry metadata if saved (**pdbx_struct_assembly**, **pdbx_struct_assembly_gen**, **pdbx_struct_oper_list**)
    2226
    23 * intra-residue metal coordination bonds
    24 * hydrogen bonds
    25 * bonds missing from residue templates
     27 v6::
     28  Add hydrogen bonds -- maybe limit to inter-strand h-bonds?
    2629
    27 At this point, we could embed the residue templates (**chem_atom** and **chem_bond** tables) like the [[http://www.pdbe.org|PDBe]] does in its "updated" mmCIF files.
     30 v7::
     31  Add bonds missing from residue templates
    2832
    29 == Secondary Structure ==
     33 v8::
     34   Make sure intra-residue metal coordination bonds are handled correctly.
     35
     36 v9::
     37   Embed residue templates (**chem_atom** and **chem_bond** tables), like the [[http://www.pdbe.org|PDBe]] does in its "updated" mmCIF files.  Residues with metal coordination bonds in them have "ambiguous" connectivity.
     38
     39 v10::
     40   Review other metadata that is keep and try to incorporate it.
     41
     42== Secondary Structure Notes ==
    3043
    3144The secondary structure annotations, specifically the sheet annotations, are difficult since ChimeraX does not keep the original sheet information nor does it compute it.  Functionally, the currently needed secondary structure information can be computed with the internal DSSP code.  But those annotations don't always match the author's.  Ideally, the sheet information would both preserved and be computed, so the mmCIF file could contain that information.  (The sheet information would eventually be useful for a fancy cartoon representation too.)
    32 
    33 == Assemblies ==
    34 
    35 ChimeraX keeps the information needed to reconstruct various assemblies as metadata, and that can be written as is.  If the structure has been edited, it may or may not be valid.
    3645
    3746== Other Metadata ==