FindGeo is used to determine the coordination geometry of metals in complexes and proteins for which a PDB structure is available. It works by finding the best superposition between the atoms that coordinate the metal in the input structure and the atoms that coordinate the metal in structural templates with idealized geometries. The RMSD calculated over the superimposed structures provides a quantitative measure of the similarity between the configuration of the coordinating ligands and the various geometries possible for the metal coordination number, which can then be ranked to identify the best geometry assignment.
In this short guide, numbers in parentheses refer to red boxes in the Figures.
The output files produced by FindGeo for each metal are:
A PDB file called findgeo.input with the coordinates of the metal and of the atoms that coordinate the metal, as extracted from the input PDB file. In this file the metal is always in the first row, and the atoms that coordinate the metal are listed according to their order in the input PDB file. An example of this file is as follows:
HETATM 2029 ZN ZN A 262 -6.677 -1.626 15.471 1.00 5.67 ZN ATOM 722 NE2 HIS A 94 -7.486 -0.334 13.493 1.00 5.58 N ATOM 920 ND1 HIS A 119 -8.295 -2.377 16.601 1.00 4.43 N ATOM 743 NE2 HIS A 96 -5.890 -3.503 14.906 1.00 4.07 N
------------------------------------------------------------------------------------------------ Geometries tested | Tag| RMSD ------------------------------------------------------------------------------------------------ tri - TRIGONAL PLANE |Irregular| 1.186 tev - TETRAHEDRON WITH A VACANCY | Regular| 0.281 spv - SQUARE PLANE WITH A VACANCY |Irregular| 1.454 ------------------------------------------------------------------------------------------------ Best geometry of the site: tev (regular)
----------------------------------------------- Coordinates of template with ideal tev geometry ----------------------------------------------- 0.000 0.000 0.000 1.732 -1.732 -1.732 -1.732 -1.732 1.732 1.732 1.732 1.732 --------------------------------------------- Coordinates of the fitted metal site and RMSD --------------------------------------------- 0.000 0.000 0.000 0.000 1.744 -1.759 -1.692 0.050 -1.550 -1.617 1.996 0.340 1.607 1.555 2.000 0.345 -------------------- Total RMSD: 0.281 --------------------
ATOM 1 M TEV 1 0.000 0.000 0.000 ATOM 2 L TEV 1 1.732 -1.732 -1.732 ATOM 3 L TEV 1 -1.732 -1.732 1.732 ATOM 4 L TEV 1 1.732 1.732 1.732 TER ATOM 2029 ZN ZN A 262 0.000 0.000 0.000 ATOM 722 NE2 HIS A 94 1.744 -1.759 -1.692 ATOM 920 ND1 HIS A 119 -1.550 -1.617 1.996 ATOM 743 NE2 HIS A 96 1.607 1.555 2.000
ATOM 1 M TEV 1 0.000 0.000 0.000 ATOM 2 L TEV 1 1.732 -1.732 -1.732 ATOM 3 L TEV 1 -1.732 -1.732 1.732 ATOM 4 L TEV 1 1.732 1.732 1.732 TER ATOM 2029 ZN ZN A 262 0.000 0.000 0.000 ATOM 722 NE2 HIS A 94 1.452 -1.464 -1.409 ATOM 920 ND1 HIS A 119 -1.091 -1.138 1.405 ATOM 743 NE2 HIS A 96 1.131 1.095 1.408
FindGeo is also available as a stand-alone version which has all the functionality of the web version. It produces the same output files described above (see Output formats), which are placed in subdirectories each called by the identifier of the metal that they refer to (e.g., ZN_262_2041_A). In addition, an output file called FindGeo.summary is produced, which lists the coordination geometries for all the metals identified in the input structure.
python findgeo.py [options] –p <pdbfile> (To use a local PDB file as input) python findgeo.py [options] –c <pdbcode> (To download a PDB file from the PDB website)
-h --help Help; print a brief reminder of command line usage and all available options. -w --wdir Specify the directory where the input PDB file is found (when the –p option is used) or is to be downloaded (when the –c option is used), and where the output is to be placed. Default is ./. -t --threshold Specify the threshold distance for metal coordination by donor atoms. Default is 2.8 Å. -e --excluded_donors Specify the chemical symbol(s) of the atom(s) (separated by commas) that must not be taken into account for metal coordination. Default is C and H. -m --metal Specify the chemical symbol of the metal whose coordination geometry is to be determined. Default is all metals. -o --overwrite Overwrite existing files and directories with the same names as those generated by the program. By default, this option is not active and the program will stop when trying to create files or directories that already exist.