Difference between revisions of "File formats/Coordinates"

From Jmol
Jump to navigation Jump to search
(PQR: update: how to open PQR files)
(load CIF ligands direct from PDB)
Line 10: Line 10:
 
''Feel free to add a subsection with a description of a format you know well.''
 
''Feel free to add a subsection with a description of a format you know well.''
  
 +
<br />
  
 
== MOL and SD (Symyx MDL) ==
 
== MOL and SD (Symyx MDL) ==
Line 60: Line 61:
  
 
[{{SVN Trunk}}Jmol-datafiles/mol2/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/mol2/ Example files].
 +
 +
<br />
  
 
== PDB ==
 
== PDB ==
Line 80: Line 83:
 
Jmol can get files directly from the Protein Data Bank website.  
 
Jmol can get files directly from the Protein Data Bank website.  
 
To do this, you put an equal sign (=) just before the 4-character PDB ID code (without any extensions). Note that the unsigned applet is not allowed connection to external servers and so does not support this method.
 
To do this, you put an equal sign (=) just before the 4-character PDB ID code (without any extensions). Note that the unsigned applet is not allowed connection to external servers and so does not support this method.
 +
 +
See also [[#CIF|CIF]] for loading ligands directly from the PDB database.
  
 
Example:
 
Example:
Line 93: Line 98:
 
  set loadFormat = "<nowiki>http://www.rcsb.org/pdb/files/%FILE.pdb</nowiki>"
 
  set loadFormat = "<nowiki>http://www.rcsb.org/pdb/files/%FILE.pdb</nowiki>"
 
  set loadFormat = "<nowiki>http://www.ebi.ac.uk/msd-srv/oca/oca-bin/save-pdb?id=%FILE</nowiki>"
 
  set loadFormat = "<nowiki>http://www.ebi.ac.uk/msd-srv/oca/oca-bin/save-pdb?id=%FILE</nowiki>"
 +
 +
<br />
  
 
== XYZ ==
 
== XYZ ==
Line 108: Line 115:
  
 
[[File_formats/Formats/XYZ|Details about the XYZ format]].
 
[[File_formats/Formats/XYZ|Details about the XYZ format]].
 +
 +
<br />
  
 
== CIF ==
 
== CIF ==
Line 120: Line 129:
 
[{{SVN Trunk}}Jmol-datafiles/cif/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/cif/ Example files].
  
 +
'''''Special use:'''''
 +
 +
Jmol can get files directly from the ligand collection at the Protein Data Bank website.
 +
To do this, you put two equal signs (==) just before the 3-character PDB CIF ID code (without any extensions). Note that the unsigned applet is not allowed connection to external servers and so does not support this method.
 +
 +
Example:
 +
load ==2TY
 +
 +
By default, Jmol will connect to the PDB server at http://www.rcsb.org/pdb/files/ligand/
 +
To force the use of another server, you can do:
 +
set ligandLoadFormat = "  "
 +
and put between the quotes the url in the proper request format; use <code>%FILE</code> at the position where the PDB CIF ID should be inserted into the url.
 +
 +
<br />
  
 
== mmCIF ==
 
== mmCIF ==
Line 128: Line 151:
 
[{{SVN Trunk}}Jmol-datafiles/cif/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/cif/ Example files].
  
 +
<br />
  
 
== Alchemy (Tripos) ==
 
== Alchemy (Tripos) ==
Line 137: Line 161:
 
[{{SVN Trunk}}Jmol-datafiles/alchemy/ Example files] supported by Jmol.
 
[{{SVN Trunk}}Jmol-datafiles/alchemy/ Example files] supported by Jmol.
  
 +
<br />
  
 
== GAMESS ==
 
== GAMESS ==
Line 145: Line 170:
 
[{{SVN Trunk}}Jmol-datafiles/gamess/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/gamess/ Example files].
  
 +
<br />
  
 
== Gaussian ==
 
== Gaussian ==
Line 152: Line 178:
 
There are [{{SVN Trunk}}Jmol-datafiles/gaussian/ example files] of Gaussian input, output and log.
 
There are [{{SVN Trunk}}Jmol-datafiles/gaussian/ example files] of Gaussian input, output and log.
  
 +
<br />
  
 
== Cube (Gaussian) ==
 
== Cube (Gaussian) ==
Line 163: Line 190:
 
[{{SVN Trunk}}Jmol-datafiles/cube/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/cube/ Example files].
  
 +
<br />
  
 
== GROMACS ==
 
== GROMACS ==
Line 182: Line 210:
 
:#  Load the .pdb file into Jmol (using the [http://www.stolaf.edu/academics/chemapps/jmol/docs/#loadtrajectory trajectory] option) and animate.
 
:#  Load the .pdb file into Jmol (using the [http://www.stolaf.edu/academics/chemapps/jmol/docs/#loadtrajectory trajectory] option) and animate.
  
 +
<br />
  
 
== HIV / HIN (Hyperchem) ==
 
== HIV / HIN (Hyperchem) ==
Line 190: Line 219:
 
[[File formats/Formats/HIN|Details of HIN format]].
 
[[File formats/Formats/HIN|Details of HIN format]].
  
 +
<br />
  
 
== MOPAC ==
 
== MOPAC ==
Line 200: Line 230:
 
[{{SVN Trunk}}Jmol-datafiles/mopac/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/mopac/ Example files].
  
 +
<br />
  
 
== PQR ==
 
== PQR ==
Line 215: Line 246:
  
 
[{{SVN Trunk}}Jmol-datafiles/pqr/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/pqr/ Example files].
 +
 +
<br />
  
 
== Amber ==
 
== Amber ==
Line 228: Line 261:
 
[{{SVN Trunk}}Jmol-datafiles/amber/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/amber/ Example files].
  
 +
<br />
  
 
== V3000 (Symyx MDL) ==
 
== V3000 (Symyx MDL) ==
 
See [[File_formats#MOL_and_SD_.28Symyx_MDL.29|MOL]].
 
See [[File_formats#MOL_and_SD_.28Symyx_MDL.29|MOL]].
  
 +
<br />
  
 
== Q-Chem ==
 
== Q-Chem ==
Line 238: Line 273:
 
[{{SVN Trunk}}Jmol-datafiles/qchem/ Example files].
 
[{{SVN Trunk}}Jmol-datafiles/qchem/ Example files].
  
 +
<br />
  
 
== JME ==
 
== JME ==
Line 243: Line 279:
  
 
See [[File_formats/Chemical_Structure#JME]]
 
See [[File_formats/Chemical_Structure#JME]]
 +
 +
<br />
  
 
== CASTEP ==
 
== CASTEP ==
Line 248: Line 286:
 
: CASTEP is a software package which uses density functional theory to provide a good atomic-level description of all manner of materials and molecules.  
 
: CASTEP is a software package which uses density functional theory to provide a good atomic-level description of all manner of materials and molecules.  
  
 +
<br />
  
 
== FHI-aims ==
 
== FHI-aims ==
Line 253: Line 292:
 
:  FHI-aims (from Fritz-Haber-Institut der Max-Planck-Gesellschaft) is an all-electron electronic structure code based on numeric atom-centered orbitals.
 
:  FHI-aims (from Fritz-Haber-Institut der Max-Planck-Gesellschaft) is an all-electron electronic structure code based on numeric atom-centered orbitals.
  
 +
<br />
  
 
== DGRID ==
 
== DGRID ==
 
Jmol (11.9.14) can read DGRID files; this format is a generalized representations of output from a variety of quantum mechanical calculation packages, including especially ADF (Amsterdam Density Functional).
 
Jmol (11.9.14) can read DGRID files; this format is a generalized representations of output from a variety of quantum mechanical calculation packages, including especially ADF (Amsterdam Density Functional).
 +
 +
<br />
  
 
= Exporting Coordinates =
 
= Exporting Coordinates =

Revision as of 18:55, 30 August 2011

File Formats

File Formats for the Atomic Coordinates of the Molecule

These files are loaded into Jmol by either of the following methods:

  • using the application's topbar menu, File > Open,
  • using the load command in the command line, in the Jmol scripting console, or in a script,
  • dragging and dropping the file onto the Jmol application window

What follows is a collection of specifications and comments on several formats, but is not a complete listing of all formats supported. Feel free to add a subsection with a description of a format you know well.


MOL and SD (Symyx MDL)

MOL = MDL molfile = MOL v2000

SD = SDF = Structure Data Format

SD files share the MOL format but may contain several structures (separated by lines with $$$$), which will be read by Jmol as multiple models or frames.

MOLv3000 = extended molfile or extended connection table

This newer format applies to both MOL and SDF, hasn't got the 1000-atom limit and is also supported by Jmol.

Jmol reads MOL v2000 and v3000, and SD files (and can write them too). Original from Molecular Design Limited, then Elsevier MDL, now Symyx Technologies, widely adopted by many other programs. Contains atom coordinates and bonds. V2000 (the most common) is limited to 1000 atoms.

These formats support formal charges and isotopes; both are read by Jmol. A variant used in PubChem adds partial charges using a custom field <PUBCHEM_MMFF94_PARTIAL_CHARGES>, and this is now also read by Jmol.

There are often MOL and SD files with two-dimensional data (i.e, all atoms have Z=0); Jmol will read them too, but the resulting flat model will not be realistic. The defining tag (2D or 3D) must be located in line 2, columns 21-22, but is ignored by Jmol, which just uses the Z coordinates provided, be they zero or not.

Jmol v.12 has two ways to deal with such flat models:
  • you can run the minimize command after loading; this will apply a simple UFF force field to reach a reasonable 3D structure;
  • or you can load the file using the filter "2D" switch of the load command, in this way:
load myfile.mol filter "2D"
That will load the flat model and, before it is displayed, will add hydrogens if needed and then run a minimization; the result is a 3D model of the input 2D structure with a feasible conformation.

MOL header lines:

  • The first line is reserved for the molecule name and will be so used by Jmol in the popup menu.
  • The second line is in principle reserved for information on the originating program, date, user, etc. (Jmol will ignore this line).
  • The third line is for comments, and may contain an inline script starting with jmolscript:.

Official document (PDF): http://www.symyx.com/downloads/public/ctfile/ctfile.pdf, copied here.

Some extra information on SD files at US EPA DSSTox.

Example files for MOL and SDF v2000 and v3000.


MOL2 (Sybyl, Tripos)

Jmol reads MOL2 files. Original from Tripos. Contains atom coordinates, bonds, substructure information.

This format supports partial charges and isotopes, but only partial charges are currrently supported by Jmol.

A single MOL2 file may contain several structures, which will be read by Jmol as multiple models or frames.

Official document: http://www.tripos.com/data/support/mol2.pdf

Example files.


PDB

Jmol reads PDB files (and can write PDB files under some circumstances). Contains atom coordinates and information on biomolecular residues, sequence, chains, hydrogen and disulfide bonds, secondary structure, biologically relevant sites, cofactors. Can also contain temperature factor, formal charge, element symbol, alternate locations.

This format supports formal charges, and only hydrogen isotopes (as D and T); they are all read by Jmol. If you need support for partial charges, see the similar PQR format below.

Files may contain an inline script starting with REMARK jmolscript:.

(Official Protein Data Bank document) Atomic Coordinate Entry Format. Description: http://www.wwpdb.org/docs.html

Example files.

Special use:

Jmol can get files directly from the Protein Data Bank website. To do this, you put an equal sign (=) just before the 4-character PDB ID code (without any extensions). Note that the unsigned applet is not allowed connection to external servers and so does not support this method.

See also CIF for loading ligands directly from the PDB database.

Example:

load =1CRN

By default, Jmol will connect to the PDB server at http://www.rcsb.org/ To force the use of another server, you can do:

set loadFormat = "  "

and put between the quotes the url in the proper request format; use %FILE at the position where the PDB ID should be inserted into the url. More details

Examples:

set loadFormat = "http://www.rcsb.org/pdb/files/%FILE.pdb"
set loadFormat = "http://www.ebi.ac.uk/msd-srv/oca/oca-bin/save-pdb?id=%FILE"


XYZ

Jmol reads XYZ files (and can write XYZ files under some circumstances). This format was originally from the XMol package, but has been widely adopted by many other programs. Contains only atom coordinates (no bonds) and, optionally, charges and vectors (e.g. for atom vibration). Supports multi-model data (multi-frame, animations).

An extension of this format supports isotopes, and they are read by Jmol.

XYZ header lines:

  • The first line is reserved for the number of atoms.
  • The second line is for comments, and may contain an inline script starting with jmolscript:.

Example by Paul Bourke.

Example files.

Details about the XYZ format.


CIF

Jmol reads CIF files. Crystallographic Information File, the official format from the International Union of Crystallography:

Acta Crystallographica A47: 655-685 (1991)
doi:10.1107/S010876739101067X

CIF files may contain an inline script starting with #jmolscript:.

Example files.

Special use:

Jmol can get files directly from the ligand collection at the Protein Data Bank website. To do this, you put two equal signs (==) just before the 3-character PDB CIF ID code (without any extensions). Note that the unsigned applet is not allowed connection to external servers and so does not support this method.

Example:

load ==2TY

By default, Jmol will connect to the PDB server at http://www.rcsb.org/pdb/files/ligand/ To force the use of another server, you can do:

set ligandLoadFormat = "  "

and put between the quotes the url in the proper request format; use %FILE at the position where the PDB CIF ID should be inserted into the url.


mmCIF

Jmol reads mmCIF files. Macromolecular Crystallographic Information File, an expanded format to cope with macromolecules. Official documentation.

Example files.


Alchemy (Tripos)

Jmol does not fully support reading of Alchemy and Alchemy2000 files. A simple Alchemy reader is implemented starting Jmol 11.7.18. Alchemy example and Alchemy2000 description by Paul Bourke.

A complete specification of these formats would be needed to fully implement the reader. If you have those details, please contact the developers team.

Example files supported by Jmol.


GAMESS

Jmol reads GAMESS files (General Atomic and Molecular Electronic Structure System, by Gordon research group at Iowa State University).

Official documentation.

Example files.


Gaussian

Jmol reads only the output format. Recent versions of Jmol application can also export to files in Gaussian input format.

There are example files of Gaussian input, output and log.


Cube (Gaussian)

Jmol reads Cube files, original from Gaussian software (Gaussian website).

Description of Cube Input and Cube Output formats: http://www.nersc.gov/nusers/resources/software/apps/chemistry/gaussian/g98/00000430.htm

Description by Paul Bourke.

Example files.


GROMACS

Jmol (11.7.47 or later) reads GRO files; coordinates are assumed to be in nanometers; data are read as if a PDB file, with secondary structure assigned automatically and elements inferred from atom names. UREA and SOL are included in the solvent set. Also, force vectors are read and included as vibrations (activated by default on file load).

File format is called gro or Gromos87. Usual extension is .gro

Description of the format.

Example files.

Utilities:

  • You can convert from gro to pdb using the editconf program, which is a part of the GROMACS package that can be run from the command line:
editconf -f whatever.gro -o whatever.pdb
  • You can also use the trjconv utility, included in the GROMACS installation. It will read a GROMACS trajectory file (.xtc or .trr) and output a .pdb file. So:
  1. Use trjconv to generate a .pdb file.
  2. Load the .pdb file into Jmol (using the trajectory option) and animate.


HIV / HIN (Hyperchem)

Jmol reads HIV (or HIN) files, the native format of Hyperchem, a software sold by Hypercube Inc..

Example by Paul Bourke, and other example files.

Details of HIN format.


MOPAC

Jmol reads mopout output files from MOPAC and the new graphf output from MOPAC2007 (.mgf files), which contains coordinates, charges, and molecular orbitals.

openMOPAC, Molecular Orbital PACkage, public domain.

Example files.


PQR

Jmol (11.1.30 or later) reads PQR files. However, since they are very similar to pdb files, they may be misread.

If the file contains a line that starts with "REMARK 1 PQR" (3 spaces before the '1', one after it), it will be interpreted as pqr format. Otherwise, Jmol will interpret it as a pdb file (and so will not get the charge and radii correct). To force the file be read as pqr, add pqr:: just before the filepath in the load command, like this:
load pqr::somePath/someFileName.pqr

pqr is a format based on pdb, where the occupancy is replaced with the atomic charge and the temperature (or B factor) is replaced with atomic radius (however, the column positions in many pqr files do not match those of pdb files). This gives the acronym: P for pdb, Q for charge, R for radius. Jmol interprets the charge values (property partialcharge) and the radii (property vanderwaals), and can hence use them e.g. in color atoms partialCharge and spacefill.

The pqr format has somewhat uncertain origins, but is used by several computational biology packages, including MEAD, AutoDock and APBS, for which it is the primary input format.

PQR format description. Note that APBS reads PQR loosely, based only on white space delimiters, but Jmol may be more strict about column positions.

pdb files can be converted to pqr by the PDB2PQR software, which adds missing hydrogen atoms and calculates the charge and radius parameters from a variety of force fields.

Example files.


Amber

Jmol (11.7 or later) reads molecular dynamics output files from Amber. The fileset must have a structure like:

1 (topology file) + n (coordinate files)

The filter option of the load command can be used, as well as a new option to allow selective "first,last,step" loading of coordinate trajectories.

(This is preliminary and needs testing). You can see an example.

Example files.


V3000 (Symyx MDL)

See MOL.


Q-Chem

Jmol reads files output from the computational chemistry package Q-Chem. See the Q-Chem specific section.

Example files.


JME

This is the native format from JME Molecular Editor (by Peter Ertl).

See File_formats/Chemical_Structure#JME


CASTEP

Jmol (11.8) can read .cell files in the CASTEP format (only periodic boundary conditions).

CASTEP is a software package which uses density functional theory to provide a good atomic-level description of all manner of materials and molecules.


FHI-aims

Jmol (11.8) can read geometry.in files in the FHI-aims format (both cluster models and periodic boundary conditions).

FHI-aims (from Fritz-Haber-Institut der Max-Planck-Gesellschaft) is an all-electron electronic structure code based on numeric atom-centered orbitals.


DGRID

Jmol (11.9.14) can read DGRID files; this format is a generalized representations of output from a variety of quantum mechanical calculation packages, including especially ADF (Amsterdam Density Functional).


Exporting Coordinates

Jmol has a limited capability for writing molecular coordinates, including only the currently selected set of atoms. (This allows to save the whole model, maybe in a format different from the original one, or to extract a subset of the atoms in the original model).

Due to Java security restraints, only the application and the signed applet can write to files on disk. The unsigned applet can however display the exported data in the console window, from where you can copy it, paste into a text editor and save it to file.

Only MOL, XYZ and PDB formats are supported for export. The method is to use the write command (from script or from the console).

The application and the signed applet also allow to save a copy of the loaded file (any format) via the pop-up menu > Save > File.

The application can also export to Gaussian input format using the top menu and a dialog.

Contributors

AngelHerraez, Pbourke