XtalView Guide

advertisement
XtalView, SHELX-97 and LINUX
SHELX Workshop, ACA Meeting, 2000
Duncan McRee, The Scripps Research Institute, La Jolla, California
dem@scripps.edu
XtalView help and questions: ccms-help@sdsc.edu
Contents
Contents ........................................................................................................................................................1
Obtaining XtalView.......................................................................................................................................3
Fitting Features .............................................................................................................................................3
SHELX Fitting Features.................................................................................................................................3
LINUX Installation........................................................................................................................................4
A typical manual fitting session with Xfit.....................................................................................................4
Load the Map ............................................................................................................................................5
Contouring the Map ..................................................................................................................................7
Saving Phases ............................................................................................................................................8
XtalView Tips and Techniques .....................................................................................................................9
The shortcut menu ....................................................................................................................................9
Dictionary ................................................................................................................................................10
Turning the Map on and Off ...................................................................................................................10
Label The Residues .................................................................................................................................10
Go to the N-terminus ..............................................................................................................................10
Moving along the Chain ..........................................................................................................................11
Fitting A Residue.....................................................................................................................................11
Torsions ...................................................................................................................................................11
Save .........................................................................................................................................................12
Real-space Refinement ............................................................................................................................12
Geometry refinement ..............................................................................................................................12
Ending Refinement ..................................................................................................................................13
Refine While Fitting.................................................................................................................................13
Viewing thermal parameters...................................................................................................................13
Control Window......................................................................................................................................14
PostScript Plots........................................................................................................................................15
Raster 3D .................................................................................................................................................16
Model Window........................................................................................................................................17
SfCalc window.........................................................................................................................................18
Semi-automated fitting ...............................................................................................................................19
Tracing the main chain ............................................................................................................................19
Auto Fit Menu Commands .....................................................................................................................20
Auto Fit Strategy .....................................................................................................................................20
Assigning the Sequence...........................................................................................................................22
Editing waters .............................................................................................................................................23
MAD Phasing with XtalView......................................................................................................................23
Nomenclature..........................................................................................................................................24
Data Collection........................................................................................................................................24
Making the Fin Files ................................................................................................................................25
Difference Patterson Maps ......................................................................................................................25
Phasing ....................................................................................................................................................26
Absolute Configuration or "Hand"..........................................................................................................27
Improving the Model ..............................................................................................................................27
Appendix ..........................................................................................................................................29
Quick Start Guide to XtalView................................................................................................................29
Xfit environment variables ......................................................................................................................29
Menu Buttons ..........................................................................................................................................29
Xfit Canvas Window Keys ......................................................................................................................30
X
talView is a GUI software package for protein crystallography. It uses an X-windows interface on
a number of UNIX platforms - SGI, SUN, DEC and LINUX are all supported from a single source
tree. It is supported and distributed by CCMS at the San Diego Supercomputer Center
(http://www.sdsc.edu/CCMS). Version 4 should be available by the time you read this.
XtalView is used for solving structures by MIR or MAD, for de novo model building, fitting between
refinement cycles and modeling. Much of XtalView is covered in the book “Practical Protein
Crystallography”1 , McRee and Israel2, McRee993. The online help is at
http://www.sdsc.edu/CCMS/Packages/XTALVIEW/XV1TOC.html.
In this paper I will discuss the use of the XtalView program Xfit for fitting between refinement cycles.
Most of what is covered is applicable to all refinement programs but here we will specifically discuss
using SHELX-97 with Xfit running on LINUX. However XtalView also runs on SGI’s, SUN and DEC.
Obtaining XtalView
XtalView is free to academics and non-profits through CCMS. Send an email with the message to ccmsrequest@sdsc.edu with the message ‘get xtalview’. Complete instructions will be return emailed to you.
Others may purchase XtalView by contacting the author at dem@scripps.edu.
Fitting Features
•
•
•
•
•
•
•
•
•
•
•
•
Point and click interface
Rapid mouse-based fitting
Built-in FFT to calculate maps internally
Structure factor calculator
On-the-fly omit maps
Shake (jiggle) anti-bias option
Extensive model handling facilities
Automated waters
Fit while refining mode
Raster3D interface
Up to 50 models and 20 maps
Dynamic memory allocation
SHELX Fitting Features
Atomic-resolution structures are becoming more the norm these days. Several features have been added
to XtalView to support SHELX-97.
•
•
•
•
1
Reads SHELX Reflections files (LIST 6 and LIST 5)
Calculates Sigma-A coefficient maps from Fo, Fc
Improved split side-chains
Displays Anisotropic B’s (i.e. ORTEP style)
D.E. McRee (1999), Practical Protein Crystallography, 2 nd Edition, Academic Press, San Diego.
2
Israel, M. and McRee, D.E. (2000) "XtalView" in Crystallographic Computing 7, P.E Bourne and K.D. Watenpaugh, Eds.,
Oxford University Press, http://www.sdsc.edu/projects/Xtal/IUCr/CC/School96/IUCr.html
McRee, D.E. (1999) XtalView/Xfit - A Versatile Program for Manipulating Atomic Coordinates and
Electron Density. J. Structural Biology, 125, 156-165.
3
LINUX Installation
XtalView has been tested with a number of LINUX configurations. If you are installing a new system we
recommend a Pentium 100+ and >= 16 Mb memory, preferably 32 Mb and a 3-button mouse. You need
to install the xview package option and the 75-dpi font package. If you want to recompile, install the
kernel building package for the needed include files as well as the gcc compiler. The X-server should be
Xfree-86 3.2+. We’ve used Red Hat 4.0-6.1 successfully but other flavors can be used if the kernel is 2.0
or higher. In a future release we will support the Crystal Eyes PC which works with PC monitors and
uses the same glasses as Crystal Eyes for the SGI. There are no dials for LINUX. We know of a number
of LINUX issues that occasionally cause problems. Email ccms-help@sdsc.edu with your problem and
we can probably help.
A typical manual fitting session with Xfit
Note: Items that appear on the Xfit windows appear in bold.
To make a PDB file for Xfit, use the WPDB 2 command in SHELXL. (Xfit reads and understands the
ANISOU card.) To make the map either use the LIST 6 with the newest version of Xfit to create a
reflection file or you can use LIST 5 with versions 3.1 or less. You can also use LIST 6 and SHELXPRO
with the [X] XtalView option to generate an XtalView format phase file. SHELXPRO can calculate sigmaA coefficients and you then make an Fo map after reading the file in Xfit. The new version of Xfit can
calculate sigma-A coefficients on the fly, anisotropically scale the data and add a bulk solvent correction.
Start Xfit.
The first thing to do is to make sure you have the correct crystal chosen. Crystals can be entered using
Xtalmgr (before starting xfit).
The crystal contains the unit cell and symmetry information.
To load a model open the Files window. Click on the Windows button with the right mouse button and
select Files…
In the Files window you will see a list of all the files in the directory. If you did not start in the correct
directory, you can use the directory list, below the file list, to change directories or enter the directory on
the Directory line and press enter.
Click on your model in the window and then click on the Load Model button. You can load several
models at once. The slot the model is loaded into is indicated by the As Number field. This number is
automatically incremented after you load each model. If you want to overwrite a model then click this
button back to the model number you want to load and then click Load Model. The program will ask you
to confirm the overwrite.
Load the Map
Although there is a binary map format that X-fit reads (the FSFOUR format4) it is much more
advantageous to take advantage of Xfit’s FFT capabilities and read in a phase file and FFT it to a map. It
is actually takes less time to do this than to load a binary map from a file. There are also other
advantages:
•
•
•
•
You can change the resolution at will
You can change the map coefficients at will
You can make omit maps at will*
You can update the phases after changing the model*
There are two map converters available – map2fs by Kevin Cowtan for CCP4 to FSFOUR map
conversion and CNS2FSFOUR by Chris Putnam for CNS to FSFOUR map conversions for those times
when the coefficients for the map are unavailable or hard to duplicate. See the XtalView web site:
http://www.scripps.edu/pub/dem-web.
4
The starred items can only be done if you are using isotropic B’s and the resolution is below about 1.5 Å.
This is because X-fits uses an FFT that cannot handle anisotropic U’s and like all FFT’s it becomes
inaccurate at very high resolutions. However, at these higher resolutions sigma-A maps usually are all
that is needed - if you are still making decisions about where to place side-chains you are almost certainly
not at very-high resolution.
Load the map by clicking on its name. (If your Xfit isn’t listing .fcf files, add *.fcf to the File Filter field
and click on the List button. Click on the Load Map button. After the phases are read in, the FFT window
will pop up. Use the right mouse button on the Coefficients menu button and choose the coefficients you
want. If the file contains Fo and Fc, I highly recommend using one of the sigma-A map types. At this
time a map will appear in the canvas. If you want to change resolution ranges or coefficients later you
can do so by bringing up the FFT window again.
Note: If you are one of those people who likes to use both a difference map and a 2Fo-Fc map at the same
time, just load the map twice, changing the coefficients each time.
Contouring the Map
You can change the contour levels and box size in the contouring window. If you are going to inspect
your map at the residue level leave the box size at 5 and Auto Contour On Scroll on, and as you scroll
through the map the program will automatically re-contour.
The first level is set to 1 sigma of the map and the second 2 sigma and so forth. When the map is FFT’d
the sigma was scaled to 50. In the messages window the scale was printed so that later you can calculate
the equivalent unscaled map values5.
If you use the Xfit structure factor calculator then your data will be placed on an absolute scale. You
can get electrons/Å-cubed by dividing the contour level by the scale value. Most refinement programs
scale Fc to fit Fo, which are usually on an arbitrary scale. Thus the scale for imported Fc data is also
arbitrary. To get around this either put the Fo data on an absolute scale, or use the Sfcalc/Calculate All
and Scale option to recalculate the scale.
5
Use the Styles menu button to quickly choose some preset contouring options. The default is the Blue
map shown above. Another useful one is Diff Map.
The canvas has a gnomon in the upper left corner that shows the direction of the Cartesian axes. At the
bottom is a scale bar. At the lower right is a shameless advertisement. In the center is a white cross
showing the rotation center. To rotate the picture use the Left mouse button. To re-center use the
Middle mouse button. To bring up a shortcut menu use the Right mouse button. (Note: if you like the
Insight mouse conventions you can choose this option in the Control window.) The Middle mouse button
changes function as you choose different fitting options. The default model color is yellow for carbons
with other atoms colored by atom type. You can change the coloring with the Color window. Near the
top of the screen the cursor changes to a Z to indicate the z-rotation area where the rotation operations
rotate in the plane of the screen and translations are in and out of the screen.
Saving Phases
If you calculate the phases from within Xfit and want to save them for the future or another program, you
can use the map file menu button on the Files window. Click the As Number field to the number of the
map to be saved and then pull down the menu with the right mouse button and choose Save Phases
(*.phs).
XtalView Tips and Techniques
The shortcut menu
The shortcut or canvas menu is reached by using the right mouse button. It comes up wherever the
mouse is positioned in the canvas window. This saves a lot of back-and-forth, back-and-forth mouse
movement. The same menu can be reached from the toolbar as the Canvas Menu. The items on the
menu (moving down the columns from the left):
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Center at - centers at the last pick
Center middle mouse mode
Translate middle mouse mode
Rotate middle mouse mode
Torsion middle mouse mode
Contour maps
Atom - Make last atom picked fittable
Atoms - Make all atoms on stack fittable
Residue - make the last residue picked fittable
Residues - make all the residues on the stack fittable
Group - make the group containing the last pick fittable
Molecule - make the entire molecule fittable
X +90 - rotate plus 90 degrees about the x (horizontal axis)
X -90 - rotate minus 90 degrees about the x (horizontal axis)
Y +90 - rotate plus 90 degrees about the x (horizontal axis)
Y -90 - rotate minus 90 degrees about the x (horizontal axis)
Z +90 - rotate plus 90 degrees about the x (horizontal axis)
Z -90 - rotate minus 90 degrees about the x (horizontal axis)
Toggle stereo mode
Swap regular menu and expert menu (useful in hardware stereo mode)
Dictionary
There are two dictionaries supplied with XtalView: $XTALVIEWHOME/data/dict.polar.pdb which has
hydrogens on the polar residues (i.e. XPLOR), $XTALVIEWHOME/data/dict.noh.pdb which has no
hydrogens and $XTALVIEWHOME/data/dict.full.pdb. The dictionary is a list of residues in PDB format
with some extra cards that define planes and torsions. For alternate conformers each residue has several
consecutive entries. As you cycle through conformers the next example is used to replace the current
residue. If you have a prosthetic group that is not in the dictionary then simply append a PDB file with
the prosthetic group you want with the Load Dictionary menu:
There is also a full hydrogen dictionary supplied. Remember, though, that SHELXL adds hydrogens for
you, and this method is the better way to go as SHELXL will generate all the correct entries in the .ins
file.
Turning the Map on and Off
Use the Show window to turn objects on and off. Here I have two maps which I can either alternate
between or show at the same time by clicking on the entry.
Label The Residues
Open the Label Window, set it up as shown here and click Label Every. This will facilitate identification
during fitting.
Go to the N-terminus
To get to the N-terminus click on the chain whose N-terminus want to go to and then press the “[“ key.
The C-terminus is reached by using the “]” key. (You can also use the Auto-Fit menu if you forget the
key.)
Moving along the Chain
To move to the next residue hit the space bar - this pops you to the next residue and the map will recontour (unless the Auto Contour on Scroll option is off on the Contour window). By hitting the space
bar you can quickly inspect every residue in the model.
Fitting A Residue
In the figure below we see that Val 3 is in the wrong conformer and doesn’t match the density. We’ll
discuss manually moving the residue. This will seem complicated as it is described but after you do it a
few times it will be simple.
Practice fitting a residue - don’t worry it is easy to undo any changes you make - even later after fitting
several other residues.
First click on any atom in Val 3 to place it on the top of the stack. The use the Residue option either on
the shortcut menu or on the main window to make the residue fittable. It will turn green to indicate it is
in fitting mode. Now you can move it with the mouse. If you bring up the shortcut menu with the right
button you can click on either rotate or translate. These connect the middle mouse button to the selected
operation. The left mouse will continue to rotate the viewpoint. Select translate and drag with the
middle mouse button. The green residue will move - the old, yellow version stays in place so that you
know where it was. Move the residue a lot and you will notice that the main-chain bonds will become
red to let you know they are ridiculously long. Move it back and they become green again. Move it a
long ways for now. Now choose rotate and then rotate the residue with the middle mouse again. Note
that the residue rotates around the atom originally picked. (If you need to move the viewpoint center
you can choose center on the shortcut menu to restore the middle mouse to its default mode.) To put the
residue back at the starting position, choose Reset on the main window.
Torsions
To torsion a bond, pick the atoms at the ends, picking the end that will rotate second. Then choose
torsion from the shortcut menu. Now as you drag the middle mouse horizontally, the bond will twist.
For the chi angles of protein side-chains there is a quicker way - just type 1 on the keyboard to set up
chi1, 2 to set up chi2 and so forth. Remember that the input focus needs to be in the canvas window or
else the keystroke is sent to the window with input focus. By using the number keys to set up the chi
angles and dragging with the mouse you can quickly position a side-chain.
When you are finished fitting use either Apply, to keep the changes, or Cancel to return the residue to its
starting position, on the main window. The semi-colon “;” key can be used to apply the fitting. The
semi-colon key means stop everything and it will also halt the refinement option we will talk about later.
Save
Save your model by opening the files window, and entering a name in the Output PDB field and pressing
save. Save often to avoid losing your work. You can also start Xfit with the -auto flag to automatically
save the file every time you do an operation that changes it.
Real-space Refinement
While the manual stuff is fun, after a while you just want the computer to do it. With a residue set up in
fitting mode, select RealSpace Search on the canvas menu. The computer tries all the possible torsions for
that residue to get the best fit to the density. If you have several residues being fit at once, the one last
clicked on is searched.
If you have more than one map, set the map to search against on the Refine window:
Geometry refinement
With a residue selected by clicking on it, choose one of the Refinement options on the canvas menu. The
residues being refined against the dictionary will be highlighted as a cyan color. Refine Range will refine
all the residues between the last two picked and Refine Region will do the last picked plus its two
neighbors. Pressing the space bar will do another round of refinement.
Ending Refinement
Press the semi-colon “;” key to stop all refinement and fitting. You can undo a refinement with
Refine/Undo Last Refi.
Refine While Fitting
This is a very powerful way to fit. In this mode the geometry is refined while you drag atoms around,
giving the model the feel of being mechanically linked. To do this easily you need xfit version 4 and
above (it works in the older xfit versions but it requires more setup). Pick a residue with the mouse.
Open the canvas menu and under Refine pick RealSpaceRefi Region. This sets up the residue clicked on
plus the two neighbors to be geometry refined against the map (if you have several maps loaded, the
map refined against is set in the Refine window). The refineable residues turn cyan. Click an atom – it
turns green. Drag it with the middle mouse and watch what happens. Click another atom and repeat.
Try just wiggling an atom back and forth to do some local annealing. Pressing the space bar will do
another round of refinement. Press the spacebar until you are done and then press the semi-colon “;” key
to end fitting and refinement.
Viewing thermal parameters
A lot of information can be gleaned from the thermal parameters of the model. You can use the new
options on the View window to view either isotropic B’s or anisotropic U’s as 50% probability surfaces.
Click on a range of residues and choose the Pop button next to First Res and Last Res (or type in the
residue numbers) to set up a range (you can do the whole model but it may be slow). Then choose
Thermal Ellipses and click on Make Vu.
Control Window
A number of miscellaneous options have ended up on the control window. On slower machines the
dynamic sliders may be too slow. In this case switch Dynamic Sliders off. Also useful for model building
is the Draw Contacts option which will show all contacts within the distance specified. These contacts are
dynamic and update as you fit residues.
PostScript Plots
The contents of the screen can be sent to a PostScript printer from this window. The PostScript can also
be saved into a file. To make a stereo plot first turn on side-by-side stereo from the View window.
Raster 3D
With the help of Ethan Merritt at the University of Washington, we have developed a Raster3D interface.
It will be necessary to obtain Raster3D from Ethan via his Web page. With this option you can make
pictures composed of lighted cylinders and spheres of cover quality.
Model Window
There are a number of options on the model window but the one that is especially important to point out
to SHELX users is the Split Residue menu item on the Insert Res menu button. To use this, first set up a
residue to be fit (click on it and choose residue on the shortcut menu) and then activate the torsion you
want to split the side chain about.
Now choose Split Residue and the residue will be divided about the torsion and the two halves
numbered A and B and given occupancies of 0.5. One half will become active and you can move it into
the density with the middle mouse.
These are the other menus available on the Model window.
SfCalc window
The SfCalc window is used to calculate structure factors from the model (i.e. Fc, phi). At resolutions
below 1.5 and with isotropic B models, this window can be used to calculate the structure factors for a
model. For other refinement programs we usually keep an empty phase file around and calculate the
structure factors from the model and then FFT the map. The advantage of this is that the structure
factors will be correctly scaled so that the omit map and update options can be used. To make an omit
map you need to do Calculate All and Scale once after reading in the phase data. You may want to turn
the Shake option on as this can help substantially in reducing phase bias. This works by adding a small
number to each atom coordinate (+/- 1/6 dmin) before the FFT (but does not change the model). The
effect of this is to jiggle atoms which have been moved a small amount in order to satisfy the error
residual of wrong piece of structure.
To make an omit map you choose one or more residue and make them current (green). Now choose
Omit Current Atoms. The partial structure factors for the current atoms are calculated and then
subtracted from the structure factors of the whole model and the omit map is FFT’d and contoured.
At higher resolutions, above 1.5, and especially if the B’s are anisotropic, the program cannot do an
accurate structure factor calculation as it uses an isotropic reverse FFT algorithm. However, if you are at
the point of making your B’s anisotropic and are still wondering where side-chains are located you may
want to rethink your fitting strategy!
Semi-automated fitting
Tracing the main chain
Xfit v3.5+ has a new menu added “Auto Fit” which incorporates features for automated fitting
procedures that used to be done one step at a time. The new menu allows very fast fitting in the parts of
the map that are clear allowing you to concentrate on the difficult bits without having to doing the
repetitive fitting tasks on most of the map. By clear we mean that the main-chain density should be
mostly connected at positive contour levels and the side-chains are visible.
Note that some menu items have a character in single quotes that indicate a keyboard shortcut for the
operation. After you become accustomed to the keys you will find this much quicker than using the
menu. Before we explain how to use the menu we need to discuss some notation.
MRK -. A MRK is a residue with a single CA atom used to mark the positions of the Ca atom of a
residue. MRKs are part of the standard xfit dictionary found in $XTALVIEWHOME/data/dict.pdb6. If
you don’t have a dictionary being automatically loaded read the beginning of the xfit man page before
continuing. You must have a dictionary before automated fitting can proceed.
Fragment/Chain - the basic unit of fitting is a string of residues connected together. The criteria for
connectedness for MRK residues (i.e. CA’s) is two successive ones be within 4.5 Ångstroms of each other.
The Model window automatically numbers each fragment and marks the chain termini. The Sequence
menu button on the Model window has a command Fix Chain Numbers that can be used to update the
There are three dictionaries available in the distribution. They differ in the treatment of hydrogen
atoms. One has no hydrogens, one has polar hydrogens only (i.e. hydrogens involved in hydrogen
bonding) and the third has all hydrogens. The default is the polar H dictionary. You can change the
dictionary type in the Model window or you can have a different dictionary load by inserting this line in
.cshrc after you source XtalView.env:
6
setenv XFITDICT $XTALVIEWHOME/data/dict_noh.pdb
to load the no hydrogen dictionary. To load the all hydrogen dictionary change dict.noh.pdb to
dict.allh.pdb. The polar hydrogen dictionary is dict.polarh.pdb.
chain numbers.
Focus Residue - This is the last picked residue either on the canvas or on the Model Window. It shows up
as highlighted in Model window residue list and on the lower right footer of the canvas. All of the
commands discussed here start with the focus residue and use it to identify the chain to be acted upon.
Current map - the map used in the auto fit commands is the one enumerated in the Refine window in the
map number field. The default is map 1. If another is desired it must be changed here before using the
auto fit commands.
Auto Fit Menu Commands
Add CA (MRK) After - Adds a MRK residue to the C-terminal end of the current focus residue (see
above). The “>” key is a shortcut for this operation. If there is already a residue here the user is warned
and given the opportunity to cancel. If there is more than one residue in the fragment the vector
between the last two residues and the density of the current map (see above) is used to find the position
of the residue. The 5 most likely positions are saved and can be cycled through by pressing the space
bar or by the “Cycle CA” option on the menu. For small adjustments or if none of the positions looks
correct, the middle mouse button moves the CA atom in refine while fitting mode keeping the distance
3.8 Ångstroms to the focus residue. The MRK residue is automatically numbered one less than the focus
residue. Lastly the command makes the added residue the focus residue so that the next invocation of
this command will add another residue to the end of the chain before the new one. In maps with clear
main-chain density the command finds the next CA position and no further fitting is needed.
Occasionally the side chain density is stronger than the main chain and in this case pressing the space-bar
will find the correct choice.
Add CA (MRK) Before - Similar to the Add CA (MRK) After except that it adds to the N-terminal end of
the chain/fragment starting at the focus residue. The “<” key is the shortcut.
Next CA Choice - Cycles through the CA choices for the Add MRK commands. The spacebar is a
shortcut to this command.
Cancel Baton mode - Accepts the CA position and ends the refinement mode. The “;” key is the keyboard
shortcut. It is good practice to always issue this command when finished adding to the ends of a
fragment otherwise a later press of the spacebar may have surprising results. Note this command and
the “;” key will accept any fitting operation and any refinement mode so it can be used anytime after a
fitting or refinement operation t accept the results.
Go to Fragment Nter - moves to the N-terminal end of the fragment/chain containing the focus residue
and makes it the new focus residue. Use it as a precursor to the Add MRK Before command.
Go to Fragment Cter - moves to the C-terminal end of the fragment/chain containing the focus residue and
makes it the new focus residue. Use it as a precursor to the Add MRK After command.
Poly-Ala Fragment - Turns all the MRK or VEC residues in the fragment/chain containing the focus
residue into alanines using overlapping pentamer commands. If the best pentamer contains either a
glycine or proline it is kept as such instead of an alinine to prevent inserting alanines in bad positions.
Auto Fit Strategy
Starting a new model - After locating a stretch of density center the position of a CA atom on the cross in
the center of the screen and open the Model window. Set the active model to an unused number, the
Type to MRK, enter a likely residue number (e.g. 100) and select New model command on the Insert Res
menu. A new reidue will appear in the center of the screen. Select this residue to make it the focus
residue by clicking on it with the mouse and then use the “>” key to add a second MRK residue. Since
there is an ambiguity as to which direction the chain should go, the program leaves it up to you to move
it. If you have some indication of the chain direction (see Fig. 3.40 and 3.43) then you can place the
residue in that direction. Otherwise, just guess and reverse the chain direction later if needed. To place
the CA atom use the middle mouse button and drag the residue until is in the desired position. Use the
“>” key again to place the third and subsequent residues automatically.
The program finds the position of the next residue by considering the map density and the direction
between the previous two residues. In considering the map density the program tries to eliminate strong
side-chain paths by considering a second move and adds the two probabilities together to find the final
probability. Thus side-chains are eliminated because they will not provide a good position for the second
move. If the program still finds a side-chain press the spacebar to cycle through the other 4 choices.
Pressing the spacebar 5 times brings you back to the original choice. You can also move the CA position
with the middle mouse button as you did for the second position. In a medium resolution map it is not
possible to precisely determine the CA position so instead concentrate on getting the path right and use
the pentamer fitting followed by real-space refinement to better position the fragment.
Save Frequently! You never know when there is going to be an accident. Open the File window, and
enter a filename in the Output PDB field and then click on Save just below the field. You can also use the
Save button on the main window to write out the file.
To add residues to the N-terminal end of the fragment hit the "[" key to move to the N-terminus and then
type "<" to add residues at that end. When you get to a spot where the map is not clear it is usually best
to move on to another spot and fill in another chain fragment. You'll also want to hit the Symm Atoms on
the main window occasionally to look for symmetry related residues to make suer you don't trace the
same density twice. As you add you will find that a fragment has the wrong chain direction. It can be
easily reversed by selecting a residue in the fragment and then in the Model window issue the Reverse
Chain containing Selection on the Sequence menu button.
Putting fragments into order - In the best case you can find the entire main chain with one fragment.
However, this is rarely the case. Since you added the fragments arbitrarily they will be appear in the
model window in no particular order. In general this is no problem unless you want two fragments to
join and they are won't because they are in the wrong order. To fix this you can renumber one of the
fragments. Click on the first residue in the fragment to be renumbered in the Model window, enter the
new number in the Name field for the start of the chain, and then issue a Replace Name command from
the Sequence menu button followed by a Renumber Chain. After you have your fragments numbered as
you like, you can issue a Sort by Sequence Number command and resort the fragments into ascending
order by number. You can also reorder fragments by cut and pasting if for some reason you don't want
to change the numbering system you are using. To do this, first select the Non-Exclusive check-box. This
lets you select more than one residue at a time. Highlight all the residues in the fragment in the list by
clicking on them, and then use Cut selected into buffer on the Insert Res menu button to put the fragment
into the residue buffer. Then select the residue at the end of fragment you want to paste in front of and
use Paste buffer after to put the residues back in the list. You can also Paste buffer before by selecting the
appropriate residue. Note: the cut and paste commands can be very useful during model building to
shuttle residues back and forth between models. By changing the Model # field between the cut/copy
and the paste you can cut or copy residues between models as well as within models. For instance, you
can cut a fragment out of a related protein and save a lot of fitting effort. You can then rename the first
residue to the correct sequence number and renumber the chain to put it in the correct sequence order.
Reversing the chain- If you get a chain in reverse order its simple to reverse it. Open the Model window,
select a residue in the fragment and then issue a Reverse Chain containing Selection command from the
Sequence menu button. This command works very well at the MRK stage but later it will all your
residues with the N and C terminii facing the wrong way.
After the trace is complete or you get stuck and can't go farther, the fragment is poly-Ala’ed to form a
backbone. The poly-ala command builds overlapping pentamer fragments from the N-terminal end of
the fragment to the C-terminal end. It is best to get the trace fairly complete before you do this step.
You can't undo it, so save the MRK trace first. Now select a residue and from the Auto Fit menu button
on the Toolbar, issue a Poly Ala Fragment and the program takes over. After it is finished the MRKs will
be replaced with a poly-Ala backbone. You are now ready for the next step - assigning sequence.
Assigning the Sequence
Sequence file. Before you can assign the sequence you need a sequence. You can almost always find
your sequence on the Web using one of the sequence databases and then cut and paste this into a file
using your browser. Give the file the extension .seq and edit the file to check the format. The format xfit
wants is as follows. The first line is the title. Subsequent lines should contain the sequence in singleletter in upper case. Numbers and punctuation will be ignored. That's it. Read this file into xfit using
the File window. Put the name in the Sequence File field and push Load Sequence. In later sessions, you
can put the sequence file on the xfit command line and as long as it has the .seq extension it will be
loaded as a sequence file. Hint: since there isn't anyway to enter in sequence numbers in the sequence
file, xfit numbers the sequence starting at the first residue. If you want to change the numbering you can
insert some dummy residues (e.g. A) at the beginning of the sequence or in a gap.
Identify sequence. Now you have to find at least one residue that you can identify in the sequence.
Sometimes you can identify one by a special position such as binding a prosthetic group. For instance,
the histidine ligand to a heme group. Otherwise you need to look for a pattern of residues that you can
identify by shape and size. Usually, the easiest to identify are the large residues such as a tryptophan.
The core is the most reliable place to assign sequence as surface residues are often disordered or given a
haircut by solvent flattening. Look for a repeating pattern and then try to identify that pattern in your
sequence. remember the chain might be reversed so try the match in both directions. If you are still
stumped or aren't sure about what a residue should look like anyway you can always just try some
guesses. Its very easy in xfit to change the sequence around so you can try different possibilities.
Assign sequence. Having a candidate residue click on it to select it. Open the sequence window using
the sequence Toolbar/Auto Fit/Set Sequence... and scroll down the sequence to find the corresponding
amino acid in the list and click it to select it. Now click on Change and Fit to Density and stand back. All
the residues in the fragment will be converted according to the sequence and then all the rotamers tried
to find the best match to density. You can do this as many times as you like. Just remember to save any
models you want to keep. This way you could, for example, try out all the tryptophans in the sequence
and then look along the model to see how it matches the sequence.
Editing waters
Two shortcut keys have been added specifically to make editing the water list faster to encourage users
to actually look at the waters. Note that in the majority of cases editing the water list and removing bad
waters will cause your R-free to drop even if there is a very small increase or no change in the overall Rfactor. A bad water is one in which there is no density, the density is not shaped like a water, or the Bvalue is higher than say 80.0. For high B-value waters with sensible density you can try halving the
occupancy.
•
Shift+W – Adds a water where the cursor is and at the center of the slab. The new water is in fitting
mode and its position can be adjusted using the middle mouse button. To quickly position the water
leave the refine window up in a corner and after the water is added click on the Translation button to
real-space refine the water into the center of the peak. Use a thin slab (5 Å) to get the out-of-screen
direction close enough for the translation to pull it in. Use the semi-colon key to end the water fitting
or go on to the next one and use the semi-colon key at the end of the water adding sequence.
•
Shift+D – Deletes the last picked water/residue. Click on a water and then issue this command. If
it’s a water the program deletes it. If it’s a residue the program asks for confirmation of the delete.
You can undelete a water by going to the model window, finding the deleted water and deleting it
again to toggle the DEL flag.
MAD Phasing with XtalView
With the proper treatment, xheavy can be used to phase MAD experimental data. No modifications are
needed to the program and the method used is similar to that used by others to phase MAD data by
treating it as a special case of MIR phasing. The anomalous differences, between Bijvoet pairs, are treated
as native anomalous, and the dispersive differences, between wavelengths, are treated as isomorphous
pairs. Care must be taken to treat the member of the dispersive pair with the smaller value of f' as the
native while the other is the derivative.
Nomenclature
First some nomenclature. MAD experiments are done by measuring 3 or more wavelengths. The 3
wavelengths are identified by an XAS scan at the absorption edge of the anomalous scatterer in the
crystal:
•
PI - point of inflection of the edge, minimum of f'
•
PK - peak of absorption above the edge in energy, maximum of f'', intermediate f’
•
RH - remote edge above the edge in energy, maximum of f'
•
RL - remote edge below the edge in energy, maximum of f'
The 4th wavelength, RL, is sometimes measured below the edge. Its contribution to phasing is minimal,
because it has no anomalous signal and its dispersive differences are redundant with RH, although it may
be used to increase redundancy. It would be handled identically to RH except that it will have no
anomalous signal. More useful may be another remote edge above the edge in energy, which would
have both a dispersive difference and an anomalous signal and increase data redundancy.
If the MAD signal is strong, a decent MAD experiment can be done with just two wavelengths, PK and
RH. Both wavelengths have a good anomalous signal and there is a dispersive difference in f’ between
the two.
Always collect PK first. It has the best signal for solving the heavy atom positions and in the best of cases
you can solve a structure with just this one wavelength. The second wavelength to collect is then RH as
these two wavelengths suffice to make a minimal MAD experiment. Next, collect PI, as it has the largest
change in f’, and collect RL only if there is plenty of time. It is probably better to get more redundancy at
the other wavelengths than to collect RL, though.
Data Collection
The best way to collect the data seems to be to simply collect 3 data sets on a frozen crystal with exactly
the same strategy. Use inverse beam strategy to collect the Bijvoet pairs in lower symmetry spacegroups.
In higher symmetry spacegroups it is usually trivial to get the Bijvoet pairs on a large detector face. Make
sure you collect enough data to get about 95% completeness on the Bijvoet pairs. Offset the crystal that it
rotates about 20º off an axis to up the completeness by avoiding the blind zone overlapping an axis. We
monitor the data by integrating it continuously as it comes off the machine to find the point at which the
data is complete and to make sure we are getting good R-symms. The quality of the data is the ultimate
key to MAD phasing. Additional redundancy is very helpful as the signal is small and averaging over a
large number of reflections is helpful. We have tried strategies to collect the data in blocks and find this
unnecessarily complicates data collection, reduction and phasing. Our best results have come from the
simplest strategy with the highest redundancy. We have seen large improvements in map quality due to
increased redundancy and high redundancy is needed for the best results in finding large number of
selenium positions in larger proteins by allowing the selection of the best Bijvoet differences. R-symms
should range from 2-3% for the low resolution data to 10% for the high resolution with an average of
about 6% (without merging Bijvoets). Scale the Bijvoets together, but do not merge them. Output the
data as I’s.
Making the Fin Files
Always run the data through xprepfin, even if you think you have a fin format file to avoid bad
reflections and problems with centric vs. acentric, etc. If you are using scalepack, xprepfin can read the
output file and convert it to a fin file. If you made I’s as suggested above, then select the Convert I to F
option. Also select the Data are: Bijvoet Pairs option:
You should have one fin file for each wavelength collected. For the standard 3 wavelength MAD
experiment these will be pk.fin, pi.fin and rh.fin
Difference Patterson Maps
The Bijvoet differences are treated as would any anomalous scatterer. Make the Patterson maps as usual
with xfft. In xtalmgr, slect xfft as the application. Click on pk.fin as Input Argument 1, click Auto Name
Output to set the output file to pk.map, and then use Add Args followed by Run Command. This should
start xfft with pk.fin as the input and pk.map as the output. Input the Resolution Filter – I usually choose
5-3.5 Å and set the Outlier Filter to 30%. If you have the newer XtalView 4 you can also choose Remove
Patterson Origin which helps if there happens to be a peak near the origin. It also lets you see the size of
the largest peak printed in the message window. You can change the resolution and outlier filters to
maximize this number to get the best Patterson map. Generally, the highest peak will be somewhere
above 400 (the map is scaled such that 1 sigma or 1 rms is equal to 50), although this is also dependent on
how many sites you have; the more sites the smaller the size of each one. View the Patterson map with
xcontur. In the xtalmgr, pick xcontur as the application, click on pk.map as the input argument 1, and
click Add Args, Run Command.
The dispersive differences need to be merged with xmerge. Merge the pairs in this order using xmerge:
Wavelengths
Input file 1
Input File 2
Output File
PI-PK
Pi.fin
Pk.fin
Pi.pk.fin
PI-RH
Pi.fin
Rh.fin
Pi.rh.fin
PK-RH
Pk.fin
Rh.fin
Pk.rh.fin
The largest signal will be in the PI-RH pair for the dispersive differences and in PK for the anomalous
differences. All the Pattersons should look basically the same. All 3 wavelengths, PI, PK and RH will
have an anomalous signal with the largest signal at PK. If you can't see peaks in the Patterson maps at
low resolution - say 5 Å, then there probably isn't any need to proceed. However, with well measured
data, this should not be a problem. We have seen sulfur atoms, which have a very small signal, in Bijvoet
difference Pattersons.
Solve the Patterson with SHELX and Shake-n-Bake if the Patterson is complicated. Actually, you can
skip making all the Pattersons if you know how to run these programs and interpret the output, as they
can tell you about the signal. I like to take the solution from these programs, put them into xpatpred,
write out a predictions file and then overlay this label file onto the Patterson with xcontur (use the File…
button in xcontur and load the prediction file you saved from xpatpred as a Labels file). The labels
should fall onto the peaks in the Patterson. This gives you the confidence to go on with the next steps. If
you have large peaks, and they are all accounted for by the solution, the MAD phasing cannot fail. If you
have some peaks not accounted for, but the majority are, the best course is probably to shove on and
with a partial solution and use these phases to make Bijvoet difference Fouriers with pk.fin and
xmergephs to pick up the additional sites later.
In general, if there isn't anything in the Patterson map for a particular wavelength or wavelength pair,
then it is not worth including this particular file in the phasing.
Phasing
Now you are ready to phase. Start xheavy and use the editor to add each pair. You will have 6
"derivatives": PI_ano, PK_ano, RH_ano, PI-PK, PI-RH and PK-RH. In order to avoid overestimating the
figure merit each derivative should be weighted correctly. Weight each derivative at 33%. This way, the
problem reduces to 1 isomorphous derivative plus one anomalous derivative. If you have an RL dataset,
lower the weights to 25% since you now have 4 identical solutions. Failure to downweight will hinder the
solvent flattening step by overestimating the figure of merit and not giving any room for improving the
phases. All the derivatives share the same sites - call them all atom type PT (the default - in practice the
subtle difference between different atom types structure factors are far below your noise levels) and set
the B's to 10.0. Refine the derivatives as usual. Although, they all share the same sites, letting them all
refine individually gives the best phases. Sometimes, one of the derivatives refines poorly – usually the
Patterson also looked rather poor. Delete it from the list. Poor derivatives do more harm than good.
Start at 5 Ångstroms and then raise the resolution limit to whatever limit you feel the signal continues to.
Be honest here- adding lots of high resolution but poorly phased data will not help the maps. One
problem is what to call the native F. If you have a data set you plan to use as the native for refinement
then include this on the master fin file field, or just leave this line blank.
Once refined you can calculate the protein phases. During phasing you get several useful statistics. One is
the R-anomalous of the 25% largest Bijvoet differences for the anomalous pairs. This should be 40% or
less. 25-30% is ideal. The R-centric can be used for the isomorphous/dispersive pairs. These should be
60% or less, although this can be in the 70’s and still give useful information. In the 50's is ideal. Another
thing to check is whether the acentric and centric scale factors agree to within about 20%. The differences
in the values of f' and f'' are all automatically scaled by xheavy. The phasing power for the dispersive
pairs is usually surprisingly high if you are used to heavy atom derivative phasing. This seems to be
because the errors are quite low since the same crystal is used for all measurements. The final figure-ofmerit in our cases is usually around 0.6. This would seem low for a normal MIR experiment but the
equivalent MAD maps are much better.
Absolute Configuration or "Hand"
At this point you MUST try BOTH alternate heavy atom configurations, also called "hands". The correct
configuration is ambiguous in the Patterson function and cannot be found a priori. To try the alternate
invert each atom around the center - i.e. change the sign on x,y,z to -x,-y,-z. (In XtalView4 you can do this
automatically by opening the Options… window in xheavy and changing the Heavy Atom Configuration to
–1. Also change the name of the output phase file or you will overwrite it J I call the files h1.phs and
h2.phs). Make both maps and examine them in xcontur. In one you will see clear solvent channels and
protein features and the other map will be junk. If you can't see a difference backtrack and check all the
steps. Before closing heavy, don’t forget to save the solution with Save Derivative. All the output goes
into a file named xheavy_####.log, where #### is the process id. You might want to rename this to
something you can remember for later use in writing the phasing table in the paper.
Improving the Model
At this point its worth looking for minor sites (unless the maps are so good you don't need to). What
minor sites you say? Why the sulfurs of course. The sulfurs in the protein will have no dispersive signal
as they have no absorption edge in the range useful for MAD. However, they do have a small but
significant anomalous signal, especially in Fe MAD experiments, and sometimes at the higher energy
selenium experiments. It is also very common to pick up a zinc, copper or magnesium ion that you were
unaware of perviously at this stage. By making a Bijvoet difference Fourier map using xmergephs (check
both Swap F1 and F2 and Add 90 to Phase check boxes), the PK.fin dataset and the current protein phases
you can look for peaks by making a 5-4 Ångstrom Fo-Fc map in xfft and then using xcontur and add these
to the anomalous solutions. Then recalculate the phases. This Bijvoet difference Fourier map is also very
useful for finding the sulfurs in cysteines and methionines to verify your sequence assignment and
finding sequence markers. We have been using this technique with great success for a number of years in
all our structure solutions. Even though the resolution at which the sulfurs can be seen is usually quite
low, 5-4 Angstroms, this is more than sufficient for locating the sulfur containing side-chains for
sequencing purposes.
One note. The site of the scatterers will be the worst spot in the map. The reason for this is technical but
has to do with all the errors piling up at the spot where the heavy atom is located since it effectively
defines the origin of the phasing. They usually come back with solvent flattening and in the first map
after refinement.
Now you are ready to fit the maps! Look at the map by loading the phase file from xheavy into xfit and
choose Fo*fom as the map coefficients in the FFT window. E-mail us with difficulties and/or questions.
We have solved numerous proteins using MAD phasing with Fe, Cu, selenium and bromine edges. It
seems complicated the way its described because I’ve put in all the details an XtalView novice might
need, but once you learn it, its very quick to do. We often knock the structure off at the beam line – at
least enough to verify that the maps are good. If you have diffraction data better than 2 and phases to
about 2.5 Å, you can often build much of the model using wARP, or you can use the Autotrace functions
in xfit to knock out a preliminary model in an afternoon.
Solvent Flattening
We use DM to solvent flatten our maps if needed. Many MAD maps are good enough to use unflattened
and while solvent flattening increases the contrast in the protein region, it often flattens sidechains on the
surface. In any case, look at both maps.
f2mtz hklin cua.phs hklout cua.mtz <<eofcell 34.9 70.60 53.50 90 98.12
90symmetry 4labout H K L FP FOM PHIBCTYPOUT H H H F W PFORMAT
'(3F4.0,2F9.2,F7.2)'ENDeof
We ran DM with the input file:
#This part adds the high resolution F's to the mtz file created
# before
mtzutils hklin1 hires_F.mtz \
hklin2 cua.mtz \ hklout hires_cua.mtz <<eofsymm
p21INCLUDE 1 FINCLUDE 2 FOM PHIBRUNeof#
#Now we run DM with solvent masking, histogram matching
# and symmetry averaging.
dm HKLIN hires_cua.mtz hklout dm_hires_aver.mtz \histlib
/nfs/image/usr/local/ccp4_3.3/lib/data/hist.lib <<eofsolc 0.45mode solv hist
averncsmask nmer 2wang 8 0aver refi
# the identity matrix for the first moleculerota matrix 1.00 0.00 0.00 0.00 1.00
0.00 0.00 0.00 1.00
trans 0.0 0.0 0.0
aver refi
#the matrix and vector from xfit's LSQ Fit function
#for the second molecule
rota matrix 0.051060 -0.364196 -0.929970 0.164497 -0.915385 0.367570 -0.985096
-0.171713 0.013137trans 10.930595 11.009571 39.851776
ncycle 20
combine free
LABIN FP=F FOMO=FOM PHIO=PHIB
LABOUT PHIDM=PHIDM FOMDM=FOMDM
eof
DM has the nice feature that it calculates all of the needed masks automatically including the noncrystallographic symmetry mask. At this point we put the phases back into Xfit using the script:
mtz2various hklin dm_hires_aver.mtz hklout dm_hires_aver.phs << eoflabin FP=FP
FOM=FOMDM PHIB=PHIDM OUTPUT USER '(3I4,x,F7.2,3x,F7.2,3x,F7.2)'END eof
Appendix
Quick Start Guide to XtalView
Before starting XtalView the environment must be set up. This is best done by placing the environment
variables and path in your .cshrc file as described at the start of the XtalView User’s Guide. Alternatively
you can source the file XtalView.env included with the standard XtalView distribution. However, each
user should set up their own crystal database directory as described in the XtalView User’s Guide. On
some machines the font path needs to be amended to include the XtalView fonts. This is done with the
fonts command. Now start Xtalmgr by typing xtalmgr&. If you have trouble starting Xtalmgr, send
email to ccms-help@sdsc.edu with a description of the problem and the error message (cut and paste the
error into your email).
Xfit environment variables
•
XFITPENTDIR - the location of the pentamer database - normally $XTALVIEWHOME/data/pbdvec
•
XFITDICT - the location of the dictionary to use. Two choices are in the standard release:
$XTALVIEWHOME/data/dict.pdb which has polar hydrogens a la XPLOR or
$XTALVIEWHOME/data/dict.noh.pdb.
•
PRINTERCOMMAND - use to specify a different printer command from lpr such as lp.-dprinter.
Menu Buttons
. Any time a button has a downward-pointing triangle like this,
A menu button looks like this:
pressing the right-button on the mouse over this button will bring up a menu of options. Missing this
fact will hide much of XtalView’s capabilities!
Xfit Canvas Window Keys
• Canvas window must have focus
• Keys are case sensitive
ESC
Toggle hardware stereo or exit photo mode
1
Activate Chi-1 torsion of side-chain if fitting
2
"
Chi-2
3
"
Chi-3
4
"
Chi-4
5
"
Chi-5
6
"
Chi-6
f
Activate phi torsion between focusres and previous residue
s
Activate psi torsion between focusres and next residue
9
Decrease stereo blank interval (use with 0 to adjust stereo height alignment)
0
Increase stereo blank interval
(
Decrase stereo y stretching (use with ')' to adjust stereo aspect ratio)
)
Increase stereo y stretching
b
Go back one residue in chain
c
Next side-chain rotamer
D
Delete residue (asks to confirm if not water)
h
Rotate about vertical -1 degree
H
Rotate about vertical -10 degrees
i
Zoom in 5% (makes model smaller)
I
Zoom in 9%
j
Rotate about screen perpendicular -1 degree
J
Rotate about screen perpendicular -10 degrees
k
Rotate about screen perpendicular +1 degree
K
Rotate about screen perpendicular +10 degrees
l
Rotate about vertical +1 degree
L
Rotate about vertical +10 degrees
n
Rotate about horizontal +1 degree
N
Rotate about horizontal +10 degrees
o
Zoom out 11% (makes model larger)
O
Zoom out 15%
Q
Calculate least-squares plane of atoms on stack
q
calculate distance of top atom on stack to plane calculated with Q
p
Toggle photo mode (fullscreen)
r
Rocking off
R
Rocking on
u
Rotate about horizontal 11 degree
U
Rotate about horizontal -10 degrees
W
Add water at cursor position
x
Look down x axis
y
Look down y axis
z
Look down z axis
+
Measuring cross at screen center (shows 1.54, 2.7, 3.8 Å distances)
.
Small (default) cross
>
Add MRK residue (single CA atom) after focus
<
Add MRK residue (single CA atom) before focus
]
Go to C-terminus of focus chain
[
Go to N-terminus of focus chain
;
Apply fit and/or Refinement (use Cancel button on main window to cancel)
SPACE
Go to next residue
Notes:
The focusres (focus residue) is the last one clicked on and is listed in the lower left of the canvas window
footer.
The focus chain is the fragment with the focus residue. Chains terminate if the distance to the next
residue is too long to form a peptide bond.
Download