Eric D. Watt, J. Patrick Loria
ISOTEC® Stable Isotopes 2008 - 2010 Catalog
Protein structure determination by solution NMR spectroscopy has long relied on uniform stable-isotope enrichment with 13C and 15N to alleviate resonance overlap and to allow multiple distance and angular restraints to be measured at as many atomic sites as possible, which facilitates computing the optimal three-dimensional structural model.1 Recently, the optimization of these labeling techniques has increased the range of protein sizes amenable to study, enhanced the quality of three-dimensional structures, and simplified the analysis of experimental data.2 Similarly, the field of protein dynamics has benefited from advances in isotopic labeling techniques that have allowed researchers to study the motional properties of ever larger proteins over a broad range of timescales while more accurately describing the protein motions. In many ways, improvements in isotopic labeling for dynamics have mirrored those used in structural studies. However, spin-relaxation experiments designed for studying protein dynamics have their own unique requirements for residue labeling, requiring special developments in isotope enrichment techniques better suited for these demanding studies.
Solution NMR is a powerful method for characterizing protein motions over a wide range of timescales by measuring the relaxation rates of the desired nuclei. The design of these relaxation experiments, as well as the analysis and interpretation of the data, is significantly simplified if the protein position of interest can be treated as an isolated spin pair. In such cases, pulse sequence design is not overburdened by the need to account for and manipulate multiple undesirable coherence pathways, and the relaxation rates are obtained straightforwardly from monoexponential decay profiles of the peak intensities. However, the presence of multiple large one-bond couplings can needlessly complicate experimental results through multiexponential relaxation pathways and signal-to-noise degradation. Because of this, much of the work on labeling schemes has aimed to isotopically label different isolated spin pairs within proteins such that one-bond scalar (J) couplings do not pose an experimental roadblock.
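To illustrate why a monoexponential decay is convenient to analyze, the following is a minimal sketch, not a procedure taken from the cited work. It assumes peak intensities follow I(t) = I0·exp(-R·t) and recovers the rate R with a log-linear least-squares fit. The function name, the delay values, and the synthetic noise-free intensities are all hypothetical and chosen only for illustration; real data would normally be fit with a weighted nonlinear method and an error analysis.

```python
import numpy as np


def fit_monoexponential(delays, intensities):
    """Fit I(t) = i0 * exp(-rate * t) by linear least squares on log(I).

    Returns (rate, i0). Assumes strictly positive intensities, which holds
    for an ideal monoexponential decay but not necessarily for noisy data.
    """
    t = np.asarray(delays, dtype=float)
    y = np.asarray(intensities, dtype=float)
    if np.any(y <= 0):
        raise ValueError("log-linear fit needs strictly positive intensities")
    # log(I) = log(i0) - rate * t, so the slope of the line is -rate.
    slope, intercept = np.polyfit(t, np.log(y), 1)
    return float(-slope), float(np.exp(intercept))


# Hypothetical, noise-free example data (relaxation delays in seconds).
delays = np.array([0.01, 0.05, 0.1, 0.2, 0.4, 0.8])
true_rate = 2.0  # s^-1, an arbitrary illustrative value
intensities = 100.0 * np.exp(-true_rate * delays)

rate, i0 = fit_monoexponential(delays, intensities)
print(f"fitted rate = {rate:.3f} s^-1, fitted I0 = {i0:.1f}")
```

A multiexponential decay, as produced by additional one-bond couplings, would not fall on a straight line in log(I) versus t, which is one simple way to see why a single fitted rate stops being meaningful in that case.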
To date, most dynamics studies have been performed using uniform 15N enrichment. 15N is a good target for a variety of reasons. The nitrogen needed for cell growth can be supplied through 15N-enriched minimal or nutrient-rich growth media that are readily available, allowing for easy sample preparation. Uniform 15N labeling results in an isolated spin system (1H-15N) that lends itself well to relaxation experiments. Every 15N position, whether in the protein backbone or sidechain, is separated from any other 15N atom by at least two bonds. Therefore, there are no 1JNN couplings that could lead to complicated multiexponential relaxation behavior, which would be difficult to measure accurately and would cloud the interpretation of the associated motions.
However, 15N enrichment by itself does not provide a complete picture of the motions a protein undergoes. Nitrogen makes up only 1/3 of the protein backbone and is present in the sidechains of only 6 of the 20 amino acids. Therefore, apart from these few sidechain positions (Asn, Gln, His, Trp, Lys, and Arg), 15N relaxation experiments do not provide enough protein-wide coverage to give a full picture of sidechain and backbone motions. In particular, amide relaxation may be relatively insensitive to motions in the hydrophobic core of the protein. It has also been noted that certain motional modes of the protein backbone will not be detected by monitoring relaxation at the amide positions. Nonetheless, the simplicity of 15N protein labeling, the powerful experiments that exist, and the sensitivity of nitrogen to structural, electrostatic, and hydrogen bonding effects make 15N an essential part of any dynamics study.
On the surface, carbon relaxation experiments provide many additional opportunities for molecular dynamics studies by NMR spectroscopy. The abundance of carbon in each amino acid provides still more probes of protein dynamics. These carbons are found in both the backbone and the sidechains, providing information on dynamics throughout the entire protein. Methyl groups, typically buried in the hydrophobic core, are particularly suited to provide information on the dynamic contribution to protein folding and stability. The dependence of 13C chemical shifts, particularly Cα, on the protein secondary structure is more clearly understood than that for the amide nitrogen, allowing for easier interpretation of chemical shift changes retrieved from certain spin-relaxation experiments.
Unfortunately, ideal carbon labeling methods are not as straightforward as those for nitrogen. The broad coverage of protein dynamics offered by the abundance of carbon in the protein is the same feature that poses the biggest problem to its study: the large number of carbon atoms means that almost all carbons are adjacent to another carbon atom. Therefore, uniformly 13C-labeled protein samples exhibit large one-bond 13C-13C couplings for many residues, making straightforward interpretation of relaxation data difficult or intractable in many cases. The only position that does not suffer from the aforementioned 1JCC couplings is the methionine Cε methyl group, which is isolated from the rest of the protein by the sulfur atom. Thus, in a uniformly 13C-labeled protein there are limited opportunities for dynamics studies. While methionine relaxation data may be extremely useful, they do not provide the level of dynamic coverage that could be obtained from a more complete carbon labeling scheme. Because of this, many isotope labeling methods that take advantage of the known bacterial metabolic pathways have been developed.
One method that was used to isolate 13C labels focused on partial labeling using 15% 13C-acetate with the remainder being 12C.3 This method results in dilution of the 13C labels to sufficient levels to make relaxation experiments feasible, though with a concomitant reduction in signal-to-noise due to the reduced fraction of labeled protein.
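The trade-off in this dilution approach can be sketched with simple arithmetic. The example below is a rough illustration under a strong assumption that is not stated in the source: each carbon site is taken to be 13C independently with the same probability f, which ignores how metabolic pathways actually distribute the label. Under that assumption, a 13C site with n directly bonded carbons has no 13C neighbor with probability (1 - f)^n, while the signal from that site scales with f relative to a fully labeled sample. The function names are hypothetical.

```python
def isolated_fraction(f_label, n_carbon_neighbors):
    """Probability that a labeled carbon has no 13C among its bonded carbons.

    Assumes independent, random incorporation at fraction f_label per site
    (an idealization; real metabolic labeling is not fully random).
    """
    if not 0.0 <= f_label <= 1.0:
        raise ValueError("f_label must be between 0 and 1")
    return (1.0 - f_label) ** n_carbon_neighbors


def relative_signal(f_label):
    """Signal from a given site relative to a 100% 13C-labeled sample."""
    return f_label


f = 0.15  # the 15% 13C level mentioned in the text
for n in (1, 2, 3):
    print(
        f"{n} bonded carbon(s): "
        f"{isolated_fraction(f, n):.1%} of labeled sites are isolated, "
        f"relative signal {relative_signal(f):.0%}"
    )
```

In this idealized picture, most labeled sites are free of one-bond 13C-13C couplings at 15% enrichment, at the cost of retaining only 15% of the signal per site, which is the reduction in signal-to-noise noted above.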
By using [3-13C] pyruvate as the sole carbon source, 13C labeling of Leuδ, Valγ, and Ileγ methyl groups can be achieved at >90% incorporation levels.4 More importantly, the isotope label is not scrambled to directly bonded carbons to any great extent, allowing relaxation measurements for these residues. Thus, good signal-to-noise and monoexponential relaxation behavior are observed for these methyl positions. Like methionine, these residues allow insight into the dynamics of protein motion in the hydrophobic core.
The use of α-keto acids also provides a cost-effective way to produce 13C-methyl labeled amino acids that is also compatible with high levels of deuteration at other carbon sites.5,6 Deuteration allows dynamic studies to be performed on much larger protein systems than otherwise possible. Another benefit of this approach is that there is minimal scrambling of the 13C labels thereby minimizing the aforementioned problems.
The use of 1- or 2-positionally labeled glycerol with an auxotrophic cell line allowed alternating labels to be incorporated for the majority of carbons throughout the protein sidechains. Typically, two protein samples are needed to obtain as complete coverage of atomic positions as possible.7 Aromatic residues can complement the data obtained for methyl groups, as they are typically found in the hydrophobic core as well. Specific labeling in these residues is especially important given the strong J-couplings as well as the small range of chemical shifts. Early studies showed that growth on [2-13C]-glycerol results in isotope enrichment at alternating carbons in most amino acids, including isolated aromatic carbons in Phe, Tyr, and Trp. Growth on [1,3-13C]-glycerol gives the opposite labeling pattern. An alternative is to use [1-13C]-glucose as the sole carbon source, in which case aromatic rings are labeled at Pheδ, Tyrδ, Hisδ2/ε1 and Trpδ1/ε3.8 [1-13C]-glucose is beneficial because it is not only more affordable but also produces higher protein yields.
More recently, it has been shown that [1-13C]-glucose will also lead to ~45% enrichment of the methyl groups of Ala, Val, Leu, Met and Ile, with each labeled methyl separated by two or more bonds from other 13C-labeled atoms.9 In the same study, expression with [2-13C]-glucose as the only carbon source was shown to lead to 20-45% enrichment at Cα positions with no C' sites labeled, and only Leu, Val and Ile Cβ sites labeled. This labeling allows CPMG relaxation experiments to be run on the Cα positions, providing a wealth of data that complements those obtained from the more common 15N CPMG experiment.
Finally, isotopic labels can be incorporated quite specifically by introducing the desired labeled amino acid directly into the growth media.10,11 Typically, to avoid scrambling and dilution of the label, the desired amino acid is included in a mixture containing all the other amino acids in unlabeled form. Cell growth is therefore quite robust owing to the nutrient-rich nature of the growth medium. Depending on the position of the desired label, however, synthesis of the labeled amino acid may not be straightforward and may be quite time consuming. This technique has proved useful in dynamical and structural studies and has recently been used in NMR structure determination of large proteins.2
As more and more bacterial metabolic pathways are employed to provide specific isotopic labeling, solution NMR relaxation experiments that can take advantage of these advancements will be developed, allowing for an extremely detailed description of protein dynamics across a wide range of residue types.