PCR/qPCR/dPCR Assay Design

Introduction

The entire PCR workflow is vulnerable to factors which introduce variability. Many of the variable components are unavoidable, such as the source of the sample or the requirement for a reverse transcription step. Assay design is also highly variable and can make the difference between PCR success and failure and also contributes to the reproducibility and sensitivity of an assay. The process of assay design follows a logical flow: The first step is to determine the desired target location. In some cases the sequence of the oligos is determined by the application and cannot be avoided, e.g., SNP detection, in others the entire gene may be used, e.g., copy number determination. Once the approximate assay sites are selected, the most suitable primers are identified and modifications determined. When assays are to be run in multiplex it is important to consider the potential for interaction of all oligos in the reaction and also the relative abundance of the targets. In challenging situations, e.g., where the objective is to detect very low copy numbers or small differences in target concentration, it is advisable to select and test several primer combinations and then combine with a suitable probe.

The process of assay design is greatly facilitated by adoption of suitable design software. OligoArchitect, provided by Sigma‑Aldrich (sigma.com/probedesignonline), provides two options for design support. The first is OligoArchitect Online, a software design tool with a wide range of options. If the design requires a specialized capability, the second option is to request the design via OligoArchitect Consultative, utilizing the assistance of Sigma’s expert, molecular biologists.

Amplicon Selection

The amplicon is the region of target sequence that is to be analyzed and is encompassed by the forward and reverse PCR primers. The determination of the amplicon size is, in part, dependent on the method to be used for analysis. When visualizing PCR fragments by gel electrophoresis, the PCR fragment needs to be large enough to be stained efficiently using a DNA binding dye and fit within the range of the chosen artificial size marker. Similarly, when resolving the fragment through a capillary electrophoresis instrument, the PCR product will be between 100 base pairs and anywhere over 2 kb (eventually restricted by enzyme performance).

When using qPCR for the final readout, a smaller amplicon is selected to ensure accurate quantification at each cycle. Ideally a qPCR amplicon size ranges from 75 to 200 bases in length, unless design restrictions take the primers beyond this range. In reality, the fragment size may be determined by consideration of several factors, including the biology under consideration.

The assay location may be pre-determined by the objectives of the experiment: Ideally, assays for the determination of the presence or quantity of a mRNA target are located over an exon–exon junction to avoid detection of contaminating gDNA sequences. However, these regions are often highly folded; therefore a pragmatic decision is required as to the preference for an exon spanning assay with potentially poorer performance or an exonic assay of higher quality. If the mRNA is abundant and transcribed from a single copy gene, the contribution of signal from gDNA contamination will be considerably less significant than when detecting a low abundant transcript from a multicopy gene. Detection of SNPs requires location of a probe or the 3’ of a primer over the mismatch site.

Analysis of splice variants requires a design approach that is specific to the objective. In some cases it is desirable to detect all splice variants simultaneously and this is simply achieved by selecting an exon boundary that is conserved between all variants. However, investigations into differential expression of each splice variant require a more creative approach. In one example experiment, the objective is to examine which of the alternative transcripts, shown in Figure 6.1, are being expressed. Since the exons are relatively small, amplification across all exons results in a product of around 300 bases, with smaller products resulting from amplification of splice variants. The design options for this study would be:

  • Design several assays across each exon junction and probe each sample with each assay.
  • Design a primer to the 3’ of exon 1 and the 5’ of exon 4. Amplification of any transcript comprised of any combination of exons would result in an amplicon of a specific length. The definition of the transcript would be determined using a qPCR, post reaction SYBR Green I dye melt curve.
A schematic representation

Figure 6.1. A schematic representation of a gene expressed as four potential splice variants. Each splice variant can be distinguished by design of specific primer pairs across each exon junction or by amplification from a generic primer pair in exons 1 and 4 and differentiation of the splice variants using qPCR, SYBR Green I dye melt analysis.

 

In general, amplicon sequences should be assessed using the following criteria:

  • An initial assessment of the region of the target sequence is recommended.
  • Ensure that there are no unexpected SNPs. A single mismatch between the primer and the template can decrease the melting temperature by up to 10 °C, affecting the efficiency of PCR.
  • Confirm that the selected sequence does not have homology to any other sequence in the genome/transcriptome of the target species. When targeting multiorganism systems, e.g., pathogen detection, the homology determination must include all sequences that may be in the sample.
  • Test the potential for the target sequence to adopt a secondary structure using the folding algorithm of OligoArchitect (sigma.com/probedesignonline), the selected design software or mfold (http://mfold.rna.albany.edu/) to model the template folding at the desired primer annealing temperature. Select template regions that have a predicted open structure by avoiding stem loop secondary structures with very negative ΔG values. This is an important consideration when using a one-step reverse transcription protocol and gene-specific primers.
  • Avoid palindromic sequences and repetitive regions.
  • Avoid G/C rich areas, aiming for approximately 50% GC content.
  • When multiplex assays are being designed the amplicons should be as similar as possible, in length and CG content, to avoid biased amplification.
  • Identify regions of homology or heterogeneity (as required) when designing assays to gene families. Sequences should then be aligned and examined for stretches of suitable consensus sequence.
  • For transcript-specific designs (to avoid detection of gDNA templates), target regions over an exon-exon junction where possible.

Transcript-specific Amplicon Selection

Most, but not all, DNA is eliminated from the sample during RNA purification. To avoid DNA amplification during RT-qPCR, it is advisable to select primers that either flank a large intron that is not present in the mRNA sequence or that span an exon-exon junction (Figure 6.2).

Intron/exon annotations for known genes from many vertebrate, bacteria, protist, fungi, plant and invertebrate metazoan species are available at the EnsemblGenomes website (http://ensemblgenomes.org/). Alternatively, if both genomic and cDNA sequences for the target gene are publicly available, intron positions can be identified by performing a BLAST search with the cDNA sequence against the genomic database for the target organism (Figure 6.3). Intron 1 in Figure 6.3 is long enough (~6.5 kb) that DNA should not be amplified under conventional qPCR conditions or controlled PCR conditions. However, all other introns are relatively short (<1 kb), thus the DNA is likely to be amplified during RT-qPCR (for an example, see Assay Optimization and Validation). Primers should either span exon-exon junctions, flank a long (several kb) intron, or flank multiple small introns.

Primers span-flank intron

Figure 6.2. Illustration of (A) intron-spanning and (B) intron-flanking primers for RT-PCR. Introns are in red and exons are in green. Primers P1 and P2 span an intron and primers P3 and P4 flank an intron. Note that primers P1 and P2 will not generate a PCR product from DNA unless the annealing temperature is extremely low. P3 and P4 may generate a longer PCR product from DNA if the intron is short (~1 kb), but not if it is sufficiently long (several kb).

 

BLAST alignment of cDNA sequence with genomic DNA sequence

Figure 6.3. BLAST alignment of cDNA sequence with genomic DNA sequence. The complete cDNA sequence for rat p53 from Genbank (accession number NM_030989) was used in a megaBLAST search for identical sequences in the rat genome (blastn). The alignment of the cDNA to the gDNA on chromosome 10 is shown. Using this information, the exons of the cDNA can be aligned to the corresponding gDNA regions and primer design is directed towards exons that are separated by long introns, e.g., exons 1 and 2.

Methylation-specific Assays

DNA methylation is a crucial part of cellular differentiation, causing gene expression to be altered in a stable manner. Methylation is important for normal development in higher organisms and can be inherited. Gene regulation via DNA methylation involves the addition of a methyl group to position 5 of the cytosine pyrimidine ring or nitrogen 6 of the adenine purine ring. In adult somatic tissues, DNA methylation typically occurs in a CpG dinucleotide context whereas non-CpG methylation is prevalent in embryonic stem cells. Methylation specific assays require identification of CpG islands within the sequence, often within the gene promoter region. This information is automatically located when using Beacon Designer (Premier Biosoft) and is available at http://www.mybioinfo.info/index.php.

Primer Design

While the general primer and probe design suggestions described in this chapter are applicable to numerous applications including gene expression studies, SNP detection, methylation detection studies, copy number determination, monitoring viral load and splice variant quantification, each application also has specific design considerations which will be discussed separately.

General Design Criteria for Primers
For the majority of applications, primers are designed to be fully complementary to the template DNA sequences that they are intended to prime. The basic design considerations for PCR primers include:

  • Primers are typically 20–24 nucleotides in length with a melting temperature (Tm) of approximately 60 °C
    (59±2 °C) for qPCR but may vary (55±5 °C) for conventional PCR. Specific applications may require modifications to primer length and Tm.
  • Primer pairs should possess 40–60% GC content and should lack significant secondary structure.
  • Primers should not be complementary to themselves or partner primers, particularly at the 3’ end. This reduces the potential for the formation of primer-dimer products during amplification.
  • Avoid 3’ clamping (examine the 5 bases of the 3’ and accept 3 of these as A or T and 2 as G or C).
  • Avoid runs of the same nucleotide that are longer than 4 repeats or palindromic regions.

SNP-specific Primers
In some cases, such as when designing single-nucleotide polymorphism (SNP) assays, there is no flexibility for the location of the assay and the surrounding sequence will also influence the sequence of the selected oligos. The recognition of the association between clinical conditions and both germline and acquired somatic SNPs continues to drive considerable efforts into the development of increasingly sensitive and specific detection systems. This reflects the challenging nature of SNP discrimination using oligo hybridization. The challenge is due to the differences in destabilization between different mismatches. Where G:A, C:T and T:T may have a strong destabilizing effect, G:T and C:A are much weaker because hydrogen bonds can form and therefore it is difficult to discriminate these pairings from the natural G:C and T:A. Many systems are adaptations of the Amplification-Refractory Mutation System (ARMS)1 that has been widely used and was instrumental in screening for cystic fibrosis mutations2. ARMS primers are 30 bases long (longer primers, up to 60 bases are functional). The base at the 3’ is SNP specific and therefore specific for the target sequence (normal or mutant base). An additional mismatch is introduced at the penultimate position. This is determined with consideration to the neighboring bases and the SNP mismatch (Table 6.1 adapted from Little, 2001).

Table 6.1. Selection of Penultimate Base Mismatch for ARMS Primers.

3' Base in Primer Matching to WT Mismatch Base in Template of Mutant under Terminal SNP Bases of Primer Coding Strand Base Pairing to Penultimate Base in Primer
A C G T
A A A G A G
G A C T A G
C A G A C T
T T C T A G
G T G A T C or T
C T C T A G
C C C T A G
G G A G A G

Reprinted with permission of Current Protocols in Human Genetics. Little, S. 2001. Amplification-Refractory Mutation System (ARMS) Analysis of Point Mutations. Curr. Protoc.Hum. Genet. 7:9.8.1–9.8.12.

 

Table 6.2. An Example of the Potential Hybridization Pairing of Specific and Mismatched Primers to Detect a G>A Mutation. The addition mismatch added to the penultimate primer base results in greater destabilization and prevents elongation that may occur from a primer which has a single mismatch at the 3’ end.

Reprinted with permission of Current Protocols in Human Genetics. Little, S. 2001. Amplification-Refractory Mutation System (ARMS) Analysis of Point Mutations. Curr. Protoc. Hum. Genet. 7:9.8.1–9.8.12.

 

Additional research groups have used similar ideas and demonstrated the utility of introducing mismatches at the N-2 and N-3 positions in the primers, Liu et al.3 have performed an in depth analysis of the relative positions of mismatches for the greatest destabilization effect and, therefore, highest specificity.

Multiplex PCR
Amplification of several targets simultaneously in multiplex PCR is required when there is a desire to increase throughput with more PCRs per tube or to save sample material. Primer design is the most critical factor to successful multiplex PCR. It is crucial that the general guidelines are followed and that compatibility is verified for all the primers (and probes) to be included in the reaction. In some cases it can be advantageous to use slightly longer primers with a Tm of around 65 °C. If the resulting amplicons are to be analyzed based on size discrimination, the resolving power of the analysis must be considered in the assay design. When attempting to quantify multiple targets using qPCR, the amplicons should be as similar as possible to avoid amplification bias. In addition to the primers, it is important that the template cannot adopt stable secondary structure as this would impede PCR. If it is known that the targets are present at significantly different concentrations, it may be advantageous to include a blocking primer to the high concentration target to facilitate accurate detection of the lower concentration target4.

Non-coding RNA Quantification
In contrast to the coding genome, it is estimated that ~97% of the human transcriptome is composed of non-coding RNA (ncRNA)5;6,7. One member of this family is the long non-coding RNA which have been described as a class of regulatory RNA molecules. These molecules have roles in epigenetics, development, cancer and essential biological processes8,9. Long ncRNAs are traditionally defined as consisting of RNA strands of at least 200 bases10,11,12. This means that after recognition of the amplicon length, no special considerations, other than those already referred to, need to be made when designing for these targets.

In contrast, the family members that comprise the microRNAs (miRNAs) present considerable design challenges. These are short non-coding RNAs (sncRNA) of 21–23 nt that are produced via a complex cellular pathway at several stages of transcript processing. MicroRNAs negatively regulate protein translation by binding to the transcript (reviewed in Kato et al., 200813) and induce the formation of the RNA induced silencing complex (RISC)14. Commercial assays, such as the MystiCq® (Sigma) line are a welcome solution to the design challenges presented by miRNA. There have been several proposed schemes for qPCR for miRNA analysis15 and for those studying organisms for which there are no commercial products, there are several publications describing potential solutions. Many of these rely on addition of bases to the original miRNA by ligation of an adapter (see Chapter 22, Casoldi, et al., in PCR Technologies; Current Innovations ed Nolan16) or by addition of a poly-A tail using polyA polymerase (PAP)17. The addition of a tag to each of the miRNA specific primers enables optimization of hybridization Tm and reactions containing DNA primers have been shown to be more efficient than those spiked with LNA18. In this report DNA primers specific to miRNA were designed using conventional PCR primer guidelines with additional considerations:

  • Examine miRNA sequence and disregard all terminal A bases at the 3’. • Identify the forward primer as the longest stretch of sequence from 5’ to the terminal 4 bases of the 3’ (ignoring the A bases identified above).
  • It is preferable for one of the 2 bases at the 3’ of the forward primer to be A or T.
  • It is preferable for the 3 bases at the 3’ to include 1 or 2 A or T.
  • It is preferable for the 5 bases at the 3’ to include 2–3 A or T.
  • Analyze the Tm of the forward primer using a nearest neighbor algorithm. If this is below 59 °C, add the following bases to the 5’ in the given order and calculate the Tm after each addition: G,A,C,G,C (resulting in a primer of the form CGCAGN18 where N are the miRNA specific bases). Select the shortest primer with Tm closest to 59 °C. If it is above 59 °C, remove 5’ bases and calculate Tm after each base removal. Select the longest primer with Tm closest to 59 °C.
  • Select the 3’ bases for the reverse primer ensuring that these are not complimentary to the forward primer. Assess the terminal 5 bases as described for the forward primer.
  • Add 15×T to the primer (e.g., 5’ T15 N5 3’).
  • Analyze the Tm of the forward primer using a nearest neighbor algorithm. If this is below 59 °C add the following bases to the 5’ of the primer in the given order and calculate the Tm after each addition: G,A,C,C, T,G,G,A,C, C (resulting in a primer of the form CAGGTCCAG T15 N5 where N are the miRNA specific bases). Select the shortest primer with Tm closest to 59 °C.
  • The RT primer is CAGCTCCAG T15 V N (where V=A, C and G and N=A,C,G,T).

Primer Design Example

Since target sequences dictate the primer sequences, it may not always be possible to achieve the desired design criteria. Therefore compromises to assay design are overcome by assay-specific optimization. Some PCR targets may require special processing before a successful assay may be designed. A frequently encountered case concerns the detection of pathogens, including viruses. It is well-known that many viruses have high degrees of variability at specific locations in their genomes. A good example is the Hepatitis B virus. In a recent study, to design a successful qPCR assay against the known HBV variants19, it was necessary to conduct an extensive alignment of all of the available HBV genomic sequences. Several hundred sequences were compared using ClustalW in an attempt to find significant stretches of consensus sequence that might be used for a generic assay design. A snippet of the alignment result is shown in Figure 6.4. The asterisks (*) represent the consensus nucleotides found in the analysis of all of the genomes (a large number of other sequences that were a part of this alignment are not shown due to space restrictions).

partial-clustalw-analysis

Figure 6.4: Partial ClustalW analysis of HBV genome data. All known HBV genomic sequences were aligned using ClustalW and conserved nucleotides identified (*)

 

When designing primers and probes in such situations, it may be necessary to use oligos that contain mixed bases, also known as “wobbles” or degenerate bases. For example, consider the details of the consensus sequence shown for HBV (Figure 6.5).

A selected region of the HBV alignment showing regions of consensus

Figure 6.5. A selected region of the HBV alignment showing regions of consensus

 

A region of approximately twenty-three bases is required for a primer. In this case, when considering all possibilities of sequence for all HBV genomes the actual sequence of that twenty-three base region is shown in Figure 6.6:

The permutations of primer sequence to accommodate

Figure 6.6. The permutations of primer sequence to accommodate all base options for the selected consensus primer region.

 

The positions of ambiguous base can be represented using standard single letter codes for mixed bases (Table 6.3). When these are applied to the sequence shown in Figures 6.5 and 6.6, the oligo can be described as in Figure 6.7.

Table 6.3. Single Letter Mixed Base Codes.

Code Mixed Bases
B C G T  
D   G T A
H C   T A
K   G T  
M C     A
N C G T A
R   G   A
S C G    
V C G   A
W     T A
Y C   T  

A = adenosine, C = cytidine, G = guanosine, T = thymidine

 

The ambiguous bases of the consensus region oligo

Figure 6.7. The ambiguous bases of the consensus region oligo are represented by standard single letter codes.

 

This option is unlikely to result in a successful PCR primer, because there are additional considerations that need to be addressed concerning the high number of degenerate bases. In particular, a synthetic oligo manufactured using this sequence would, in fact, result in a mixture of each of the possible single base sequences. The number of possible, individual primer sequences is calculated by multiplying the individual base numbers at each position. For this sequence, this means 1×2×1×1×2×1×1×2×2×1×3×2×1×2×2×2×2×1×1×2×2×1×1 = 6,144 possible individual oligos. Therefore, the effective concentration of each specific oligo in the reaction is reduced proportionally. Empirical analysis has shown that the number of different sequence permutations in a primer should not be more than 512, therefore this example would not be an optimal degenerate-base primer. A redesign to a different location would offer a potential solution. In the example shown in Figure 6.8, the primer contains 2 bases with potential mismatches. However, these each have a single alternative base, resulting in 4 oligos in the mixed synthesis and the wobbles are located in the 5’ region of the primer. These factors offer a much higher chance of success than the primer presented in Figure 6.7.

Sequence showing alignment consensus

Figure 6.8. Sequence showing alignment consensus bases and potential primer location to a consensus region. The consensus primer is shown using wobble codes

When using degenerate oligos in PCR, a modified amplification protocol may be necessary. Cycling may be started with 2–5 cycles at a low annealing temperature (35–45 °C). Also, a slow ramp from the annealing temperature to the extension temperature should be incorporated, taking approximately 3–5 minutes to reach the extension temperature. The protocol should then be finished with 25–40 cycles at a more stringent annealing temperature without the ramp modification.

It is preferable, when possible, to avoid nucleotide heterogeneity. If is not possible to avoid regions of heterogeneity, which is often the case with difficult targets, then the use of specialized oligo modifications, such as inosine and other “universal” bases, such as 5-nitroindole, may help reduce the complexity (sigma.com/mods) and addition of modifying groups such as ZNA (see Quantitative PCR and Digital PCR Detection Methods) may improve performance.

Probe Design

As for PCR primers, qPCR probe design also depends largely on the sequence context and the desired application. Single probes such as Dual-Labeled Probes or Molecular Beacons are typically 20–30 bases long. Scorpions® Probes have a shorter probe length of 15–25 bases. In a LightCycler or FRET system, there are two probes; the sensor (probe 1) and anchor (probe 2) probes that are situated in close proximity, separated by 1–5 bases.

  • When using Dual-Labeled Probes, the Tm should be 7 °C to 10 °C higher than that of the primers to ensure that the probe has bound to the target before the primers hybridize and are extended. This is also the case for both FRET probes in the reaction. When used for SNP detection, the sensor probe (probe 1) is situated over the site of mismatch, avoiding the terminal 3 bases of the probe and has a lower Tm (about 5 °C) than the anchor probe (probe 2). Scorpions® Probes are the exception to this recommendation since the probe binds to the newly synthesized template after extension rather than before, as for other probe systems.
  • For quantitative studies using Dual-Labeled Probes, aim for the probe to be positioned close to the 3’ of the forward primer but not overlapping (around five bases); for SNP detection, position a Dual-Labeled Probe or Molecular Beacon in the center of the amplicon and the SNP in the center of the probe.
  • Avoid a guanidine at the 5’ end of probes, next to the reporter, as this causes quenching.
  • Ensure there are fewer Gs than Cs in the probe sequence.
  • Avoid runs of the same base (<4) and palindromic sequences.
  • Ensure that the probe cannot adopt secondary structure.
  • Ensure that the probe cannot hybridize to the primers.
  • When designing probes for multiplex reactions, ensure that there are no potential interactions between any of the probes and primers.

Molecular Beacons
After a suitable probe region has been selected, complementary stems are added to the 5’ and 3’ ends to create the Molecular Beacon structure20. The example below shows the addition of a stem sequence (in red) to a Dual-Labeled Probe to create a Molecular Beacon (Figure 6.9 adapted from Thelwell 200021).

 

Adaptation of a Dual-Labeled Probe Assay to a Molecular Beacon

Figure 6.9. Adaptation of a Dual-Labeled Probe Assay to a Molecular Beacon Format. Nucleic acids research by Oxford University Press. Reproduced with permission of Oxford University Press in the format reuse in a book/e-book via Copyright Clearance Center.

 

Scorpions® Probes
The Scorpions® Probe requires assembly of the probe with a forward primer such that they adopt the structure: 5’ dyestem–probe-stem–quencher–blocker–primer. The primer and probe must be on opposite strands since the probe binds to the newly created template that is on the same strand as the primer. The example shown in Figure 6.10 shows the addition of label, quencher, PCR blocker and stem sequences to a Dual‑Labeled Probe to create a Scorpions® Probe (adapted from Thelwell 200021).

 

Adaptation of a Dual-Labeled Probe Assay

Figure 6.10. Adaptation of a Dual-Labeled Probe Assay to a Scorpions® Probe Format. Nucleic acids research by Oxford University Press. Reproduced with permission of Oxford University Press in the format reuse in a book/e-book via Copyright Clearance Center.

 

Probe Modifications
When probes are located over a region of sequence with undesired heterogeneity, ambiguous bases are managed as described for PCR primers. In addition, it is also possible to incorporate modified nucleotides into qPCR probes, a common example is the addition of a Locked Nucleic Acid® (LNA®) base. LNA is a modified RNA nucleotide. The ribose moiety of LNA is modified with an extra bridge connecting the 2’ oxygen and 4’ carbon (see Quantitative PCR and Digital PCR Detection Methods). The bridge “locks” the ribose in the 3’-endo conformation. LNA can be mixed with DNA or RNA bases in the oligonucleotide wherever desired. The LNA modification results in increased thermal stability allowing for shorter probes to be designed with the equivalent Tm to a longer, non-modified equivalent probe. LNA-containing sequences are more specific than oligos comprised of DNA alone and ideally suited to SNP detection. Additional applications of LNA modifications include designing oligos for analysis of difficult sequences, such as viruses, where a high degree of variability can make it difficult to design a generic assay22.

The 5’ of a Dual-Labeled Probe, Molecular Beacon or Scorpions® Probe is labeled with a fluorophore, usually 6-FAM™ for single assays or when multiplexing, typically choosing these in the order FAM, HEX™/JOE™, Cyanine 5 (it is critical to determine compatibility of the label with the instrument). The 3’ of a Dual-Labeled Probe or Molecular Beacon and the 3’ end of the internal stem region of a Scorpions® Probe are modified with a quencher molecule. Historically, the dye TAMRA was used as an acceptor for FAM emissions, resulting in FAM quenching. Developments in dark quencher technology have resulted in widespread adoption of Black Hole Quenchers® and the more recent introduction of the Sigma Onyx Quencher™ collection (see Quantitative PCR and Digital PCR Detection Methods).

Template Controls
One advantage of using relatively short-length amplicons that are typically less than 150 bases, is that it is then possible to synthesize a long oligonucleotide that may be used as a synthetic amplification target. Use of such a target may help in development and optimization of assays where the intended target may be rare or in short supply, for example in the inhibition control assay SPUD23 (see Appendix A) or infectious disease detection studies19.

Oligo Synthesis and Handling

When ordering custom oligos for use in PCR applications, decisions must be made regarding the desired yield/scale of synthesis, purity and required modifications. Each of these factors impacts on the other, e.g., a higher level of purification will result in better quality oligonucleotide but at the cost of a reduction in overall yield. Tables 6.4, 6.5 and 6.6 provide guidance as to the synthesis scale and expected yield of oligonucleotides manufactured by Sigma.

Oligonucleotide Purification

When DNA is synthesized, each nucleotide is coupled to the growing chain sequentially, beginning from the 3’ end of the sequence. In each coupling cycle, a small percentage of the oligo chains will not be extended, resulting in a mixture of fulllength product and truncated sequences. After the oligo is cleaved from the support and the protecting groups are removed, purification is used to separate the full-length product from the truncated sequences. In general, the purity required for a specific application depends on the potential affect from the presence of truncated oligomers. For some applications, it is crucial that only the full-length (n) oligo be present. For others, such as PCR primers, the presence of shorter oligos (n-1,n-2,...) may not affect the experimental results.

Desalt Purification
The desalting procedure removes residual by-products that are remaining from the synthesis, cleavage and deprotection steps.

For many applications, including PCR, desalt purification is acceptable for oligos that are no more than 35 bases long, as the overwhelming abundance of full-length oligo outweighs any contributions from shorter products. Oligos required for cloning or greater than 35 bases in length require an additional method of purification, such as Reverse-Phase Cartridge Purification (RP1), HPLC, or PAGE (depending on length).

Reverse-phase Cartridge Purification (RP1)
Separation on a reverse-phase cartridge removes a high proportion of truncated sequences. The difference in hydrophobicity between full-length product and truncated sequences is used as the basis of the separation. While the full-length oligo is retained on the column, the truncated sequences are washed off. The desired full-length product is then eluted and removed from the cartridge.

Reverse-phase HPLC
As the oligo length increases, the proportion of truncated sequences tends to increase. Not all of these impurities will be removed by RP1 and thus for longer oligos, such as artificial amplicon template oligos or labeled probe oligos, HPLC or PAGE purification is recommended. Reverse-phase, high performance liquid chromatography (RP-HPLC) operates on the same principle as a reverse-phase cartridge. However, the higher resolution allows for higher purity levels. HPLC is an efficient purification method for oligos with fluorophores, such as qPCR probes, as their intrinsic lipophilicity provides excellent separation of product from contaminants. Furthermore, RP-HPLC is a method of choice for larger scales due to the capacity and resolving properties of the column. The resolution based on lipophilicity will decrease as the length of the oligo increases. Therefore, RP-HPLC is usually not recommended for purifying products longer than 50 bases. Although longer oligos (up to 80 bases) can be purified using this method, the purity and yields may be adversely affected.

Anion-exchange HPLC
Anion-exchange separation is based on the number of phosphate groups in the molecule. The anion-exchange purification method involves the use of a salt-gradient elution on a quaternary ammonium stationary phase column or a similar structure. The resolution is excellent for the purification of smaller quantities. This technique can be coupled with purification by RP-HPLC, adding a second dimension to the separation process. Anion-exchange HPLC is limited by oligo length (usually up to 40mers). The longer the oligonucleotides, the lower the resolution on the anion-exchange HPLC column and therefore the lower the purity of the target oligo.

PolyAcrylamide Gel Electrophoresis (PAGE)
The basis of the PAGE separation is charge over molecular weight, leading to good size resolution, resulting in purity levels of 95–99% full-length product. Yields from PAGE are lower than from other methods due to the complex procedures required for extracting oligos from the gel and the removal of the vast majority of truncated products. This technique is recommended when a highly purified product is required. PAGE is the recommended purification for longer oligos (≥50 bases).

 

Table 6.4. Expected Yield for Standard Oligos with Consideration of Purification Method.

Standard Oligos (OD/μg)*
Scale (μmol) Desalt Cartridge HPLC PAGE
0.025 3/90 NA NA NA
0.05 5/150 1/30 1/30 0.5/15
0.2 12/360 3/90 2.5/75 1/30
1.0 40/1,200 12/360 13/390 5/150
10 400/12,000 NA 130/3,900 NA
15 600/18,000 NA 190/5,700 NA

*Guarantee is for 20mers or longer. Shorter oligos may have fewer ODs.

 

Table 6.5. Expected Yield for Modified Oligos with Consideration of Purification Method.

Standard Oligos (OD/μg)*
Scale (μmol) Desalt Cartridge HPLC PAGE
0.05 2/60 0.4/12 0.4/12 0.2/6
0.2 5/150 1/30 1/30 0.4/12
1.0 16/480 5/150 5/150 2/60
10 160/4,800 N/A 52/1,560 N/A
15 240/7,200 N/A 76/2,280 N/A

*Guarantee is for 20mers or longer. Shorter oligos may have fewer ODs.
Note: Post-synthesis modifications may yield 50% less than the above stated values.

Table 6.6. Guaranteed Amounts for qPCR Probes.

qPCR Probes
Detection Chemistry Quantity (OD)  
Dual-Labeled Probes 1 3
10 10
Molecular Beacons 1
3
10 10
LightCycler Probes 0.1 0.25 1.5 15
Scorpions® Probes 1
5
10 N/A

All probes are purified by RP-HPLC. Inquire for 50 and 100 OD quantities.

Oligonucleotide Preparation

DNA oligonucleotides provided dry are ready for use upon re-suspension. It is recommended that oligonucleotides are resuspended in a weak buffer such as TE (10 mM Tris, pH 7.5–8.0, 1 mM EDTA). In applications where TE is not suitable, sterile nuclease-free water may be used. However, high-grade water may be slightly acidic and is not recommended for long-term storage of oligonucleotides.

A 100 μM stock solution may be obtained by using the following guideline: Take the number of nanomoles (nmol) provided (information found on the tube label and/or quality assurance document supplied with the oligo) and multiply by 10. The result provides the number of microliters of liquid to add to the tube to achieve a final concentration of 100 μM. For example, if the oligo yield is 43.5 nmol, the volume to add for 100 μM stock is 435 μL. Note that this is equivalent to a stock solution of 100 pmol/μL. The stock solution may then be further diluted as necessary, based upon the application requirements. For PCR, 10 μM or 20 μM working concentration is typically used. Store the stock solution in aliquots at –20 °C and avoid multiple freeze–thaw cycles.

 

References

  1. Little, S. Amplification-refractory mutation system (ARMS) analysis of point mutations. Curr Protoc Hum Genet 2001;Chapter 9
  2. Ferrie, R.M., Schwarz, M.J., Robertson, N.H., et al. Development, multiplexingand application of ARMS tests for common mutations in the CFTR gene. Am J Hum Genet 1992; 51: 251-262
  3. Liu, J., Huang, S., Sun, M., et al. An improved allele-specific PCR primer design method for SNP marker analysis and its application. Plant Methods 2012; 8: 34
  4. Vestheim, H., Jarman, S.N. Blocking primers to enhance PCR amplification of rare sequences in mixed samples - a case study on prey DNA in Antarctic krill stomachs. Front Zool 2008; 5: 12
  5. Taft, R.J., Pheasant, M., Mattick, J.S. The relationship between non-proteincoding DNA and eukaryotic complexity. Bioessays 2007;29: 288-299
  6. Taft, R.J., Pang, K.C., Mercer, T.R., et al. Non-coding RNAs: regulators of disease. J Pathol 2010; 220: 126-139
  7. Beck, D., Ayers, S., Wen, J., et al. Integrative analysis of next generation sequencing for small non-coding RNAs and transcriptional regulation in Myelodysplastic Syndromes. BMC Med Genomics 2011; 4: 19
  8. Ponjavic, J., Oliver, P.L., Lunter, G., et al. Genomic and transcriptional co-localization of protein-coding and long non-coding RNA pairs in the developing brain. PLoS Genet 2009; 5: e1000617
  9. Ponting, C.P., Oliver, P.L., Reik, W. Evolution and functions of long noncoding RNAs. Cell 2009; 136: 629-641
  10. Carninci, P., Kasukawa, T., Katayama, S., et al. The transcriptional landscape of the mammalian genome. Science 2005; 309: 1559-1563
  11. Rozowsky, J., Wu, J., Lian, Z., et al. Novel transcribed regions in the human genome. Cold Spring Harb Symp Quant Biol 2006; 71: 111-116
  12. Kapranov, P., Cheng, J., Dike, S., et al. RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 2007; 316: 1484-1488
  13. Kato, M., Slack, F.J. microRNAs: small molecules with big roles - C. elegans to human cancer. Biol Cell 2008; 100: 71-81
  14. Tijsterman, M., Plasterk, R.H. Dicers at RISC; the mechanism of RNAi. Cell 2004; 117: 1-3
  15. Benes, V., Castoldi, M. Expression profiling of microRNA using real-time quantitative PCR, how to use it and what is available. Methods 2010; 50: 244-249
  16. PCR Technologies: Current Innovations. 3 ed. Edited by Nolan and Bustin, CRC Press; 2013
  17. Shi, R., Chiang, V.L. Facile means for quantifying microRNA expression by real-time PCR. Biotechniques 2005; 39: 519-525
  18. Balcells, I., Cirera, S., Busk, P.K. Specific and sensitive quantitative RT-PCR of miRNAs with DNA primers. BMC Biotechnol 2011; 11: 70
  19. Heath, Ashley R., Deluge, Norha, de Amorim, Maria Galli, et al. Development and use of qPCR assays for detection and study of neglected tropical and emerging infectious diseases. In: Tania Nolan, Stephen A Bustin, eds. PCR Technologies: Current Innovations 3rd Edition ed CRC Press; 2013
  20. Tyagi, S., Kramer, F.R. Molecular beacons: probes that fluoresce upon hybridization. Nat Biotechnol 1996; 14: 303-308
  21. Thelwell, N., Millington, S., Solinas, A., et al. Mode of action and application of Scorpion primers to mutation detection. Nucleic Acids Res 2000; 28: 3752-3761
  22. Petersen, M., Wengel, J. LNA: a versatile tool for therapeutics and genomics. Trends Biotechnol 2003; 21: 74-81
  23. Nolan, T., Hands, R.E., Ogunkolade, W., et al. SPUD: a quantitative PCR assay for the detection of inhibitors in nucleic acid preparations. Anal Biochem 2006; 351: 308-310