onononononono dodododododod Ododododododo □onononODODOD O D O □ O " ~ ^ O D O D O DOn^fBRS/^OD Kód předmětu: C8980 I 1MJ ^ MASARYKOVA UNIVERZITA Mé ODODOu^uODODO DODODODODODOD ODODODODODODO ein expression and purification . Fusion proteins and affinity purification Radka Dopitova Tento projekt je spolufinancován Evropským sociálním fondem a státním rozpočtem České republiky. MINISTERSTVO ŠKOLSTVÍ, OP Vzděláváni MLÁDEŽE A TĚLOVÝCHOVY pro konkurenceschopnost EVROPSKÁ UNIE ■ INVESTICE DO ROZVOJE VZDĚLÁVÁNÍ Fusion proteins (tagged proteins) Translation fusion of sequences coding a recombinant protein and a) short peptides [ex. (His)n, (Asp)n, (Arg)n ... ]. b) protein domains, entire proteins [ex. MBP, GST, thioredoxin ... ]. Engineering a tagged protein requires adding the DNA encoding the tag to either the 5' or 3' end of the gene encoding the protein of interest to generate a single, recombinant protein with a tag at the N- or C-terminus. The stretch of amino acids containing a target cleavage sequence (CS) is included to allow selective removal of the tag. (i) 5' Promoter Tag Gene of interest 3' Terminator Transcribe and translate Tag fused to the N-terminus of the protein of interest (ii) 5' Promoter Gene of interest CS Tag 3' Terminator Transcribe and translate 1C Tag fused to the C-terminus of the protein of interest Expression plasmids containing various tags are commercially available Purposes of fusion tags > Increasing the yield of recombinant proteins - Fusion of the N-terminus of the target protein to the C-terminus of a highly expressed fusion partner results in high level expression of the target protein. > Enhancing the solubility of recombinant proteins - Fusion of the N-terminus of the target protein to the C-terminus of a soluble fusion partner often improves the solubility of the target protein. > Improving detection - Fusion of the target protein to either terminus of a short peptide (epitope tag) or protein which is recognized by an antibody (Western blot analysis) or by biophysical methods (e.g. GFP by fluorescence) facilitates the detection of the resulting protein during expression or purification. > Localization - A tag, usually located on the N-terminus of the target protein, which acts as an address for sending a protein to a specific cellular compartment. > Facilitating the purification of recombinant proteins - Simple purification schemes have been developed for proteins used at either terminus which bind specifically to affinity resins. No single tag is ideally suited for all of these purposes Fusion partner (tag) Size Tag placement Uses His-tag 6, 8, or 10 aa N- or C-terminus Purification, detection Thioredoxin 109 aa(11.7kDa) N- or C-terminus Purification, solubility enhancement Calmodulin-binding domain (CBD) 26 aa N- or C-terminus Purification Avidin/streptavidin Strep-tag 8 aa N- or C-terminus Purification, secretion Glutathione S-transferase (GST) 26kDa N-terminus Purification, solubility enhancement Maltose binding protein (MBP) 396 aa (40 kDa) N- or C-terminus Purification, solubility enhancement Green fluorescent protein (GFP) 220 aa (27 kDa) N- or C-terminus Localization, detection, purification Poly-Arg 5-16 aa N- or C-terminus Purification, solubility enhancement N-utilization substance A (NusA) 495 aa (54.8 kDa) N-terminus Solubility enhancement Combinatorial tagging > No single tag is ideally suited for all purposes. Therefore, combinatorial tagging might be the only way to harness the full potential of tags in a high-throughput setting. Combinations: Solubility-enhancing tag + purification tag: MBP + His6tag 2x purification tag: IgG-binding domain + streptavidin-binding domain Detection tag + purification tag: GFP + His6 tag Detection tag + 2x purification tag + immunodetection: GFP + SBP domain + His8 tag + c-Myc Tag8 Advantages Disadvantages GST Eflicient translation High metabolic burden initiation Inexpensive affinity resin Homodimeric protein Mild elution conditions Does not enhance solubility MBP Efficient translation High metabolic burden initiation Inexpensive affinity resin Enhances solubility Mild elution conditions NusA Efficient translation High metabolic burden initiation Enhances solubility Not an affinity tag Thioredoxin Efficient translation Not an affinity tagb initiation Enhances solubility Ubiquitin Efficient translation Not an affinity tag initiation Might enhance solubility FLAG Low metabolic burden Expensive affinity resin High specificity Harsh elution conditions BAP Low metabolic burden Expensive affinity resin Mild elution conditions Variable efficiency of enzymatic biotinylation Provides convenient means Co-purification of E. coli of immobilizing proteins in biotin carboxyl carrier a directed orientation protein on affinity resin Does not enhance solubility Hiss Low metabolic burden Specificity of IMAC is not as high as other affinity methods Inexpensive affinity resin Mild elution conditions Tag works under both Does not enhance native and denaturing so In bility conditions STREP Low metabolic burden Expensive affinity resin High specificity Does not enhance solubility Mild elution conditions SET Enhances solubility Not an affinity tag CBP Low metabolic burden Expensive affinity resin High specificity Does not enhance solubility Milu' elution conditions Stag Low metabolic burden Expensive affinity resin High specificity Harsh elution conditions (Dron-column cleavage) Does not enhance solubility Advantages and disadvantages of used fusion tags "GST. glutathione S transferase: MBP.maltose binding protein: NusA,N utilization substance A: FLAG, FLAG tag peptide: BAP. biotin acceptor peptide: Hise, hcxahistidinc tog: STREP, stroptavidin binding peptide: SET, solubility enhancing tag: CBP, calmodulin binding peptide. "Derivatives of thioredoxin have been engineered to have affinity for immobilized metal ions [His patch thioredoxini or avidin/streptavidin [38). > Proteins do not naturally lend themselves to high-throughput analysis because of their diverse physiological properties. Affinity tags have become indispensable tools for structural and functional proteomics. > Because affinity tags have the potential to interfere with structural and functional studies, provisions must also be made for removing them. Waugh, 2005 Otázka č. 1: Jaké jsou důvody pro využívaní tagů/kotev? Vyjmenujte 3. Increasing the yield of recombinant proteins using fusion technology Yield enhancing tags are proteins and peptides which can be involved in: > Increasing the efficiency of translation initiation (e.g. GST, MBP, NusA...) - Advantage of N-terminal tags - Providing a reliable context for efficient translation initiation - Ribosome efficiently initiates translation at the N-terminal methionin of the tag - Deleterious secondary structures are more likely to occur in conjunction with short N-terminal tags because short RNA-RNA interactions tend to be more stable than long-range interactions. > Protection against proteolytic degradation - Several studies have shown that the nature of terminal residues in a protein can play a role in recognition and subsequent action by proteases and in some cases affinity tags might improve the yield of recombinant proteins by rendering them more resistant to intracellular proteolysis. > Helping to properly fold their partners leading to increased solubility of the target protein (in vivo and in vitro). Enhancing the solubility of recombinant proteins Solubility-enhancing tags - Advantage of N-terminal tags - Rather proteins (highly soluble proteins) than peptides -Fusion with a soluble fusion partner often helps to properly fold their fusion partners leading to improved solubility (in vivo and in vitro) of the target protein. -Fusion partners do not perform equally with all target proteins, and each target protein can be differentially affected by several fusion tags (Esposito and Chatterjee, 2006). - The choice of a fusion partner is still a trial-and-error experience. >PROTEINS Some commonly used solubility-enhancing fusion partners Tag Protein Source organism MBP Maltose-binding protein Escherichia coli GST Glutathione-S-transferase Schistosoma japonicum Trx Thioredoxin Escherichia coli NusA N-Utilization substance Escherichia coli SUMO Small ubiquitin-moditier Homo sapiens SET Solubility-enhancing tag Synth etc DsbC Disulfide bond C Escherchia coli Skp Seventeen kilodalton protein EschericNa coli T7PK Phage T7 protein kinase Bacteriophage T7 QB1 Protein G B1 domain Streptococcus sp. ZZ Protein A IgG ZZ repeat domain Staphylococcus aureus Adopted from Esposito and Chatterjee, 2006 > PEPTIDES Poly-Arg Poly-Lys Generate parallel expression clones Dead end: insolubility His6 ( GST)—Target protein His6 (NusA)—Target protein His6 ( MBP)—Target protein (a) Express in ^ E. coli HÍS6 GST Target protein His6 ( Trx j— Target protein Protease cleavage site His6(NusA His6(MBP His6 ( Trx Target protein Target protein Target protein Hts6 MBP (e) Target protein IMAC FT SUCCESS Target protein Good cleavage His6 0 (b) Purify by IMAC Cleave with protease Poor \ cleavage Target protein Target protein (c) His6 (NusA]—Target protein His6 NusA Target protein Dead end: protein insoluble after cleavage of tag Dead end: difficult to separate cleaved protein from fusion Current Opinion in Biotechnology Schematic representation of the pathway from protein expression to purification using solubility tags (Esposito and Chatterjee, 2006). Enhancing the solubility of recombinant proteins 19, 84, 215 - human proteins involved in cancer produced in E.coli 19 84 215 27 28 29 34 27 28 29 34 27 28 29 34 sisisisi sisisisi sisisisi Example of SDS PAGE with soluble (s) and insoluble (i) fractions following lysis. The results produced from the four different expression vectors (27: His tag only; 28: thioredoxin + His tag; 29: GST + His tag; 34: GB1 + His tag) are shown for three different target proteins (Hammarstrom et ah, 2006). Solubility-enhancing tags - the mechanism of action -The mechanism by which partners exert their solubilising function is not fully understood and possibly differs between fusion proteins Examples of possible mechanisms Maltose binding protein (MBP) has an intrinsic chaperone-like activity. MBP might bind reversibly to exposed hydrophobic regions of nascent target polypeptide, steering the polypeptides towards their native conformation by a chaperone like -mechanism. N-utilization substance (NusA) decreased translation rates by mediating transtriptional pausing, that might enable critical folding events to occur. MBP and N-utilization substance (NusA) attract chaperones. The fusion tag drives its partner protein into a chaperone-mediated folding pathway. MBP and NusA interact with GroEL in E. coli (Huang and Chuang, 1999). Small ubiquitin related modifier (SUMO) promotes the proper folding and solubility of its target proteins possibly by exerting chaperoning effects in a similar mechanism to the described for its structural homolog Ubiquitin (Ub; Khorasanizadeh et al., 1996). Negative charged tags (highly acidic peptide) inhibit aggregation by increasing electrostatic repulsion between nascent polypepdides (Zhang et. 2004). Solubility-enhancing tag - mechanism of action Thioredoxin >Its intrinsic oxido-reductase activity is responsible for the reduction of disulfide bonds through thio-disulfide exchange > Thioredoxin serves as a covalently joined molecular chaperone independently of redox activity. Thioredoxin may, thus, act to prevent the aggregation and precipitation of fused nascent proteins, giving them an extended opportunity to adopt their correct tertiary folds. Proposed mechanism of thioredoxin-catalyzed protein disulfide reduction. Reduced thioredoxin [Trx-(SH)2] binds to a target protein via its hydrophobic surface area. Nucleophilic attack by the thiolate of Cys32 results in formation of a transient mixed disulfide, which is followed by nucleophilic attack of the deprotonated Cys35 generating Trx-S2 and the reduced protein. Conformation changes in thioredoxin and the target protein occur during the reaction. Holmgren, 1995; Berndt et al. 2006 In vitro solubility-enhancing tags Short peptide tags Poly-Lys tag, poly-Arg tag = one, three and five lysine or arginine residues fused to the C- or N-terminus of the target protein Solubility as defined here is the maximum protein concentration of the supernatant after centrifugation of the supersaturated protein sample (in vitro solubility). Nhfe ^2 NH Cht 1 Cht I Cht 1 1 Cht Cht 1 J Cht Cht HjN+—C —CQz 1 HjN+—C —CCfc-1 1 H 1 H Arginine (R) Lysine (K) BPTI-22 = bovine pancreatic trypsin inhibitor variant containing 22 alanines The solubilization factor is defined as the molar ratio between the solubility of tagged BPTI-22 variants and that of the reference BPT-22 molecule. ->— ■»->-> N- C- N- C- terminus The solubilization effect of poly-Lys tags is lower than that of poly-Arg tags (lysines are less hydrophilic than arginines). Kato et al., 2006 Biochemical properties of poly-Arg and poly- Lys tagged BPTI-22 protein Protein Solubility Protein Cone. |mM| (Cone. | mg/ml | )* Solubilization Factor*1 T„ VC) Rel. Trypsin Inhibitory" Activity (%)c BPTI-22 1.70 (10.00) 3ÍU XIK 1.70 (10.40) 1.00(1.04) 35.2 1.05 -N3K l.bb • hi.o7. 136(2.00) 34.4 1.04 K5K 537 (35.60) 3.16(3.56) 34.3 1.05 -C1K 1.79 (10.95) 1.05(1.10) 34.6 1.05 -C3K: 2.41 (15.23) 1.42(1.53) 36.2 1.05 ^^H^5) 35.0 1.02 ^BstTBS) ^5*5™ T^2™ -N3R 2.70 (17.23) 1.59 (1.72) :\h 0.99 N5R 6.20 (41.11) 3.65(4.11) 353 0.99 -C1R til (11.07) 1.06(1.11) 35.0 1.05 -OR 3.02 (19.26) 1.73(1.93) 34.4 1.05 -C5R 3.23 (543É) 4^4(5.46) 34.3 1.03 C6R 10.59(73.41) 6.22 (7.34) 32.7 1.1 BPTI 22,J 5.63 (33.11) 3.31 (3.31) 1.09 B BPTI-22' 2.01 (11.32) 1.1 ft (MS) ND1 NA* _ The addition of 0.5 M Arg barely increased its solubility, and trypsin activity was inhibited by the high arginine concentration. On the other hand, addition of 50 mM Arg+Glu was more effective and increased protein solubility more than threefold. Protein solubi lit y was deter mined as [lie rnuimum super natan t concentration at a supersaturated protein solution at 4:'C in 10(1 m M acetat e bu tier pi! 4.7. 1 Maiimum cunrmtrations calculated in milligrams per milliliter are Lndi4.~at.-nJ in parenthesis. The Mw of BPTI-22, -NIK and -OK, -N3Kand -C3K, -N5Kand-C5K. -NIR and-ClR,-N3R and-C3R,-N5R and-C5R, and -C*R are, restively: 5880,6123,6379, 6636.6151,6463,6776, and 6932 Da. 81 Calculated as the ratio between the m-nlar protein solubility of BPTI-22 and that of tagged BPTI-22. Values En parenthesis indicate tlte ratio calculated in r.n illig rams per mi Hi liter h. " Relative trypsin inhibitory activity calculated as tlie ratin between tlie activity of BPTI-22 and that of tagged BPTI-22. BPTI-22,which Lacks R39, an arginine residue involved in two hydrogen bonding interactions with the trypsin residue backbone,** has a reduced trypsin inhibitor activity corresponding to -60% of the wt-BPTI and BPTI-; 5,55] at stoichiometry and a protein concentration of 280 nM" So lubilit y i n the same buffer as above bu I with the addi tio n of 50- m M t-Arg +■ t-Glu, " The CD thermal melting curve Could not be deter mi ned due to the strong absorption of arginine and glutamic acid. f Protein solubility with !^00 mMArg-HCI added to the above buffer. * The trypsin activity could not be determined because the high arginine concentration inhibited trypsin activity. Kato et al., 2006 A) BPTI-22: RPAFCLEPPYA0PAKARIIRYFyNAAAQAAQAFVYQQAAAKRNNFA8AADALAACAAA B) (a) (b) FIGURE 3 Hydrophobic residues in BPTI-22. A: One letter amino add sequence of BPTI-22 with the hydrophobic residues (A,V,I,L,F,P) shown in green letters. B: Lett, BITl-22 ribbon model with a-helices colored red and /^-strands colored blue. Right, surface representation of BPTI-22 with the hydrophobic area determined as low electrostatic potential regions according to MOL-MOL,35 colored green. The molecule is oriented with the ^-sheet pointing to the back in (a) and to the front in (b). The N- and C-termini are labeled "NT and "C,* respectively. The C-terminal end is located on the same face as a large hydrophobic patch shown in green, whereas the N-terminal end is on the opposite side of the molecule and is shown with a light gray letter "NT in panel (b). > The solubilization factor of all C-terminal tags was slightly higher than that of the respective N-terminal tags. > The C-terminus of BPTI-22 is close to a large hydrophobic patch, whereas the N-terminus is located on the opposite side of the molecule, away from the hydrophobic patch. > Charged residues seem to act through repulsive electrostatic interaction and thus hamper intermolecular interaction arising from the hydrophobic cluster. Kato et al, 2006 Solubility-enhancing tags - comparison of peptide and protein tags, conclusions > Protein tags are inherently large and need to be correctly folded in order to enhance solubility. > Protein tags are often natural affinity tags. > Peptide tags are small, and, importantly, they do not need to be folded, which provides a significant advantage over protein tags. > The use of small tags (< 30 amino acids long) does not increase protein size substantially and reduces steric hindrance, which simplifies downstream structural and functional applications without the need to remove the tag. > The solubilization enhancement effect depends on the size of the target protein. Solubility enhancement of fusion partners such as thioredoxin, GB1 is less pronounced for larger target proteins (above 25 kDa). MANY TAGS SUFFER FROM THE SAME PROBLEM - THEY DO NOT FUNCTION EQUALLY WELL WITH ALL TARGET PROTEINS. Otázka 2: Který tag/kotvu by jste využily pro zvýšení rozpustnosti proteinu bohatého na cysteiny? Removal of fusion tags- the Achilles' heel of the fusion approach All tags, whether small or large, have the potential to interfere with the biological activity of a protein, impede its crystallization (presumably due to the conformational heterogeneity allowed by the flexible linker region), be too large for NMR analysis, cause a therapeutic protein to become immunogenic or otherwise influence the target protein's behavior. The fusion tags can be removed by: > Chemical cleavage > Self - cleavage > Enzymatic cleavage Removal of fusion tags - chemical cleavage > Rarely used. Cyanogen bromide Met-X Hydroxylamine Asn-Gly 1 1 MRGSHHHHHH M12 M15 / / GMASMEKNNQ M28V / GNGQGHNVPN 40 1 DPNRNVDENA NANSAVKNNN NEEPSDKHIK EYLNKIQNSL STEWSPCSVT M105V CGNGIQVRIK PGSANKPKDE LDYANDIEKK ICKVEKCS Amino - acid sequence of the P. falciparum C-terminal segment of CSP (PfCSP C-ter) fused to a purification tag (Rais-Beghdadi et al, 1998). Chemical cleavage is a harsh method, efficient, but rather non-specific and may lead to unnecesary denaturation or modification of the target protein. Removal of fusion tags - self - cleaving > Use of self-cleaving fusion tags 1. Inteins N-Extern Ink-in I i... i DNA: RNA: transcription translation precursor protein Protein; Protein: Intuit ^ protein splicing ^0 %J mř COOH ejrcjsfid intern Inteins (/wtervening prote/wsJ are protein segments that can excise themselves from protein precursors in which the are inserted and rejoin the flanking regions. > Self - splicing inteins can be mutated at the N- or C- terminal splice junction to yield self cleaving inteins, which can be used to mediate self cleaving of various tags. Perler, (2005) 1. Inteins His-Tai; 0 Cys N-S Acyl Shift AcNH —Thymosin im cys Induced by Temp upshift or DTT/2-ME II s nte i n J—HÍS-T:l 2 Trans-esterification reaction - formation of thioester between thiol and C- term amino acid of target protein. This product is not stable and easily hydrolyze to release target protein and tag. ■His-Tns AcNH— Th ymosin—cooh > Two categories of inteins: - inteins with pH-induced C-terminal cleaving activity - inteins with thiol-induced N- and C-terminal cleaving activity A pH intein N ^ -pH 6.0-6.5 ^ a n ^ ^ ^■ül^ 20-25X, 16 h -^^^ ^■■tü^ ^■k^ 15-30 mM thiol ^■rnpnÄ-C Thiol intein A Tag -m + C- Thiol intein A^ Removal of fusion tags - self - cleaving fusion tag 2. HHHHHH—I SrlAft».2iw> —LPXTGH target protein System based on the catalytic domain of Staphylococcus aureus sortase A (SrtA). SrtA cleaves the Thr-Gly bond at the conserved LPXTG motif in the substrates. Cleavage is inducible by adding calcium (cofactor of SrtA). 3. I Nlpro) |—CX— target protein N-terminal protease (Npro) is the first protein of the pestivirus polyprotein. It posesses autoproteolytic activity and catalyzes the cleavage by switching from chaotropic to cosmotropic conditions. 1—["oTlisorCBl) 4. target protein I—D P— SPM FrpC modul (from G+ bacteria Neisseria meningitides): FrpC protein undergoes calcium - inducible autocatalytic proccesing at the peptide bond between residues Asp and Pro. Cleavage reaction is catalyzed by a self proccessing modul (SPM). 5. target protein —VDAL ADGK— CPD |—HHHHHH Vibrio cholerae secretes a large multifunctional autoprocessing repeats-in-toxin (MARTX) toxin that undergoes proteolytic cleavage during translocation into host cells. Proteolysis of the toxin is mediated by a conserved internal cystein protease domain (CPD), which is activated upon binding of inositol hexakisphosphate. (Li, 2011) Removal of fusion tags - self - cleaving fusion tag Inteins (1) > Uncontrolled in vivo cleavage or in complete in vitro cleavage > Target protein modification - pH or thiols can modify the target protein > Protein compatibility with cleaving conditions - pH induced inteins > Compared to the traditional protease based method, the intein-based approach requires fewer steps and lower costs. Other system (2-5) > Tested on limited number of cases Table 3 Cjcneral features of the live self-cleavage fusion systems discussed in the text Self- MW Purification Cleavage Advantages Disadvantages cleaving (kDa) tag condition tag Intein 51: CBD.CBM. Thiols; pH Ilexible fusion and cleavage options: Lack of solubility-enhancing capacity: in 22' phasin. and/ or allowing generation of target protein vivo cleavage: incomplete cleavage: 17: 15" ELP temperature shift with native sequence miseleavage SrtA 17 His-tag. biotin 5 mM Cai+ Potential of enhancing target protein expression and solubility In vivo cleavage: incomplete cleavage: introduction of an extra Gly residue to the .V-iemiinus of the target protein V" 14 His-tag Kos inotropic conditions Allowing generation of target protein with native sequence Limited to proteins capable of refolding: in vivo cleavage; incomplete cleavage: long cleavage time FrpC 26 His-tag, CBD 10 inM Ca2+ Efficient and tightly controlled cleavage: insensitive to protease inhibitors Lack of solubility-enhancing capacity: introduction of an extra Asp residue to the C'-ierminus of the target protein; single C'-terminal fusion option CPD 23 His-tag 50-100 uM InsPfi Potential of enhancing target protein expression and solubility: efficient and tightly controlled cleavage; insensitive to protease inhibitors Introduction of up to four non-native residues to the C'-ierminus of the target protein: single C-tcrniinal fusion option a Molecular weight of the self-cleaving tag '' Inteins with different sizes are available Li, 2011 Removal of fusion tags - enzymatic cleavage Cleavage site 1-- _-„ Target 4-37°C, time varies Target Site-specific proteolytic cleavage: > Exopeptidases > Endopeptidases Exopeptidases (aminopeptidases and carboxypeptidases): DAPase (TAGZyme) Exotdi (peptidase l leave- N-kamin,il Hi--i;i'_ K -terminal 1 foi purification and removal . len -a,i- amine-peptidase Exopeptidase Cleaves N-terminal. effective on M. L. RequiresZn Aminopeplidase M Exopeptidase Cleaves N-terminal. does not cleave X-P Carbox\peptidase A Exopeptidase Cleaves C-teiminal. No cleavage at X-R, P Carboxypeptidase B Exopeptidase Cleaves C-terminal R, K > APM, CPA and CPB release sequentially a single amino-acid from the N- or C- terminus of a protein until the stop site is reached. TAGZyme system (Qiagen): > DAPase (dipeptidyl aminopeptidase I) TAGZyme stop points Amino acid DAPase slop point [i) sequence' Lysine (Lys. K'i XaaXaa. ..Xaa-Xaa X Lys-Xaa ... Arginire (Ajg. R) XmXm Xii"n i Arq-Xaa ... Roline (Pro. P) X.i i >:..i..i X.i.i K.i : i Km Kaa R -Xaa . FVoline (Pro. P) Xaa-Xaa...Xaa-Xaa 1 Xaa-Pro Xaa-Xaa... GlUamine (Gin. Oj' Xaa-Xaa. ..Xaa-Xaa J. Gln-Xaa... DAPase cleavage DAPase slop _1 1» * MK HQ HQ HQ HQ HHP SDK HT-Trx HHP-Trx M 1234567 Removal of fusion tags - enzymatic cleavage Endopeptidases > The enzymatic cleavage site has to be placed between the fusion tag and the target protein. Enzyme Cleavage sice Comments Enterokinase DDDDK* Secondary sites at other basic aa Factor Xa IDGR* Secondary sites at GR Thrombin LVPR*CS Secondary sites. Bio tin labeled for removal of the protease Pre Scission LEVLFQ'CP GST tag for removal of the protease TEV protease EQLYFC/C His-tag for removal of the protease 3C protease ETLFQ'GP GST tag for removal of the protease Sortase A LPET'G Ca2+-induction of cleavage, requires an additional affinity tag (e.g., his-tag) for on column tag removal Granzyme B d*x,nTx. m'n.s'x Serine protease. Risk for unspecifrc cleavage , Protease site Hisc t {MBP r Target protein Enterokinase Asp-Asp-Asp-Asp-Lys/X Table 4 Cleavage (fi i o f enterokinase through densitometry (HosJ'ield and Lu 1999) based on the amino acid residue X\. The sequence....-GSDYKDDDDK-Xi-ADQLTEEQIA-... of a GST-cal- modulin fusion protein was tested using 5 mg protein digested with 0.2 Uof enterokinase for 16 h at 37 C Amino acid in position Xj Cleavage of enterokinase (%) Alanine 88 Methionine 86 Lysine 85 Leucine 85 Asparagine 85 Phenylalanine 85 Isoleueine 84 Aspariic acid 84 Glutamic acid 80 ( rluLlllii k 79 \'ali:k- 79 Arginine 78 Threonine 78 Tyrosine 78 Hislidine 76 Serine 76 Cysteine 74 Glycine 74 Tryptophan 67 Proline 61 Removal of fusion tags - enzymatic cleavage A critical review of the methods for cleavage of fusion proteins with thrombin and factor Xa Richard J. Jenny,"'* Kenneth G. Mann." and Roger L. Lundbladc-d J Haemutologic Ttrhtiologkw liii;, Esse.x Junethit, VT. USA 15 Depart Wat of Biochemistry, University of Vermont. Burlington. VT, USA c Department of Pathology, University of North Carolina, Chapel Ilifi. NC, USA d Roger LLmdhlad, LLC. Chapel Hill. SC. USA Received 27 February 2003. and in revised form 7 May 201)3 The purpose of this review was to demostrate that both thrombin and factor Xa can hydrolyze variety of peptide bonds within the fused proteins of interest. Sequences cleaved by thrombin in polypeptide hormones Polypeptide horn ones* Sequence cleaved Secretin ELSLSRLRDSA Secretin ELSLSRLR (much slower than above) Vasoactive intestine polypeptide DNYTRLRK Vasoactive intestine polypeptide YTRLRKQM Choleocystokinin APSGRVSM Choleocystokinin VSMIKNLQ Dynorphin A RIRPK.LKW Somatostatin-28 AMAPRERK Somatostatin-28 NFFWKTFT Gastrin releasing peptide KMYPRCJNH Salmon calcitonin QTYPRTNT aThe reaction mixtures contained 0.5 NIH units thrombin and l.Onmol peptide in 20 uL of 50mM NH4CO3, pH 8.0, at 25°C. The conditions were designed to obtain an enzyme/substrate ratio of 1:60 (w/w). Protein ^Expression ^Purification Accuracy of cleavage has to be precisely verified! pRSETB::AHP2 Enterokinase cleavage site N^GSHHHHHHGMASMTGGQQMGRDLYDDDDK^)PSSRSAAGTMEFMDALIA....................GIVPQVDIN C Theoretically: 3,4 kDa 18,9 kDa Intact mass spectrometry analysis a.i. 1000 800 600 400 200 40 4 1 522 1 4923 5897 1 963 -1- AHP2 enterokinase 134B6 110 3 1 -1- 11175 -1- 1 1 D 3 9 1 8256 -1- _J_ AHP2 control 2 2 337 AHP2 standard *~ 3894 1 1 1 67 ' 4053 1 1 0 3 4 tm.......... ill_- _l 2000 ' 4000 ' 6000 ' 800o' ' 10000 ' 12000 ' 14000 ' 16000 ' 18000 ' 20000 ' 22000 in/z SUMO gene fusion system SUMO protease recognizes the tertiary structure of SUMO rather than an amino acid sequence. As a result, SUMO protease never cleaves within the fused protein of interest! Removal of fusion tags - enzymatic cleavage > Optimization of protein cleavage conditions (mainly enzyme-to-substrate ratio, temperature, pH, salt concentration, length of exposure). > Cleavage efficiency (Optimization is needed. The efficiency varies with each fusion protein in an unpredictable manner, probably due to aggregation or steric issues; the problem can be solved by introducing short linkers between the protease site and the fusion tag). > Unspecific cleavage (SOLUTION: optimization of protein cleavage conditions or using re-engineered proteases with increased specificity such as ProTEV and AcTEV proteases). > Precipitation of the target protein when the fusion partner is removed (so-called soluble aggregates; SOLUTION: another approach for protein solubilization has to be found). > Failure to recover active or structurally intact protein > Target protein modification (some proteases like thrombin, TEV, Precision leave one or two amino-acids on the target protein near the cleavage site). > Re-purification step is needed to separate the protease from target protein. > High cost of proteases The alternative is to leave the tag in place for structural analysis: The small tags are a better choice in structural and functional analysis of proteins. Otázka 3: Jaký je rozdíl mezi inteinem a samo-vyštěpujícím tágem odvozeným od inteinu? Affinity chromatography (AC) > A type of adsorption chromatography, in which the molecule to be purified is specifically and reversibly adsorbed to a complementary binding substance (ligand, L) immobilized on an insoluble support (matrix, M). Affinity tag Protein of interest Sepharose Affinity tag binding partner Immoblized binding partner of Affinity tag fused to N- or C-terminus affinity tag of protein TPEG (substrate analogue of p-galactosidase p-galactosidase) Glutathione Giutathione-S-Transferase Immunoglobulin G Protein A Cu II, Co 11 or Ni 11 poly His or poly Cys > AC has a concentrating effect, the high selectivity of separations derived from the natural specificities of the interacting molecules. > AC can be used (1) to purify substances from complex biological mixtures, (2) to separate native forms from denatured forms of the same substance, and (3) to remove small amounts of biological material from large amounts of contaminating substances, (4) and to isolate protein complexes from the native source. > the first application was in 1910 (adsorption of amylase onto insoluble starch) but it developed during the 1960s and 1970s. Affinity tags and affinity purification Alfinitytag Protein of interest Sepharose Affinity tag binding partner Immoblized binding partner of Affinity tag fused to N- or C-terminus affinity tag of protein TPEG (substrate analogue of p-galactosidase (3-galaclosidase) Glutathione Glu lathi one-S-Transferase Immunoglobulin G Protein A Cu II» Co II or Ni II poly His or poly Cys Table 2 Sequence and size of affinity tags Tag Re si dues Sequence Size (kDa) Poly-Arg 5-6 RRRRR 0.80 (usually 5) Poly-His 2-10 HHHHHH 0.84 (usually 6) FLAG 8 DYKDDDDK 1.01 Strep-tag II 8 WSHPQFEK 1.06 c-myc 1 EQKL1SEEDL 1.20 S 15 KETAAAKFERQHMDS 1.75 HAT- lJ KDHLJHNVHKEFHAHAHNK 2.31 3x FLAG 22 DYKDHDGDYKDHD1DYKDDDDK 2.73 Calmodulin-binding peptide 26 KRRWKKNF1AVSAANRFKK1SSSGAL 2.96 Cellulose-binding domains 27-1ÍÍ9 Domains 3.00- 20.00 SBP 38 M D EKTTGWRGGH V V EGL AGELEQ LRARLEHH PQGQREP 4.03 Chitin-binding domain 51 TNPGVSAWQ VNTAYTAGQL VTYNGKT Y KC LQ PHT S L AGWE PS N V PA LW Q LQ 5.59 Glutathione S-transferase 211 Protein 26.00 Ma 11 ose- bi ndi ng protei n 396 Protein 40.00 A tag is fused to the N- or C-terminus of the protein of interest to facilitate purification, which relies on a specific interaction between the affinity tag and its immobilized binding partner. Genetically engineered fusion tags allow the purification of virtually any protein without any prior knowledge of its biochemical properties. Purification tags Affinity tags Affinity tag Matrix Polv-Arg Polv-His FLAG Strep-tag II e-myc S HAT (natural histidiire affinity tag) C al modu li n-bi ndi ng pe pi i do Cellu lose-binding domain SBP Chi tin-binding domain Glutathione S-transferase Maltose-binding protein C at i o n-e xe ban ge re si n Ni2+-NTA, Co*+-CMA (Talon) Anti-FLAG monoclonal antibody Strep-Tactin (modified streptavidin) Monoclonal antibody S-fragment of RNaseA Co2+-CMA (Talon) Calmodulin Cellulose Streptavidin Chi tin Glutathione Cross-linked amy lose Non - chromatographic tags Tag_Matrix ELP PHB annexin B1 None Intracellular PHA granules None > These tags can eliminate affinity resin. Proteins are isolated by other non-chromatographic methods (centrifugation, filtration) > typically combined with self-cleaving tags > 75 % - 95 % purity > Traditional purification tags > The tag binds strongly and selectively to an immobilized ligand on a solid support. > After optimization one could achieve > 90% purity. Additional separation step + DTT T 1um O 1 ,„ , / ^ \ salt addition Affinity tag - mtein - target protein * Centrifugation a Pliasin - intein - target protein s v»-'--B^Hi ELP - intern - target protein Purification tags Non - chromatographic tags + DTT 1pm mv 42 Mild heating and/or ^ salt addition 4 ^ Centrifugation The PHB system (c): > PHB (polyhydroxybutarate): subclass of biodegradable polymers produced in various organisms, use as storing excess carbon. > The system includes in vivo production of PHB small granules (from the plasmid carrying PHB-synthesis genes). > Target protein in fusion to self cleaving phasin tag. > Tagged protein binds to the PHB particles via phasin tag, which allows the granules and the tagged protein to be co-purified via centrifugation. Phdiin - intein - target protein ELP - intein - target protein > DTT induced cleaving activity of intein and thus elution of the target protein. The ELP system (d): > ELP (elastin-like polypeptide) selectively and reversibly precipitates in response to changes in temperature and buffer salts. This allows soluble and insoluble contaminants to be removed by filtration or centrifugation. Components of a matrix for affinity chromatography H Ni2+ NTA sepharose A ligand > The dissociation constant (Kd) for the ligand - target complex should ideally be in the range 10~4to 10~8 M in free solution to allow efficient elution under conditions which will maintain protein stability. > A ligand has to be attached to the matrix with a suitable chemically reactive group. The mode of attachment must not compromise the reversible interaction between the ligand and protein. Components of a matrix for affinity chromatography A matrix > Typically, a macroporous polysaccharide bead such as agarose, that provides a porous structure so that there is an increased surface area to which the target molecule can bind. > A matrix has a suitable attachment site for the ligand. Typically matrices are chemically activated to permit the coupling of the ligand. A number of activation methods are available which depend on the nature of the matrix and the availability of compatible reactive groups on the ligand. 36 Components of a matrix for affinity chromatography Ni2+ NTA sepharose Spacer arm > A spacer arm will be required in cases where direct coupling of the ligand to the matrix results in steric hindrance and subsequently the target protein will fail to bind to the immobilized ligand efficiently. The introduction of a spacer arm between the ligand and the matrix minimizes this steric effect and promotes optimal adsorption of the target protein to the immobilized ligand. 37 Typical affinity purification steps equilibration- adsorption of sample and elution of unbound material wash away unbound material elute ' bound -protein(s) re-equilibration Column Volumes (cv) > In the equilibration phase, buffer conditions are optimized to ensure that the target molecules interact effectively with the ligand and are retained by the affinity medium as all other molecules wash through the column. > During the washing step, buffer conditions are created that wash unbound substances from the column without eluting the target molecules or that re-equilibrate the column back to the starting conditions (in most cases the binding buffer is used as a wash buffer). > In the elution step, buffer conditions are changed to reverse (weaken) the interaction between the target molecules and the ligand so that the target molecules can be eluted from the column. Affinity chromatography - Immobilized metal ion affinity chromatography (IMAC) > The most common purification tag is typically composed of six consecutive histidine residues. > Histidine, cysteine, and tryptophan residues are known to interact specifically with divalent transient metal ions such as Ni2+, Cu2+, Co2+, and Zn2+. > Histidine is the amino acid that exhibits the strongest interaction with immobilized metal ion matrices as the electron donor groups on the histidine imidazole ring readily form coordination bonds with an immobilized transition metal. \ L Í— CH—CH. II- M (kDa) 170 • 116 ■ 86 ■ 56 ■ Binding strength of His tag to metal ions: Cu2+ > Ni2+ > Zn2+ ~ Co2+ 27 20 -(HisJ^Zm-peO.r Zn2+ Ni2+ Co2+ Cu2+ (Zouhar et al, 1999) > IMAC can be used under native and/or denatured conditions. > A highly purified protein can often be obtained in one or, at most, two purification steps. His-tagged protein and IMAC under native conditions > Optimal binding of recombinant protein with metal ion is achieved at pH 7-8. > Buffers with a high salt concentration (0.5-1 M NaCl) reduce nonspecific electrostatic interaction. > Nonionic detergents or glycerol reduce nonspecific hydrophobic interactions. > Elution of contaminating proteins can be achieved by lowering the pH or using low concentrations of imidazole. > Elution of tagged protein is achieved at high imidazole concentrations (0-0.5 M), by strongly decreasing the pH, or by using EDTA. Immobilized metal affinity chromatography His protease cleavage sue charged metal chelate resin I pH j [imidazole] 1 [EDTA] ! protease -i + + + + I* < ^te^ Ü + i> -\J + --.....ť + His-tagged protein and IMAC under native conditions One-step purification of maize ß-glucosidase > Perfusion matrix: POROS MC/M > Functional group: iminodiacetate, metal ion Zn2+ > Removing contaminated proteins: linear gradient of imidazole (0-50 mM) and pH (pH 7-6.1) > Protein elution: 0.1 M EDTA > 80% recovery, 95 fold purification > Common production and isolation of the wild type protein and soluble mutant form for enzymatic measurements and crystallization. A B C D (Zouhar et at., 1999) His-tagged protein and IMAC under denatured conditions - Purification of proteins expressed in inclusion bodies. - Purification in a high concentration of urea or guanidine chloride. - Result is a pure protein, but in a denatured form (sufficient for immunization). Recovery of native conformers (necessary for functional and structural analysis): > Binding to the column under strong denaturing conditions (8 M urea) > Two possibilities of renaturation: 1. The protein is eluted from the column and renatured by dialysis or rapid dilution in renaturing buffers. 2. Renaturation of the protein bounded to the column (matrix assisted refolding procedure): gradient from denatured to renatured buffers or pulsion renaturation (8-OM urea). Identification of properly refolded (His)6Zm-p60.1 (maize (3-glucosidase) using 10% native PAGE, followed by activity in gel staining: A = crude protein extract prepared from maize seedlings containing the native enzyme B = (His)6Zm-p60.1, renatured product (matrix assisted refolding procedure - 23 renaturing cycles) C = (His)6Zm-p60.1 purified by native IMAC KM (His)6Zm-p60.1 purified by native IMAC: 0.64 ± 0.06 mM KM (His)6Zm-p60.1 renatured product: 0.6 ± 0.08 mM Determination of v^ and kcat was hampered by the fact that the refolding process yielded a number of improperly folded polypeptides. Zm-p60.1/ /(His)eZm-p60.r B (Zouhar et al, 1999) His-tagged protein and IMAC under native conditions Two-step purification of Arabidopsis histidine phosphotransfer protein 5 > IMAC matrix: highly cross-linked spherical agarose (His)6AHP5 > Functional group: nitrilotriacetic acid, metal ion Ni2+ > Removing contaminated proteins: linear gradient of imidazole (20-500 mM) > Protein elution: 130 mM imidazol > Common production and isolation of the wild type protein for protein-protein interaction measurements and crystallization. 1st step - metal chelate affinity chromatography 2nd step - gel filtration / \ bJ.j^LiiJji1^Jjl1jJjiii1mJju1.^ í \ After ultrafiltration Crystallization Purity: 96% Concentration of the protein: 22 mg/ml His-tagged protein and IMAC under native conditions Four-step purification of Arabidopsis CKI1RD 1. Affinity purification (IMAC) 2. Tag removal (TEV protease) 3. Affinity purification (IMAC) 4. Size exclusion chromatography Ub- SGSG- ■HisTaq- SA- TEV- AME-( :kii 3. Affinity purification after TEV cleavege 200 mM imidazole CKI1 RD 710 cv ApETM-60 pETM-60::CKIlRD 20 mM imidaz. \ ... pETM-60::CKILRD after cleavege CKI \_ RD pETM-60 4. Size-exclusion chromatography 1 L-> -10-20 mg forTB and M9 Pekařova B. Otázka č.4: Jakými metodami se izolují proteiny fúzované s nechromatografickými tagy/kotvami? Affinity purification for studying protein-protein interaction > Affinity purification provides a high-efficiency method for isolation of interacting proteins and protein complexes: > Co-immunoprecipitation > GST (or His) pull-down > Tandem affinity purification > Testing known protein-protein interaction. > Identification of novel protein-protein interactions. Co-immunoprecipitation (Co-IP) > The principle: If protein X is immunoprecipitated with an antibody of X, then protein Y, which is stably associated with X in vivo, may also be precipitated. This precipitation of protein Y, based on a physical interaction with X, is referred to as co-immunoprecipitation. > An obvious advantage is that complexes are isolated in the state closest to the physiological condition. > When a good quality antibody of X is available, Co-IP is a fast method and there is no need to clone and express the component(s) of the complex. 1. Cell lysis under mild conditions that do not disrupt protein-protein interactions (using low salt concentrations, non-ionic detergents, protease inhibitors, phosphatase inhibitors). 2. The protein of interest (X) is specifically immunoprecipitated from the cell extracts (using an antibody specific to the protein of interest or to its fusion tag). 3. The antibody-protein(s) complex is then pelleted usually using protein-A or G sepharose, which binds most antibodies . 4. Eluted immunoprecipitates are then fractionated by SDS-PAGE. 5. A protein of known identity is most commonly detected by performing a western blot. Identification of novel interaction is carried out by mass spectrometry analysis. Pull-down assay > Pull-down assays are a common variation of co-immunoprecipitation and are used in the same way, but pull down does not involve using an antibody specific to the target protein being studied. > They are used for purification of multiprotein complexes in vitro. > The target protein is expressed in E. coli as GST fusion and immobilized on glutathione-sepharose beads (GST alone is often used as a control). > Cellular lysate is applied to the beads or column, and the target protein competes with the endogenous protein for interacting proteins, forming complexes in vitro. > Centrifugation is used to collect the GST fusion probe protein and adhering proteins. > The complexes are washed to remove nonspecifically adhering proteins. Stspl, Immobilize the luswn-tagged h)~.; i:i ys ih: i ŕ.-.J'í1h Sil JhlFnihfLiglťiS FSjitřrjUlr-Cjntíiiinj LflM* Stĺp 4, Wash away unbo-und protein. Slep í. Wasti away unbound protein, Slep 6. Elule protein:priteč interaction complex. T yiUHH ^ r u % *_i.r*i HíplieMlliTÍ*fiClinflCliinp(pji Slep 3. Bind "prev" protein to iíiiniřt>iliíed "bail protein, L]W1 Slsp 6. Anaryif proiein nrotein interaction com ptat on SDS-PAGE. M*kb PurFrf ft-lird Dli G« MrndAiktdu > Free glutathione is used to elute the complexes from the beads, or alternatively the beads with attached complexes are boiled directly in an SDS-PAGE sample buffer. > The proteins are resolved on SDS-PAGE and processed for further analysis. Tandem affinity purification (TAP) Two-step purification strategy in order to achieve higher purity of isolated multiprotein complexes under near physiological conditions. This method was originally developed for use in yeast and quickly adapted to higher eukaryotes such as insect cells, human cells and plant cells. Examples of TAP (tandem affinity peptides) tags TAP tag: a double affinity tag (highly specific) which is fused to a protein of interest as an efficient tool for purification of native protein complexes. Ctlrodufo binding orptlde OP) TEV protease cleavage she Protein A IgG-bioding domain Tap-:»gg*d pcrvpefXide etr rü ssc d near pnytfoioglul lewis PROTEIN-PROTEIN INTERACTIONS Cel r/íij in mild condit am Affinity errwrutography SDS*I i Reverie cross! nks PuriryDNA PCRwthjpccrfic primers Pulsaru labH DNA ••••••••• Mass Spccvomciry Single Locus PCR Genomic iequexei l lieg u awry and tranicrfeed) Location DNA Microwray (a) (b) Protein A - Protein A - TEV CBP 20.7 kDa Protein G Protein G - TEV SBP 18.8 kDa (c) FLAG ! Strepll 4.6 kDa (d) Biotin signal 6xHis 9.6 kDa Colins knd Choudhary, 2008 Current Opinion in Biotechnology Tandem affinity purification Gene of Interest CBP TEV Site IgG BD Expression and purification from a culture. ^ ^Ajfinity Column 1: IgQ Beads ~ IgGBead >>= \ ^ ^ Cleavage with TEV Protease | Column 2: Calmodulin Beads A Ca 2) Ca 5 * Ca 4 \ Co2* Chelation, Elution j Sepai ■ation 61 Band Excision And Digestion *7 I Peptide Separation ' \ A mi- A nd MS A nalys is 1. Protein 2. A 3. B 4. C 5. D -v^T" vLeene et ai, 2007 (c) Lower background and higher complex yield with GS tag compared to TAP tag 0 I Ilm is 16 10 13 (Chepelev et al. 2008) Affinity purification for studying protein-protein interaction > An affinity tags can influence protein-protein interactions (testing N- and C-terminal fusions). > Loss of weak or transient protein-protein interactions. > Non-specificity: controls, affinity tags with higher specificity > Verification of newly identified interactors by other methods and biologically relevant mutants. Comparison of a standard purification process with affinity purification > Generally, the yield and efficiency of any specific purification procedure depends on the level of optimization developed for individual proteins and the method. It is therefore recommended to use the data presented in different comparisons as indicative rather than definitive, which it is not e.g., identical elution conditions are optimal for different proteins. > Standard chromatographic methods include several steps to obtain a relative pure protein. This results in a time-consuming procedure and a relatively low yield of recovery (typically 50 % of the starting material for optimized processes). > The yields obtained in purification of proteins using affinity chromatography can be over 90 % and include a reduced number of steps. A M I 3 It M pi IAP Comparison of purification strategies for recombinant pGAP Purification step Total volume (ml) Activity (U/ml) Total activity (U) Yield (%) Slumlord purification f untagged pGAP) Cell extract 750 2.1) 1463 I (if) Phenyl-Sepharose HS 400 2.6 1040 71 Phenyl-Sepharose LS 160 5.6 888 61 Q Sepharose HP 57 10.3 587 4(1 IM'ACpurification ihis-tagpGAPj Cell extract 120 19.0 2280 KH) Ni-NTA Sepharose 84 26.0 2184 96 IITpGAE' Fig. 2. Comparison of purification strategies for recombinant pGAP. A B. amytoliquefacietis pyroglutamyl aminopeptidase (pGAP) was produced in E. co/p with and without an N-terminal his-tag (HT-pGAP, tag sequence: MEP(H)6L). For untagged pGAP, purification included ammonium sulfate precipitation and two consecutive separation steps using phenyl-Sepharose. Subsequently, a desalting step using a Sephadex G-25 F column and a final step using Q Sepharose HP were performed. For HT-pGAP, purification wils performed with a single IMAC step. (A) Standard purification of pGAP. Lane M: MWM (Novex); lane 1: cell extract; lane 2: supernatant fraction of"cell extract; lane 3: pool from first phenyl-Sepharose step: lane 4: pool from the second phenyl-Sepharose step; lane 5: pool after desalting; lanes 6-10: several fractions from Q Sepharose HP containing pGAP. (B) IMAC purification of HT-pGAP. Lane M: MWM; lane 1: cell extract; lane 2: supernatant fraction of the cell extract; lane 3: flow through fraction from the IMAC; lane 4: eluted HT-pGAP. See Table 2 for process yields. Arnau et ah, 2006