m IsoudoMasesuejmiuoJi CMd jugAgigpzA dO jNVAViiazA aroAzoa oa bdiiseani AAOHOAAOH31 V 3Z30V~lľ\l ■ 3INľl V^SdOHAS 'JA±S"1C»H§ OA1SU31SINIIAI X)|!iqndaj ^ssp wa^odzoj lujuj?^ f luapuoj. wiu|i?pos w^sdojAg u?A03ireiJ[jn|ods 9í ^sfojd ojuai BAOJldOQ BJfpe^J uopeDijuiid ajiuijjk pui? supjcud uoisn j Ľ □ o o □ o □ o □ o □ o □ o □ o □ UOTJBOTJUnd pUB UOISS9jdX9 UT9+1§f||s o □ o □ o □ o ODODODODODODO DODODODODODOD OüODOn^nOGOüO VllZ*GAINn VAO>IAWSVW I f|M| I 0868O :n48ujpajd po>| D O L O □ O □ Ü ^~ O □ o □ o □ OüODOüOnOüOn ODODODODODODO DODODODODODOD ODODODODODODO Fusion proteins (tagged proteins) Translation fusion of sequences coding a recombinant protein and a) short peptides [ex. (His)n, (Asp)n, (Arg)n ... ]. b) protein domains, entire proteins [ex. MBP, GST, thioredoxin ...]. Engineering a tagged protein requires adding the DNA encoding the tag to either the 5' or 3' end of the gene encoding the protein of interest to generate a single, recombinant protein with a tag at the N- or C-terminus. The stretch of amino acids containing a target cleavage sequence (CS) is included to allow selective removal of the tag. (i) 5' Promoter Tag Gene of interest 3' Terminator Transcribe and translate N [ Tag fused to the N-terminus of the protein of interest (Ü) 5 Promoter Gene of interest Tag 3 Terminator Transcribe and translate N nc Tag fused to the C-terminus of the protein of interest Expression plasmids containing various tags are commercially available. Purposes of fusion tags > Increasing the yield of recombinant proteins - Fusion of the N-terminus of the target protein to the C-terminus of a highly expressed fusion partner results in high level expression of the target protein. > Enhancing the solubility of recombinant proteins - Fusion of the N-terminus of the target protein to the C-terminus of a soluble fusion partner often improves the solubility of the target protein. > Improving detection - Fusion of the target protein to either terminus of a short peptide (epitope tag) or protein which is recognized by an antibody (Western blot analysis) or by biophysical methods (e.g. GFP by fluorescence) facilitates the detection of the resulting protein during expression or purification. > Localization - A tag, usually located on the N-terminus of the target protein, which acts as an address for sending a protein to a specific cellular compartment. > Facilitating the purification of recombinant proteins - Simple purification schemes have been developed for proteins used at either terminus which bind specifically to affinity resins. No single tag is ideally suited for all of these purposes. Fusion partner (tag) Size Tag placement Uses His-tag 6, 8, or 10 aa N- or C-terminus Purification, detection Thioredoxin 109 aa(11.7kDa) N- or C-terminus Purification, solubility enhancement Calmodulin-binding domain (CBD) 26 aa N- or C-terminus Purification Avidin/streptavidin Strep-tag 8 aa N- or C-terminus Purification, secretion Glutathione ^-transferase (GST) 26kDa N-terminus Purification, solubility enhancement Maltose binding protein (MBP) 396 aa (40 kDa) N- or C-terminus Purification, solubility enhancement Green fluorescent protein (GFP) 220 aa (27 kDa) N- or C-terminus Localization, detection, purification Poly-Arg 5-16 aa N- or C-terminus Purification, solubility enhancement N-utilization substance A (NusA) 495 aa (54.8 kDa) N-terminus Solubility enhancement Combinatorial tagging > No single tag is ideally suited for all purposes. Therefore, combinatorial tagging might be the only way to harness the full potential of tags in a high-throughput setting. Combinations: Solubility-enhancing tag + purification tag: MBP + His6tag 2x purification tag: IgG-binding domain + streptavidin-binding domain Detection tag + purification tag: GFP + His6 tag Detection tag + 2x purification tag + immunodetection: GFP + SBP domain + His8 tag + c-Myc Tar) Advantages Disadvantages GST Efficient translation High metabolic burden initiation Inexpensive affinity resin Homodimeric protein Mild elution conditions Does not enhance solubility MBP Efficient translation High metabolic burden initiation Inexpensive affinity resin Enhances solubility Mild elution conditions NusA Efficient translation High metabolic burden initiation Enhances solubility Not an affinity tag Thioredoxin Efficient translation Not an affinity tag11 initiation Enhances solubility Ubiquitin Efficient translation Not an affinity tag initiation Might enhance solubility FLAG Low metabolic burden Expensive affinity resin High specificity Harsh elution conditions BAP Low metabolic burden Expensive affinity resin Mild elution conditions Variable efficiency of enzymatic biotinylation Provides convenient means Co-purification of E. coli of immobilizing proteins in biotin carboxyl carrier a directed orientation protein on affinity resin Does not enhance solubility Hise Low metabolic burden Specificity of IMAC is not as high as other affinity methods Inexpensive affinity resin Mild elution conditions Tag works under both Does not enhance native and denaturing solubility conditions STREP Low metabolic burden Expensive affinity resin High specificity Does not enhance solubility Mild elution conditions SET Enhances solubility Not an affinity tag CBP Low metabolic burden Expensive affinity resin High specificity Does not enhance solubility Mild elution conditions Slag Low metabolic burden Expensive affinity resin High specificity Harsh elution conditions (oron-column cleavage] Does not enhance solubility Advantages and disadvantages of used fusion tags > Proteins do not naturally lend themselves to high-throughput analysis because of their diverse physiological properties. Affinity tags have become indispensable tools for structural and functional proteomics. X > Because affinity tags have the potential to interfere with structural and functional studies, provisions must also be made for removing them. "GST, glutathiones transferase; MBP.maltoso binding protein; NusA.lM utilization substance; A; FLAG. FLAG tag peptide; BAP, biotin acceptor peptide; HisK. hexahistidine tag; STREP, streptavidin binding peptide; SET, solubility enhancing tag; CBP, calmodulin binding peptide. bDerivatives of thioredoxin have been engineered to have affinity for immobilized metal ions (His patch thioredoxin) or avidin/streptavidin [381. ... . -i/i/ic Otázka č. 1: Jaké jsou důvody pro využívaní tagů/kotev? Vyjmenujte 3. Increasing the yield of recombinant proteins using fusion technology Yield enhancing tags are proteins and peptides which can be involved in: > Increasing the efficiency of translation initiation (e.g. GST, MBP, NusA...) - Advantage of N-terminal tags - Providing a reliable context for efficient translation initiation - Ribosome efficiently initiates translation at the N-terminal methionin of the tag - Deleterious secondary structures are more likely to occur in conjunction with short N-terminal tags because short RNA-RNA interactions tend to be more stable than long-range interactions. > Protection against proteolytic degradation - Several studies have shown that the nature of terminal residues in a protein can play a role in recognition and subsequent action by proteases and in some cases affinity tags might improve the yield of recombinant proteins by rendering them more resistant to intracellular proteolysis. > Helping to properly fold their partners leading to increased solubility of the target protein (in vivo and in vitro). Enhancing the solubility of recombinant proteins Solubility-enhancing tags - Advantage of N-terminal tags - Rather proteins (highly soluble proteins) than peptides -Fusion with a soluble fusion partner often helps to properly fold their fusion partners leading to improved solubility (in vivo and in vitro) of the target protein. -Fusion partners do not perform equally with all target proteins, and each target protein can be differentially affected by several fusion tags (Esposito and Chatterjee, 2006). - The choice of a fusion partner is still a trial-and-error experience. >PROTEINS Some commonly used solubility-enhancing fusion partners Tag Protein Source organism MBP Maltose binding protein Escherichia coli GST Glutathione-S-transferase Schistosoma japonicum Trx Thioredoxin Eschericfta coli NusA N Utilization substance Escherichia coli SUMO Small ubiquitin-moditier Homo sapiens SET Solubility-enhancing tag Synthetic DsbC Disulfide bond C Escherichia coli Step Seventeen kilodalton protein EschericNa coli T7PK Phage T7 protein kinase Bacteriophage T7 GB1 Protein G B1 domain Streptococcus sp. ZZ Protein A IgG ZZ repeat domain Staphylococcus aureus Adopted from Esposito and Chatterjee, 2006 > PEPTIDES Poly-Arg Poly-Lys Generate parallel expression clones Dead end: insolubility Protease cleavage site His6 MBP (e) Target protein IMAC I I SUCCESS Target protein Good cleavage (b) Purify by IMAC Cleave with protease Poor \ cleavage fid His6 NusA Target protein His6 NusA Target protein Dead end: protein insoluble after cleavage of tag Dead end: difficult to separate cleaved protein from fusion Current Opinion in Biotechnology Schematic representation of the pathway from protein expression to purification using solubility tags (Esposito and Chatterjee, 2006). Enhancing the solubility of recombinant proteins 19, 84, 215 - human proteins involved in cancer produced in E.coli 19 84 215 27 28 29 34 27 28 29 34 27 28 29 34 SiSisisi sis is I si S I S I S I S I Example of SDS PAGE with soluble (s) and insoluble (i) fractions following lysis. The results produced from the four different expression vectors (27: His tag only; 28: thioredoxin + His tag; 29: GST + His tag; 34: GB1 + His tag) are shown for three different target proteins (Hammarstrom etal, 2006). Solubility-enhancing tags - the mechanism of action -The mechanism by which partners exert their solubilising function is not fully understood and possibly differs between fusion proteins Examples of possible mechanisms Maltose binding protein (MBP) has an intrinsic chaperone-like activity MBP might bind reversibly to exposed hydrophobic regions of nascent target polypeptide, steering the polypeptides towards their native conformation by a chaperone like -mechanism. N-utilization substance (NusA) decreased translation rates by mediating transtriptional pausing, that might enable critical folding events to occur. MBP and N-utilization substance (NusA) attract chaperones. The fusion tag drives its partner protein into a chaperone-mediated folding pathway. MBP and NusA interact with GroEL in E. coli (Huang and Chuang, 1999). Small ubiquitin related modifier (SUMO) promotes the proper folding and solubility of its target proteins possibly by exerting chaperoning effects in a similar mechanism to the described for its structural homolog Ubiquitin (Ub; Khorasanizadeh et al., 1996). Negative charged tags (highly acidic peptide) inhibit aggregation by increasing electrostatic repulsion between nascent polypepdides (Zhang et. 2004). Solubility-enhancing tag - mechanism of action Thioredoxin Mts intrinsic oxido-reductase activity is responsible for the reduction of disulfide bonds through thio-disulfide exchange > Thioredoxin serves as a covalently joined molecular chaperone independently of redox activity. Thioredoxin may, thus, act to prevent the aggregation and precipitation of fused nascent proteins, giving them an extended opportunity to adopt their correct tertiary folds. Proposed mechanism of thioredoxin-catalyzed protein disulfide reduction. Reduced thioredoxin [Trx-(SH)2] binds to a target protein via its hydrophobic surface area. Nucleophilic attack by the thiolate of Cys32 results in formation of a transient mixed disulfide, which is followed by nucleophilic attack of the deprotonated Cys35 generating Trx-S2 and the reduced protein. Conformation changes in thioredoxin and the target protein occur during the reaction. X Holmgren, 1995; Berndt et al. 2006 In vitro solubility-enhancing tags Short peptide tags Poly-Lys tag, poly-Arg tag = one, three and five lysine or arginine residues fused to the C- or N-terminus of the target protein Solubility as defined here is the maximum protein concentration of the supernatant after centrifugation of the supersaturated protein sample (in vitro solubility). Nht N+hb ; NH Cht 1 1 Cht Cht 1 1 Cht Cht 1 J Cht Cht HjN+—C —CCf 1 HjN+—C —COž I H H Arginine (R) Lysine (K) - j 17 1 tSJÜ SJ 5.0 4JS 1 O 3.5 ■Iii 2£ 2.0 1JS i (i Ů.0 JA BPTI-22 = bovine pancreatic trypsin inhibitor variant containing 22 alanines The solubilization factor is defined as the molar ratio between the solubility of tagged BPTI-22 variants and that of the reference BPT-22 molecule. —>->->-> N- C- N- C- terminus The solubilization effect of poly-Lys tags is lower than that of poly-Arg tags (lysines are less hydrophilic than arginines). Kato et ah, 2006 Biochemical properties of poly-Arg and poly- Lys tagged BPTI-22 protein Protein Solubility Protein Cone \ m\f\ (Cone. |mg/ml| )* Solubilization Factor TmCC) ReL Trypsin Inhibitory Activity <%)c BPTI-22 1.70 (]Q.l)l)) 38.4 N Ik 1.70 (10.40) 1.00 (1.04) 352 1.05 N3K 2.66 (1*07) 136(2.00) 34.4 1.04 -M5K 537 (35.60) 3.16(336) 34.3 1.05 -C1K 1.79 (10.95) 1.05(1.10) 34.6 1.05 The addition of 0.5 M Arg ■ C1K (ök 2.41 7.1b l-3 7.-3 7) 1.42(133) 4.21 (4.75) 36J2 35.0 1.05 1.02 barely increased its solubility, and trypsin activity was inhibited by the high arginine concentration. On the other S' ik -N3R -MSR i.ť$ (10.34; 2.70 (17.23) 6.20 (41.11) 0.99 {L03) 139(1.72) 3.65(4.11) 33Ö 35.6 353 1.02 0.99 0.99 CIR LSI (11.07) 1.06(1.11) 35.0 1.05 hand, addition of 50 mM ■C3R. 3.02 (19.26) 1.78(1.93) 34.4 1.05 Arg+Glu was more effective C3R ■ C6R 8.23 (34.36) 10.39(73.41) 4.34(5.46) b.ll :i:S4 i 34.S 32.7 1.08 1.1 and increased protein solubility more than "threefold. BPTI-22" BPTI-22* 5.63 (33.1 1) 2.01 (11.82) 3.31 (331) ].18(1.18) NIT MD1 ].09 NAB _ Protein .solubility was Jeter mined as the minimum supernatant concentration of a supersaturated protein solution at AC in 100 mM acetate buffer pit 4.7. " Ma.1 im urn c otscen tr at ions calculated in m illigr arris per milliliter are indicated i n pa rent besis. T be Mw o f BPT!-11, - N E K and -C. i K, -N 3 K a nd -C3K, -N5Kand -CSK. -NIR and -OR. -N3R and -OR, -N5R and -C5R, and -C6Rare. respectively. 5880,6123,6379, 6636,6151,6463,6776, and 6932 Da. "1! ill l l I .it k li ::v Lkt r .■ Li ■ ■ h-L v-l i-r. l III- m i--l.tr pnjldij wk.hilky ol :l>'T: hkJ Lli:.L dI l.:^:--:. Iin piLr^tJlk^is i r: ill l Lit l- lln- r.iti--- -.jkt.l.ittii ir. milligram's per milliliters, L Relative trypsin inhibitory activity calculated as the ratio between the activity of BPTI-22 and that ot tagged BPTI-22. BPTI-22. which lacks R.W, an argi-nine residue involved in two hydrogen bonding interactions with tke tripsin residue backbone.3"1 has a reduced trypsin inhibitor activity corresponding to — 60% of the wt-BPTIand BPTI-;5,55] at stoichiometry and a protein concentration of 280 nM.™ d Solubility in the same buffer as above but with the addition of 50 nxM L-Arg -t- L-Glu. 1 The CD tber ma I melting curve could not be deter mi tied due to die s Dfong absorp ion of arginirse liihI jdulamic acid. 1 J'mLrLn srlLkilitv with mm I m.WArg-HCl Lidded to [ he above buflcr. ! The trypsin activity could not be determined because the lug h ar gin ine cofi central km inhibited trypS in activ ity. Kato et al, 2006 A) BPTI22 RPAFCLEPPYAQPAKARIIRYFYNAAAGAAQAFVYG0AAAKRNNFA8AADALAACAAA B) (a) (b) FIGURE 3 Hydrophobic residues in I3PTI-22. A: One letter amino acid sequence of BPTI-22 with the hydrophobic residues (A,V,1,L,F,P) shown in green letters. B: Left, BIrFI-22 ribbon model with a-helices colored red and rf-strands colored blue. Right, surface representation of BPTI-22 with the hydrophobic area determined as low electrostatic potential regions according to MOL-MOL,35 colored green. The molecule is oriented with the A-sheet pointing to the back in (a) and to the front in (b). The N- and C-termini are labeled "N" and "C,* respectively. The C-terminal end is located on the same face as a large hydrophobic patch shown in green, whereas the N-terminal end is on the opposite side ot the molecule and is shown with a light gray letter "N™ in panel (b). > The solubilization factor of all C-terminal tags was slightly higher than that of the respective N-terminal tags. > The C-terminus of BPTI-22 is close to a large hydrophobic patch, whereas the N-terminus is located on the opposite side of the molecule, away from the hydrophobic patch. > Charged residues seem to act through repulsive electrostatic interaction and thus hamper intermolecular interaction arising from the hydrophobic cluster. Kato et al, 2006 Solubility-enhancing tags - comparison of peptide and protein tags, conclusions > Protein tags are inherently large and need to be correctly folded in order to enhance solubility. > Protein tags are often natural affinity tags. > Peptide tags are small, and, importantly, they do not need to be folded, which provides a significant advantage over protein tags. > The use of small tags (< 30 amino acids long) does not increase protein size substantially and reduces steric hindrance, which simplifies downstream structural and functional applications without the need to remove the tag. > The solubilization enhancement effect depends on the size of the target protein. Solubility enhancement of fusion partners such as thioredoxin, GB1 is less pronounced for larger target proteins (above 25 kDa). MANY TAGS SUFFER FROM THE SAME PROBLEM - THEY DO NOT FUNCTION EQUALLY WELL WITH ALL TARGET PROTEINS. Otázka 2: Který tag/kotvu by jste využily pro zvýšení rozpustnosti proteinu bohatého na cysteiny? Removal of fusion tags- the Achilles' heel of the fusion approach All tags, whether small or large, have the potential to interfere with the biological activity of a protein, impede its crystallization (presumably due to the conformational heterogeneity allowed by the flexible linker region), be too large for NMR analysis, cause a therapeutic protein to become immunogenic or otherwise influence the target protein's behavior. The fusion tags can be removed by: > Chemical cleavage > Self - cleavage > Enzymatic cleavage Removal of fusion tags - chemical cleavage > Rarely used. Cyanogen bromide Met-X Hydroxylamine Asn-Gly 1 I MRGSHHHHHH M12 M15 / / GMASMEKNNQ M28V / GNGQGHNVPN 40 I DPNRNVDENA NANSAVKNNN NEEPSDKHIK EYLNKIQNSL STEWSPCSVT M105V CGNGIQVRIK PGSANKPKDE LDYANDIEKK ICKVEKCS Amino - acid sequence of the P. falciparum C-terminal segment of CSP (PfCSP C-ter) fused to a purification tag (Rais-Beghdadi et al, 1998). Chemical cleavage is a harsh method, efficient, but rather non-specific and may lead to unnecesary denaturation or modification of the target protein. Removal of fusion tags - self - cleaving > Use of self-cleaving fusion tags 1. Inteins I xn-i'i Intsin C-Exlem dna: FINA: transcription translation precursor protein COOH Protein; Protei n: A i intoit Nil;.- m^ N-extem ^rW^ 8 ^ protein splicing rpafunepfoieTi excised intein Inteins (/ff/ervening proteins) are protein segments that can excise themselves from protein precursors in which the are inserted and rejoin the flanking regions. > Self - splicing inteins can be mutated at the N- or C- terminal splice junction to yield self cleaving inteins, which can be used to mediate self cleaving of various tags. Perler, (2005) > Two categories of self-cleaving fusion tags - inteins: - inteins with pH-induced C-terminal cleaving activity - inteins with thiol-induced N- and C-terminal cleaving activity f-A pH intein N pH 6.0-6.5 -■ 20-25°C, 16 h A pH intein -N + C Thiol intein A Tag 15-30 mM thiol -i 4°C, 16 h + C- Thiol intein A AcNH—Thymosi ntein 0 Cys N-S Acyl Shift His-TaE AcNH—Thymosin 0 í Induced by Temp upshift or DTT/2-ME His-Taa Trans-esterification reaction - formation of thioester between thiol and C- term amino acid of target protein. This product is not stable and easily hydrolyze to release target protein and tag. •f HS His-Tns MtzCys AeNH^ Thymosin —cooh + Removal of fusion tags - self - cleaving fusion tag 2. HHHHH H—I SrtAi,o.>iift |—LPXTG —| target protein System based on the catalytic domain of Staphylococcus aureus sortase A (SrtA). SrtA cleaves the Thr-Gly bond at the conserved LPXTG motif in the substrates. Cleavage is inducible by adding calcium (cofactor of SrtA). 3. N(pro) -CXH target protein N-terminal protease (Npro) is the first protein of the pestivirus polyprotein. It posesses autoproteolytic activity and catalyzes the cleavage by switching from chaotropic to cosmotropic conditions. |-| fillJS! 4. target protein —D P— SPM (HI) FrpC modul (from G+ bacteria Neisseria meningitides): FrpC protein undergoes calcium - inducible autocatalytic proccesing at the peptide bond between residues Asp and Pro. Cleavage reaction is catalyzed by a self proccessing modul (SPM). 5. target protein \—VDAL ADGK— CPI) I—HHHHHH Vibrio cholerae secretes a large multifunctional autoprocessing repeats-in-toxin (MARTX) toxin that undergoes proteolytic cleavage during translocation into host cells. Proteolysis of the toxin is mediated by a conserved internal cystein protease domain (CPD), which is activated upon binding of inositol hexakisphosphate. (Li, 2011) Removal of fusion tags - self - cleaving fusion tag Inteins (1) > Uncontrolled in vivo cleavage or in complete in vitro cleavage > Target protein modification - pH or thiols can modify the target protein > Protein compatibility with cleaving conditions - pH induced inteins > Compared to the traditional protease based method, the intein-based approach requires fewer steps and lower costs. Other system (2-5) > Tested on limited number of cases Table General features, of the five self-cleavage fusion systems discussed in the text Self- MWa Purification Cleavage Advantages cleaving (kDa) tag condition tag lutein SrtA N1*" IrpC CPD 51; CßD, CUM, Thiols; pH Flexible fusion and cleavage options; 22 17 Is ]- 2h 2} phasin. ELP His-tag. biotin 19 His-tag His-tag. CUD His-tag and/or temperature shift 3 inM Ca"' Kosmotropic conditions 10 mM Ca"' 50-100 (iM InsPfi allowing generation of target protein with native sequence Potential i'l enhancing target protein expression and solubility Allowing generation of target protein with native sequence Efficient and tightly controlled cleavage; insensitive to protease inhibitors Potential of enhancing target protein expression and solubility; efficient and tightly controlled cleavage; insensitive to protease inhibitors Disadvantages Lack of solubility-enhancing capacity; in vivo cleavage; incomplete cleavage; misclcavagc In vivo cleavage: incomplete cleavage: introduction of an extra Gly residue to the jV-tenninus of the target protein Limited to proteins capable of refolding; in vivo cleavage: incomplete cleavage: long cleavage time Lack of solubility-enhancing capacity: introduction of an extra Asp residue to the C-terminus of the target protein; single C'-terminal fusion option Introduction of up to four non-native residues to the C-tcrrainus of the target protein: single C-terminal fusion option u Molecular weight of the self-cleaving tag h Inteins with different sizes arc available Li, 2011 Removal of fusion tags - enzymatic cleavage Cleavage site Protease Tag —- -> Target 4-37°C, time varies Target Site-specific proteolytic cleavage: > Exopeptidases > Endopeptidases Exopeptidases (aminopeptidases and carboxypeptidases): DAPase (TAGZyme) Exo(di)pcptidase Cleaves N-terminal. His-ttig iC-lerminal) for purification and removal A eromonai aminopept idase Exopeptidase Cleaves N-terminal. effective on M. L. RequiresZn Arm no peptidase M Exopeptidase Cleaves N-terminal. does not cleave X-P Carboxypept idaseA Exopeptidase Cleaves C-terminal No cleavage at X-R. P Carboxypeptidase B hxopepiidase Cleaves C-terminal R, K >APM, CPA and CPB release sequentially a single amino-acid from the N- or C- terminus of a protein until the stop site is reached. TAGZyme system (Qiagen): > DAPase (dipeptidyl aminopeptidase I) TAGZyme stop points Amino acid DAPase stop point iJ.) sequence' Lysine (lys. K) XaaXaa...Xaa'Xaa I Lys-Xaa ... Arginire (Arg. R) Xaa-Xaa...Xaa-Xaa 4-Arq-Xaa ... FVoline (Pro. P) XmXm. ..Xaa-Xaa 4- Xaa-Xaa PtoXm... Proline (Pro. Pj Xaa-Xaa. ..Xaa-Xaa i Xaa FVo Xaa-Xaa... Glilamine (Gin. Q1 Xaa-Xaa...Xaa-Xaa i Glrt-Xaa... DAPase cleavage DAPase stop MK HQ HQ HQ HQ HHP SDK HT-Tr\ HHP-Trx M 1 2 .1 4 5 6 7 = = 4- HT-Tr\ *- HHP-Trx Arnau etal., 2006 Removal of fusion tags - enzymatic cleavage Endopeptidases > The enzymatic cleavage site has to be placed between the fusion tag and the target protein. Enzyme Cleavage site Comments Enter ok inase DDDDK* Secondary sites at other bit sic aa Factor Xa idgr' Secondary sites at gr Thrombin lvpr'cs Secondary sites. Bio tin labeled for removal of the protease PreScission LEVLFQ'GP GST tag for removal of the protease TEV protease EQLYFC/c His-tag for removal of the protease 3C protease ETLFQXP GST tag for removal of the protease Sortase A LPET'G Ca!+ -induction of cleavage, requires an additional affinity tag (e.g., his-tag) for on column tag removal Granzyme B d'x.n'x.m'n.s'x Serine protease. Risk for unspecific cleavage , Protease sile Hisc IV B P Target protein Enterokinase Asp-Asp-Asp-Asp-Lys/X lalik-l Cleavage (c/r) of enterokinase through densitometry (Host'ield and Lu 1999) based on the amino acid residue X|. The sequence....-GSDYKDDDDK-Xi ADQLTEEQIA-... of a GST-cal- inodulin fusion protein \ vas tested using 5 mg protein digested with 0.2 Uof enterokinase tor 16 h at 37 °C Amino acid in position X] Cleavage of enterokinase (%) Alanine 88 Methionine 86 Lysine 85 Leucine 85 Asparagine 85 Pheny (alanine 85 I so leucine 84 Aspartie acid 84 Glutamic acid 80 Glutamine 79 Valine 79 Arginine 78 Threonine 78 Tyrosine 78 Histidine 76 Serine 76 Cysteine 74 Glycine 74 Tryptophan 67 Proline 61 Removal of fusion tags - enzymatic cleavage A critical review of the methods for cleavage of fusion proteins with thrombin and factor Xa Richard J. Jenny,a'* Kenneth g. Mann,b and Roger L. Lundbladcd * Haematologic Technologies, Inc.. &sex Junction. VT. USA b Departtnent ofBiochemistry, University of Vcnnmtt. Burlington. IT. USA c Deparnuein of Pathology, University of North Carolina. Cltapel Hitl. AT. USA d Roger L Lmdblad, LLC. Chapel Hill. NC. USA Received 27 Hebruary 200.1. and in revised form 7 May 2003 The purpose of this review was to demostrate that both thrombin and factor Xa can hydrolyze variety of peptide bonds within the fused proteins of interest. Sequences cleaved by thrombin in polypeptide hormones Polypeptide homonesa Sequence cleaved Secretin ELSLSRLRDSA Secretin ELSLSRLR (much slower than above) Vasoactive intestine polypeptide DNYTRLRK Vasoactive intestine polypeptide YTRLRKQM Choleocystokinin APSGRVSM Choleocystokinin VSMIKNLQ Dynorphin A RIRPKLKW Somatostatin-28 AMAPRHRK Somatostatin-28 NFFWKTFT Gastrin releasing peptide KMYPRGNH Salmon calcitonin QTYPRTNT "The reaction mixtures contained 0.5NTH units thrombin and l.Onmol peptide in 20uL of 50mM NH4CO3, pH 8.0, at 25*C The conditions were designed to obtain an enzyme/substrate ratio of 1:60 fw/w). Protein ^Expression ^ Purification Accuracy of cleavage has to be precisely verified! pRSETB::AHP2 Enterokinase cleavage site N^GSHHHHHHGMASMTGGQQMGRDLYDDDDI^)PSSRSAAGTMEFMDALIA....................GIVPQVDIN C Theoretically: 3,4 kDa 18,9 kDa Intact mass spectrometry analysis a.i. 1000 800 600 400 " 200 - 40 4 1 5 2 2 1 NUIkili UK 49 23 3897 1968 I AHP2 enterokinase 1 3468 110 3 1 -1- 1 1 1 1 D 3 9 75 1 8258 -1- AHP2 control AHP2_standard 3 7 3 8 9 4 I 405 3 1 1 67 1 1 0 3 4 "I —1—I-1—I—1—I—1—I—1—I-1—I—1—I-1—I—I—I-1—I—1—I-1—I—I—I-1—I—1—I—1—I—I—I—1-1—1—I—I-1—1—I—1-1—1—I— 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 22000 m/z SUMO gene fusion system 6xHis SUMO protease recognizes the tertiary structure of SUMO rather than an amino acid sequence. As a result, SUMO protease never cleaves within the fused protein of interest! Removal of fusion tags - enzymatic cleavage > Optimization of protein cleavage conditions (mainly enzyme-to-substrate ratio, temperature, pH, salt concentration, length of exposure). > Cleavage efficiency (Optimization is needed. The efficiency varies with each fusion protein in an unpredictable manner, probably due to aggregation or steric issues; the problem can be solved by introducing short linkers between the protease site and the fusion tag). > Unspecific cleavage (SOLUTION: optimization of protein cleavage conditions or using re-engineered proteases with increased specificity such as ProTEV and AcTEV proteases). > Precipitation of the target protein when the fusion partner is removed (so-called soluble aggregates; SOLUTION: another approach for protein solubilization has to be found). > Failure to recover active or structurally intact protein > Target protein modification (some proteases like thrombin, TEV, Precision leave one or two amino-acids on the target protein near the cleavage site). > Re-purification step is needed to separate the protease from target protein. > High cost of proteases The alternative is to leave the tag in place for structural analysis: The small tags are a better choice in structural and functional analysis of proteins. Otázka 3: Jaký je rozdíl mezi inteinem a samo-vyštěpujícím tágem odvozeným od inteinu? Affinity chromatography (AC) > A type of adsorption chromatography, in which the molecule to be purified is specifically and reversibly adsorbed to a complementary binding substance (ligand, L) immobilized on an insoluble support (matrix, M). Affinity tag Protein of interest Sepharose Affinity tag binding partner Immoblized binding partner of Affinity tag fused to N- or C-terminus affinity tag of protein TPEG (substrate analogue of" (i-gaEactosidase Ji-galactosidase) Glutathione Glutath i on e-S- Transferase immunoglobulin G Protein A Cu II, Co II or Ni II poly His or poly Cys > AC has a concentrating effect, the high selectivity of separations derived from the natural specificities of the interacting molecules. > AC can be used (1) to purify substances from complex biological mixtures, (2) to separate native forms from denatured forms of the same substance, and (3) to remove small amounts of biological material from large amounts of contaminating substances, (4) and to isolate protein complexes from the native source. > the first application was in 1910 (adsorption of amylase onto insoluble starch) but it developed during the 1960s and 1970s. Affinity tags and affinity purification Atftnily tag Protein of'interest Scpharosc Affinity lag binding partner Immoblized binding partner of Affinity tag fused to N- orC-terminus affinity Lag of protein TPEG (substrate analogue of fl-galactosidase fj-galactosidase) Glutathione Glutathione-S-Transferase Immunoglobulin G Protein A Cu II, Co II or Ni II poly His or poly Cys Table 2 Sequence and size of affinity tags Tag Residues Sequence Size (kDa) Poly-Arg 5-6 RRRRR 0.80 (usualh 5) Poly-His 2-10 HHHHHH 0.84 (usuallv 6) FLAG a DYKDDDDK 1.01 Strep lag 11 g WSHPQFEK 1.06 c-myc 1 1 EQKL1SEEDL 1.20 S- 15 KETAAAKFERQHMDS 1.75 HAT- 19 KDHL1HNVHKEFHAHAHNK 2.31 3x FLAG DYKDHDGDYKDHDIDYKDDDDK 2,73 Calmodulin-binding peptide 26 KRRWKKNF1AVSAANRFKK1SSSGAL 2.96 Cellulose-binding domains 27-189 Domains 3.00- 20.00 SBP 38 M DEKTTGW RGGHV VEGL AG ELEQLRARLEHH PQGQ REP 4.03 Chitin-binding domain 51 TNPGVS AWQ V NT A YT AGQL VT Y NGKTYKCLQPHTSLAGWEPSNVPA LW QLQ 5.59 Glutathione S-transferase 211 Protei n 26.00 Maltose-binding protein 396 Protei n 40.00 A tag is fused to the N- or C-terminus of the protein of interest to facilitate purification, which relies on a specific interaction between the affinity tag and its immobilized binding partner. Genetically engineered fusion tags allow the purification of virtually any protein without any prior knowledge of its biochemical properties. Purification tags Affinity tags Affinity tag Matrix Poly-Arg Polv-His FLAG Strep-tag II c-myc s HAT (natural histidine affinity tag) Calmodulin-biriding peptide Ccllulosc-binding domain SBP Chi tin-binding domain Glutathione S-transferase Maltose-binding protein Cation-exchange resin Ni2+-NTA, Coy+-CMA (Talon) Anti-FLAG monoclonal antibody Strep-Tactin (modified streptavidini Monoclonal antibody S-fragment of RNaseA Co2+-CMA (Talon) Calmodulin Cellulose Streptavidin Chtttn Glutathione Cross-linked amvlose > Traditional purification tags > The tag binds strongly and selectively to an immobilized ligand on a solid support. > After optimization one could achieve > 90% purity. Non - chromatographic tags Tag_Matrix_ ELP PHB annexin Bl None Intracellular PHA granules None > These tags can eliminate affinity resin. Proteins are isolated by other non-chromatographic methods (centrifugation, filtration) > typically combined with self-cleaving tags > 75 % - 95 % purity + DTT X T" Additional separation slep ftfTmltytag-Inker-UrBfttpnoteln \ frwwa eteavags j MjU heatmg a„d/or „ /"A I salt addition Affinity fag - intern - target protein * ^ Centrifugation % ELP - intein - target protein Phasin - inlein - target protein Purification tags Non - chromatographic tags + DTT Mild heating and/or ^ salt addition O 1 o Centrifugation The PHB system (c): > PHB (polyhydroxybutarate): subclass of biodegradable polymers produced in various organisms, use as storing excess carbon. > The system includes in vivo production of PHB small granules (from the plasmid carrying PHB-synthesis genes). > Target protein in fusion to self cleaving phasin tag. > Tagged protein binds to the PHB particles via phasin tag, which allows the granules and the tagged protein to be co-purified via centrifugation. Phasin - intein - target protein ELP - intein - target protein > DTT induced cleaving activity of intein and thus elution of the target protein. The ELP system (d): > ELP (elastin-like polypeptide) selectively and reversibly precipitates in response to changes in temperature and buffer salts. This allows soluble and insoluble contaminants to be removed by filtration or centrifugation. Components of a matrix for affinity chromatography K Ni2+ NTA sepharose A ligand > The dissociation constant (Kd) for the ligand - target complex should ideally be in the range 10"4to 10"8 M in free solution to allow efficient elution under conditions which will maintain protein stability. > A ligand has to be attached to the matrix with a suitable chemically reactive group. The mode of attachment must not compromise the reversible interaction between the ligand and protein. Components of a matrix for affinity chromatography A matrix > Typically, a macroporous polysaccharide bead such as agarose, that provides a porous structure so that there is an increased surface area to which the target molecule can bind. > A matrix has a suitable attachment site for the ligand. Typically matrices are chemically activated to permit the coupling of the ligand. A number of activation methods are available which depend on the nature of the matrix and the availability of compatible reactive groups on the ligand. 36 Components of a matrix for affinity chromatography Support matrix L t=u C" I.igand attached to the matrix via a spacer arm. Ni2+ NTA sepharose Spacer arm > A spacer arm will be required in cases where direct coupling of the ligand to the matrix results in steric hindrance and subsequently the target protein will fail to bind to the immobilized ligand efficiently. The introduction of a spacer arm between the ligand and the matrix minimizes this steric effect and promotes optimal adsorption of the target protein to the immobilized ligand. 37 Typical affinity purification steps adsorption of wash re-equilibration 1-2 cv - x cv-*- 1-2 cv cv 1-2 cv Column Volumes (cv) > In the equilibration phase, buffer conditions are optimized to ensure that the target molecules interact effectively with the ligand and are retained by the affinity medium as all other molecules wash through the column. > During the washing step, buffer conditions are created that wash unbound substances from the column without eluting the target molecules or that re-equilibrate the column back to the starting conditions (in most cases the binding buffer is used as a wash buffer). > In the elution step, buffer conditions are changed to reverse (weaken) the interaction between the target molecules and the ligand so that the target molecules can be eluted from the column. Affinity chromatography - Immobilized metal ion affinity chromatography (IMAC) > The most common purification tag is typically composed of six consecutive histidine residues. > Histidine, cysteine, and tryptophan residues are known to interact specifically with divalent transient metal ions such as Ni2+, Cu2+, Co2+, and Zn2+. > Histidine is the amino acid that exhibits the strongest interaction with immobilized metal ion matrices as the electron donor groups on the histidine imidazole ring readily form coordination bonds with an immobilized transition metal. L— c CH—Ch. ii' ^ >■ rx m *°i -'h á .M* iOCH "Ctí -Ch' V~V-i M (kDa) 170 Binding strength of His tag to metal ions: Cu2+ > Ni2+ > Zn2+ _ Co2+ '(HisifZfn-peO.r Zn2+ Ni2+ Co2+ Cu2+ (Zouhar et ah, 1999) > IMAC can be used under native and/or denatured conditions. > A highly purified protein can often be obtained in one or, at most, two purification steps. His-tagged protein and IMAC under native conditions > Optimal binding of recombinant protein with metal ion is achieved at pH 7-8. > Buffers with a high salt concentration (0.5-1 MNaCl) reduce nonspecific electrostatic interaction. > Nonionic detergents or glycerol reduce nonspecific hydrophobic interactions. > Elution of contaminating proteins can be achieved by lowering the pH or using low concentrations of imidazole. > Elution of tagged protein is achieved at high imidazole concentrations (0-0.5 M), by strongly decreasing the pH, or by using EDTA. Immobilized metal affinity chromatography Hisn protease cleavage site PROTEIN - protease + His-tagged protein and IMAC under native conditions One-step purification of maize ß-glucosidase > Perfusion matrix: POROS MC/M > Functional group: iminodiacetate, metal ion Zn2+ > Removing contaminated proteins: linear gradient of imidazole (0-50 mM) and pH (pH 7-6.1) > Protein elution: 0.1 M EDTA > 80% recovery, 95 fold purification > Common production and isolation of the wild type protein and soluble mutant form for enzymatic measurements and crystallization. M (kDa) I-1 A B C D (Zouhar et al, 1999) His-tagged protein and IMAC under denatured conditions - Purification of proteins expressed in inclusion bodies. - Purification in a high concentration of urea or guanidine chloride. - Result is a pure protein, but in a denatured form (sufficient for immunization). Recovery of native conformers (necessary for functional and structural analysis): > Binding to the column under strong denaturing conditions (8 M urea) > Two possibilities of renaturation: 1. The protein is eluted from the column and renatured by dialysis or rapid dilution in renaturing buffers. 2. Renaturation of the protein bounded to the column (matrix assisted refolding procedure): gradient from denatured to renatured buffers or pulsion renaturation (8-OM urea). Identification of properly refolded (His)6Zm-p60.1 (maize (3-glucosidase) using 10% native PAGE, followed by activity in gel staining: A = crude protein extract prepared from maize seedlings containing the native enzyme B = (His)6Zm-p60.1, renatured product (matrix assisted refolding procedure - 23 renaturing cycles) C = (His)6Zm-p60.1 purified by native IMAC KM (His)6Zm-p60.1 purified by native IMAC: 0.64 ± 0.06 mM KM (His)6Zm-p60.1 renatured product: 0.6 ± 0.08 mM Determination of vmax and kcatwas hampered by the fact that the refolding process yielded a number of improperly folded polypeptides. zm-peov f(His)eZm-p60.r (Zouhar et al, 1999) His-tagged protein and IMAC under native conditions Two-step purification of Arabidopsis histidine phosphotransfer protein 5 > IMAC matrix: highly cross-linked spherical agarose (His)6AHP: > Functional group: nitrilotriacetic acid, metal ionNi2+ > Removing contaminated proteins: linear gradient of imidazole (20-500 mM) > Protein elution: 130 mM imidazol > Common production and isolation of the wild type protein for protein-protein interaction measurements and crystallization. 1st step - metal chelate affinity chromatography -► 2nd step - gel filtration His-tagged protein and IMAC under native conditions Four-step purification of Arabidopsis C KI1 RI) 1. Affinity purification (IMAC) 2. Tag removal (TEV protease) 3. Affinity purification (IMAC) 4. Size exclusion chromatography Ub- SGSG- HisTaq- SA-" rEV- AME-( :kii 3. Affinity purification after TEV cleavege -CKI1RD Hi Trap Chelating 5 ml Mi 8 2011:10 UV -CKI1RD Hi Trap Chelating 5 ml linear gradienti 8 08 2011:10 Cone -CKI1RD Hi Trap Chelating 5 ml linear gradientiB OS ?011 10 Frartinns----CKI1RD Hi Trap Chelating 5 ml linear gradienti 8 OS ?01110 Injer.l CKI1 RD pETM-60:lXIl 0 rhM imidaz^ 200 mM imidazole ■H.N.N.H411H2I MJBkMJá m WW I1H2I WW I1ÍH2I WW ht^l I pETM-60::CK 1^ pETM-60 4. Size-exclusion chromatography ■ L.I..LLU U U UXU.U.1XU U 1.1 .-__ 4» 1 L^-10-20 mgforTB and M9 Pekařova B. Otázka č.4: Jakými metodami se izolují proteiny fúzované s nechromatografickými tagy/kotvami? Affinity purification for studying protein-protein interaction > Affinity purification provides a high-efficiency method for isolation of interacting proteins and protein complexes: > Co-immunoprecipitation > GST (or His) pull-down > Tandem affinity purification > Testing known protein-protein interaction. > Identification of novel protein-protein interactions. Co-immunoprecipitation (Co-IP) > The principle: If protein X is immunoprecipitated with an antibody of X, then protein Y, which is stably associated with X in vivo, may also be precipitated. This precipitation of protein Y, based on a physical interaction with X, is referred to as co-immunoprecipitation. > An obvious advantage is that complexes are isolated in the state closest to the physiological condition. > When a good quality antibody of X is available, Co-IP is a fast method and there is no need to clone and express the component(s) of the complex. □ A □ O O l Cell lysis under mild conditions that do not disrupt protein-protein interactions (using low salt concentrations, non-ionic detergents, protease inhibitors, phosphatase inhibitors). 2 3 The protein of interest (X) is specifically immunoprecipitated from the cell extracts (using an antibody specific to the protein of interest or to its fusion tag). The antibody-protein(s) complex is then pelleted usually using protein-A or G sepharose, which binds most antibodies . 4 Eluted immunoprecipitates are then fractionated by SDS-PAGE. 5 A protein of known identity is most commonly detected by performing a western blot. Identification of novel interaction is carried out by mass spectrometry analysis. Pull-down assay > Pull-down assays are a common variation of co-immunoprecipitation and are used in the same way, but pull down does not involve using an antibody specific to the target protein being studied. > They are used for purification of multiprotein complexes in vitro. > The target protein is expressed in E. coli as GST fusion and immobilized on glutathione-sepharose beads (GST alone is often used as a control). > Cellular lysate is applied to the beads or column, and the target protein competes with the endogenous protein for interacting proteins, forming complexes in vitro. > Centrifugation is used to collect the GST fusion probe protein and adhering proteins. > The complexes are washed to remove nonspecifically adhering proteins. Step 1, Immobilize thř I usion-taggerj "bsif fiom the lysat*. Gil Jwnnirj Listní * * - r*• Hni -riílní Hjil ŕruHir-Contaninj Lflie Step A, Wash awsy unbound protein, Slep Í. Wash awrav unbound protein, r,|,i:, Step 5. Elule protein: proteir interaction iomplo:. DilplKid IHtr Iding Cům plň &lep 3, Bind "FTjsy" protein to immobilized "bail protein. f * * • * * Ly**4 Slep S. Analyze proTeinjorgten interaction complex on SDS-FYVGE. M*ka Puiiltd *(■(** P^-Fiod ■lt\ OH 'ííll-l ľ*-i: CMMSl InKfKtibi > Free glutathione is used to elute the complexes from the beads, or alternatively the beads with attached complexes are boiled directly in an SDS-PAGE sample buffer. > The proteins are resolved on SDS-PAGE and processed for further analysis. Tandem affinity purification (TAP) Two-step purification strategy in order to achieve higher purity of isolated multiprotein complexes under near physiological conditions. This method was originally developed for use in yeast and quickly adapted to higher eukaryotes such as insect cells, human cells and plant cells. Examples of TAP (tandem affinity peptides) tags TAP tag: a double affinity tag (highly specific) which is fused to a protein of interest as an efficient tool for purification of native protein complexes. lngpeptldr(CBPl^ Tap-ragged polypeptide expressed near physiological levels Oknoduín binding aeptWei TEV protease cleavage she Protein A IgC-bi nctlng doma n - ATás PROTEIN-PROTEIN 7 TU* PROTEIN-DNA INTERACTIONS INTERACTIONS Cel lysis in mild condh ons / I Crcsslnkmittifoimak)*hyde Tandem affHiy chrorrilograpny Afinity crvomatoqraphy ilÄqcl i fleYersecroülnkj Curřy DNA PCH wi th specific primers PtirtfyarWlibdDNA tybňdlie Mass Spcí ťomttry Single-Locus PCR Genomic sequences Ifteguialoiy and Qarwc toed] Location DIU Micrurray (a) (b) (c) (d) Protein A-Protein A TEV CBP 20.7 kDa Protein G Protein G-TEV SBP 18.8 kDa 4.6 kDa [KG5-6XHIS Biotin signal bxHis 96 kDa Colins knd Choudhary, 2008 Current Opinion in Biotechnology Tandem affinity purification TTene of Interest CBP TEV Site IaGBD 1 / — i Expression and purification from a culture. 2 ^4ffinity Column 1: IgQ Beads T IgGBead £ with TEV Protease y ^ ^ Cleavage J Column 2: QaJmadldw Beads Ca 2+ Ca 2 • ^ I Co*+ Chelation, Elution D C J Sepa ration S Band Excision i And Digestion *J I Peptide Separation ' I And MS A nalys is "T .7 1. Protein 2. A 3. B 4. C 5. D (c) Lower background and higher complex yield with GS tag compared to TAP tag (Chepelev etal. 2008) J? J in -.15 Ii Affinity purification for studying protein-protein interaction > An affinity tags can influence protein-protein interactions (testing N- and C-terminal fusions). > Loss of weak or transient protein-protein interactions. > Non-specificity: controls, affinity tags with higher specificity > Verification of newly identified interactors by other methods and biologically relevant mutants.