DNA transcription - RNA synthesis Regulation of gene expression Biochemistry I Lecture 13 2008 (J.S.) Eukaryotic transcription and translation are separated in space and time Prokaryotes Eukaryotes 2 DNA is a template in RNA synthesis In DNA replication, both DNA strands of ds DNA act as templates to specify the complementary base sequence on the new chains, by base-pairing. In transcription of DNA into RNA, only one DNA strand (the negative strand) acts as template. The sequence of the transcribed RNA corresponds to that of the coding (positive) strand, except that thymidine is replaced by uridine in RNAs. dsDNA_ 5-p----CACCTGCTCAGGCCTTAGC*** -3-oh 3 -oh-« ••GTGGACGAGTCCGGAATCG*** -5 -p transcribed RNA 1 5-p----CACCUGCUCAGGCCUUAGC*- -3-oh coding strand positive strand template negative strand 3 RNA synthesis Ribonucleoside triphosphates are the substrates for the synthesis. RNA polymerases (DNA-dependent ribonucleotidyltransferases) recognize the nucleotide sequences in the template strands, initiate the synthesis of new chains of RNA without a primer, and catalyze the formation of 3'-5' phosphodiester bonds in the complementary transcripts. The nascent RNA chains grow only in the 5'—► 3' direction, antiparallel to the direction of the template strand. In contradistinction to DNA polymerases, RNA polymerases don't exhibit any nuclease (proof-reading) activity so that they cannot correct mismatches. 3'- RNA polymerases have binding sites - for the free 3-OH group, - for bases of the template strand, and - for nucleoside triphosphates. They cleave (3-phosphate bond of NuTP and form 3'-5' phosphodiester bond. 5'_ p_0-p_0_p_D-CH DNA template ■5'-P base -OH C-CH, binding sites for NuTP RNA polymerase 4 New 3-5' phosphodiester bond originates in the reaction between 3'-OH group of existing chain and a-5'-phosphate of the incoming nucleoside triphosphate, diphosphate is released (complexed with Mg2+ions). Template strand DNA N. 3'-i s \ \ \ \ \ r- 5-P 5-P-P-PJ -V- RNA-DNA hybrid hy Cyt Ade Ade Gua Cyt ■ ■ ■■■ ■■ ■■ ■ ■ ■■■ ■■ ■■ ■ ■ ■ ■ ■ ■■ ■■ Ade Gua Urd Urd Direction of RNA synthesis (movement of RNA polymerase) 5 RNA polymerases (DNA-dependent nucleotidyltransferases, transcriptases) In prokaryotes RNA is synthesized by a single kind of RNA polymerase. RNA polymerase from Escherichia coli consists of five subunits of four kinds, one of which is the a factor that helps find a promoter site where the transcription begins (and then dissociates from the rest of the enzyme. In eukaryotes the nucleus contains three types of RNA polymerase. The mechanism of their action is the same, but they differ in binding onto different promoters (template specificity), location in the nucleus, and also in susceptibility to inhibitor oc-amanitin. RNA polymerases contain from 8 to 14 subunits (Mr > 500 000). In the mitochondrial matrix, there is the fourth type - mitochondrial RNA polymerase. RNA polymerase Nuclear location Primary transcripts pol I nucleolus pre-rRNA 45 S pol II nucleoplasm pre-mRNAs, some snRNAs pol III nucleoplasm pre-tRNAs, rRNA 5 S, some snRNAs 6 Amanita phalloides (the death cup) produces a-amanitin that blocks the elongation phase of RNA synthesis 6-0 H C4,5^iOH)Ne—Trp—Gly S I (3-oH)Pro s=o Asn—Cys—Gly a-Amanitin is a cyclic octapeptide, in which the sulfinyl group (oxidized sulfanyl group of the cysteinyl residue) is attached to the indole ring of the tryptophyl residue. It is an effective inhibitor of eukaryotic RNA polymerases II and III, namely that of the polymerase II. 7 Transcription of DNA is a three-phasic process consisting of initiation, elongation, and termination. Transcription starts at promoters on the DNA template. Promoters are sequences od DNA that direct the RNA polymerase to the proper initiation site for transcription. Each of the three types of RNA polymerase has distinct promoters. Promoters are mostly in the normal upstream position to the initiation site. The effectiveness of promoters can be regulated (increased or restrained) by specific DNA sequences called enhancers or silencers that may be distant up to 2000 base pairs from the promoter either upstream or downstream. Promoters and enhancers are referred to as cis-acting elements, because they are sequences of the same molecule of DNA as the gene they regulate. The DNA sequences of cis-acting elements are binding sites for proteins called transcription factors. If those factors are encoded by a gene on a DNA molecule other than that containing the gene being regulated, they are called trans-acting factors. 8 Eukaryotic promoter site for RNA polymerase II promoter positive strand 3'- XXXXXDOC GGCAATC ATATAA template strand ~-100 -25 i 1 transcription unit (the transcribed DNA sequence) start of transcription xx>boocxtx>ooooo -► coding region -1 +1 CAAT box (sometimes present) in basal gene expression specifies the frequency of initiation TATA box (Hogness box) directs TF II D and RNA pol II to the correct site Polymerase II and transcription factors bound onto the promoter form a complex called the basal transcription apparatus. It regulates basal gene expression. Genes that are regulated wholly in this way are constitutively expressed genes (products of which are most constitutive proteins). Specifically regulated expression of numerous genes is mediated by various gene specific transcription factors. Those proteins (coactivators, corepressors, transcoactivators, etc.) bind to regulatory DNA sequences distant from promoters. The basal transcription apparatus is thus regulated through direct or mediated contact with the gene specific transcription factors. See "Regulation of gene expression". Transcription initiation Initiation begins with the binding of TF II D (transcription factor D for pol II) to the TATA box. TF I ID provides docking sites for binding of other transcription factors. One of those factors is an ATP-dependent helicase that separates the DNA duplex for the polymerase II, the last but one component of the basal transcription apparatus. Pol II contains an unphosphorylated carboxy-terminal domain (CTD). promoter unwound DNA (~17 bp opened) Polymerase II with its unphosphorylated CTD then slides to the start of transcription and initiate transcription by producing short transcripts consisting of not more than 20 - 25 nucleotides. After the transcription is initiated, most transcription factors are released from pol II. 10 Switch from initiation to elongation is driven by phosphorylation of carboxy-terminal domain of pol II CTD is then phosphorylated. The resulting change in conformation of pol II (from pol II A to pol II O form) enables binding of capping enzyme (CE) to pol II and methyltransferase (MT) to CE. Both those enzymes modify the 5'-end of the nascent transcript to 5-m7Gppp-cap required for the further progress in transcription. The phosphorylated CTD of pol II has a central role in cotranscriptional RNA processing, because it also binds, in addition, splicing factors and factors responsible for the final polyadenylation of the transcripts. Elongation phase 12 Termination In prokaryotes, termination signals usually contain a palindromic GC-rich region and an AT-rich region. Thus the mRNA transcripts of this DNA palindrome can pair to form a hairpin structure with a stem and loop followed by a sequence of more uracil base -RNA transcripts end within or just after them In eukaryotes no perspicuous termination signal has been found. Transcripts produced by DNA polymerase II are released from the transcription apparatus after the polyadenylation signal AAUAAA and the GU- or U-rich sequence that is able to bind cleavage stimulation factor (CStF) had been transcribed. The terminal sequences of the transcripts are decomposed in the course of 3'-polyadenylation (not encoded by template DNA). iT^G I I A-U I I C-G I I C-G I I G-C I I C-G I I C-G I I G-C _ 5' 3' 13 Polyadenylation of transcripts 5-m7Gppp 5 -m7Gppp Cleavage-and-polyadenylation specifity factor (CPSF) binds onto an polyadenylation signal AAUAAA. It is not quite clear when transcripts are released from the transcription apparatus. A downstream GU- or U-rich sequence binds the cleavage stimulation factor (CStF) and cleavage factors (CF 1,2), a loop is formed. 5 -m7Gppp cleavage ►UU/GG< 5 -m7Gppp Binding of poly(A) polymerase (PAP) then stimulates cleavage at a site about 20 nucleotides downstream the polyadenylation signal. The cleavage factors are released, the cleaved RNA chain degraded. | CStF | -OH Poly(A) polymerase adds 12 adenylate residues, elongation is provided by transfer of many short poly(A) chains from poly(A)-binding protein. 14 The transcription products of all three eukaryotic polymerases are processed before their export from the nucleus Precursors of rRNA are cleaved in functional rRNA types. _klA 1A ., Ribosomal RNA folding pattern (rRNA 18 S) RNA polymerase I transcribes 45 S pre-RNA within the nucleolus: AUGC AUCUGGAG AUA GGUCtCG, 45 S pre-rRNA / \ 32-35 S rRNA 20-23 S rRNA / 28 S rRNA \ 18 S rRNA 5.8 S rRNA AACCUGGGA CUGCAUCUGA CUGGCAAgM: * u —A c i i * i i iii i i'll i' cg G — c UCGGGCCC GUGU.GJCU G.UUGUUUG '//uC = = = """ sin °c S a" ' *»S ."Hi « cucfl CcUc P GG, GA A GGGAGUGGCCG C ' *Lc-cCfl yAA g; .vrcc The fourth ribosomal RNA |5 S rRNA is transcribed by RNA polymerase III within the nucleoplasm. CCGGGGAG CGCCA Examples of tRNA processing: Modification of somebases o HM uridine ribose ribosyl thymine HN transformation of the linkage to ribosyl nh pseudouridine ('- AG-GUAAGU-A-CAG-G -3 exon 1 intron exon 2 cleavage / ■AG-GUAAGU C3>. U1 Lariat forrr of intron Spliced product U2 ■AG-3-OH ■AG joining - _ -CAG-G- U5 CAG-G- -3' -3' U6 cleavage @>/ CAG-G -3 exon 1 exon 2 -3' 5-2-phosphodiester bond between 5'-end of the intron and branch site Ado forms a lariat spliceosome -CAG -3 Excised intron sequence is degraded in the nucleus 19 There are many types of small RNA molecules with fewer than 300 nucleotides in the nucleus - small nuclear RNAs (snRNAs). A few of them are essential for splicing pre-mRNA. They associate with specific nuclear proteins to form complexes called small nuclear ribonucleoprotein particles (snRNP, "snurps"). Small nuclear ribonucleoprotein particles (snRNPs) in the spticlng at mRNA precursors Si** fljfsnRNA (njufTiber of nucleotides) Function U1 1 &J5 Binds the Si splice site and then the 3'*splice site U2 Rinda the branch *ite and f^nm? part of the catalytic ceotsi US 116 "BihdK iht: 5' splice aite U4 145 Mi*»ks the Cu l illy tic activity of U& U6 100 During the splicing of pre-mRNA, the processed mRNA, snRNPs U1, 2, 4, 5, 6, and other protein splicing factors form large assemblies (about 60 S) termed spliceosomes. 20 Export of messenger ribonucleoprotein particles (mRNPs) through the nuclear pore complex ! mRNP For the transport, messenger ribonucleoprotein particle (mRNP) associates with the heterodimer known as the general mRNA export receptor (exportin 1). The nuclear pore complex consists of proteins nucleoporins, which contain Phe- Gly repeats and zinc-finger domains. These structures provide transient docking sites for the complex export receptor - mRNP traversing the nuclear pore "basket". 21 Regulation of gene expression 22 Gene expression is the term that involves conversion of the genetic information encoded by a gene into the final gene product, i.e. a protein or a functional RNA (rRNA, tRNA). Control of gene expression in prokaryotes differs from that in eukaryotes distinctly. Gene expression in prokaryotes In prokaryotes, gene activity is controlled foremost at the level of transcription, at its initiation. The structural genes are usually grouped together in operons, which are transcribed from one promoter controlled by a regulatory protein. Regulator gene operator 0peron fx^xxxx^xj ==== }oopo ■ ■ ■ ■ ■ ■ "OOOOOOC TA ~ 2 000 bp upstream •*- promoter enhancer 33 Regulation of transcription by steroid and thyroid hormones Steroid and thyroid hormones (iodothyronines) are hydrophobic so that they can diffuse through the plasma membrane into the cells. Hormones are bound onto specific intracellular receptors. Complexes of these receptors with hormones are specific transcription factors. They bind onto regulatory DNA sequences called hormone response elements (HRE). The interaction with coactivators and mediator proteins follows and interaction between mediator proteins and the basal transcription apparatus initiates (or inhibits) the transcription of particular gene. 34 Example: Initiation of transcription by Cortisol Active complex Cortisol-receptor binds onto DNA at the specific sequence GRE (glucocorticoid response element, one of the HRE - hormone response elements). The coactivator and specific hormone response element-binding proteins (HREB-proteins) are also attached. This complex acquires the ability to act as enhancer that supports initiation of transcription on the promoter by means of mediator proteins. cortisol-GR dimer complex I > 1 000 bp > basal transcription apparatus enhancer promoter GR dimer - intracellular glucocorticoid receptor (dimer) GRE - glucocorticoid response element GREB protein - GRE binding protein (a specific transcription factor) 35 Transcription factors that bind onto regulatory DNA sequences comprise mostly one of the typical structural motifs: helix-turn (or loop)-helix, zinc-finger, and leucine zipper. Only the small part of protein molecule (called DNA-binding domain) is responsible for the interaction with DNA. It is usually represented by two adjacent a-helical segments. helix-turn-helix zinc finger leucine zipper Zinc finger, e.g., occurs in DNA binding domains of steroid-hormone receptors. NRS (nucleotide recognition signal) is a part of oc-helix containing amino acid sequence that is able to recognize specific regulatory sequence of nucleotides in the major groove DNA. Transcription factors are attached to DNA usually in the major groove. 36 3 Regulation at the level of processing of primary transcripts Alternative splicing Alternative splicing can cause that the products of a sole gene are various proteins: mRNA pre-mRNA 1 | 2 I 4 | | 1 I 3. I 4 | | exon 1 |exon 21 exon 3|i iexon 4 1 ? 3 4 1 mRNA | | I | [ - RNA editing In some mRNAs, the base sequence is altered after transcription by processes other than RNA splicing. Those processes are called RNA editing and are not very rare E.g., cytidine residue may be deaminated to uridine, adenosine to inosine. 37 4 Regulation at the level of translation is mediated mostly through changes in activities of eukaryotic initiation factors (elFs). Example: The synthesis of globin in reticulocytes is controlled by phosphorylation of the initiation factor elF2. It is active when phosphorylated, inactive in the dephosphorylated form. Haem prevents from elF2 from phosphorylation. If haem is present within the cell, elF2 is not phosphorylated - active, the translation of mRNA for globin chains proceeds. If there is no haem in the cell, elF2 is inactive and globin chains are not synthesized. 38