lecture6_GENE CONTROL BY DNA (2015)Apunte Inglés
Vista previa del texto
Molecular biology – lecture 6
GENE CONTROL BY DNA-PROTEIN INTERACTIONS
1. INTRODUCTION - how does a cell determine which of its thousands of genes to describe?
As we’ve seen, the transcription of each gene is controlled by a regulatory region of DNA
relatively near the site where transcription begins (promoter region). Some regulatory regions
are simple and act like switches thrown by a single signal but many others are more complex.
Wither they are complex or simple, these switching devices are found in all cells and are composed of two types of fundamental components.
- Short stretches of DNA of defined sequence Gene regulatory proteins that reorganize and bind to this DNA 2. DNA BINDING DOMAINS The specific interaction between proteins and DNA takes place mostly through hydrogen bonds between certain amino acids of the protein and radical nitrogen bases in DNA. These amino acids should make contact with elements of the nitrogen base that are not involved in the interaction between the two nitrogen bases.
Non-specific interaction does not read the sequence of the DNA, for example the interaction between DNA and a histone. On the contrary, the specific interaction reads it.
Gene regulatory proteins must recognize specific nucleotide sequences embedded within the DNA structure. When we look at an aloha-helix protein, we can see that there are different radicals of the different amino acids that compound the protein. These amino acids have specific radicals and are able to contact with DNA sequence information. We have to take into account that the edge of each bas pair is exposed at the surface of the double helix, which have different residues (chemical groups) that can make chemical interactions with the protein radicals, mostly through hydrogen bonds. The residues are formed by: - Chemical groups with slightly positive charges (e- acceptor, H+ donor) Chemical groups with slightly negative charges (e- donor, H+ acceptor) Chemical groups that create hydrophobic interactions (methyl groups) Neutral groups Proteins in both, the major and the minor groove, can reorganize all these groups. The major groove is much more informative than the minor groove so proteins make specific contacts with the major groove. In the minor groove, because of the number of residues that are able to accept electrons (there is 1 at each side) the protein is no able to differentiate the nitrogen bases. In the major groove the nitrogen bases backbones are far apart while in the minor groove they are close together. They twist on opposite site.
Molecular biology – lecture 6 Example: If my protein has to detect a G, in the major groove it would not have any problem because the protein is not going to confuse the G with a C because each of the fou.base par configurations offers a unique pattern of features but in the minor groove the protein cannot differentiate between G and C because the patterns are similar.
With the major group there is no confusion because each combination has a different pattern, put in the minor group GC and CG has the same pattern and we have the same problem with AT and TA.
Like this, a specific nucleotide can be “read” as a pattern od molecular features on the surface of the DNA double helix.
3. HELIX-TURN-HELIX The first DNA-binding protein motif to be recognized was the helix-turn helix (HTH), this domain is present in many proteins from bacteria and also in our cells. It is constructed from two alpha-helices connected by a short extended chain of amino acids which constitutes a “turn”.
- - Recognition helix: is the one that fits into the major groove so it will read the sequence and make interactions with the nitrogen bases. Make specific interactions.
Stabilizing helix: interacts with the phosphate groups and the deoxyribose. These are not specific and can happen in any region of the DNA. They are used to stabilize the interaction between the protein and the DNA and don’t depend on the sequence of the DNA.
Molecular biology – lecture 6 If we calculate how many places in a genome can a protein bind in an specific sequence of DNA, the probability will be (1/4)n x3GB (being n the number of bases on DNA interacting with the protein, and taking into account that the DNA has 4 different nitrogen bases). So, in our genome a sequence of 5 bases would be found 3 million times in average. This means that the HTH is not specific enough.
A protein can interact with specific sequence because it is able to recognize specific bases. If a sequence is very frequent, the transcription factor will go to too many regions in our genome as to regulate a reduced number of genes. To avoid this, proteins make dimers that interact with palindromic sequences of NDA.
The probability of a 10-base sequence is 10-6 and it would be present in our genome only about 3000 times on average. So with the formation of a dimer the specificity is increased.
The protein that forms the dimer is the same as the first one but its rotated 180º to read the palindromic sequence.
Resume: if there was only a protein, there will be too many places (too many same sequences) in which the protein could bind, so that’s why a dimer is needed to increase the number of bases in the sequence and the specificity.
4. LEUCINE ZIPPER This leucine zipper is responsible for dimerization and also for DNA binding and is named like this because of the way the two alpha helices (one from each monomer) are joined together to form a sort coiled-coil. The helices are held together by interactions between hydrophobic amino acid side chains (often on leucine) that extend from one side of each helix. Leucines are very hydrophobic so they can make a zipper. Thanks to the zipper that unites two protein monomers, the dimer can recognize many more bases on DNA. This is also a way to increase variability because the interaction between two protein helices is not specific as long as they both contain leucine zipper.
Both, leucine zipper and HTH allow the cells to originate molecules to control hybrid sequences.
Molecular biology – lecture 6 5. ZINC FINGERS The Zn finger is quiet common in our genes. It forms a structure in which two aloha-helices are packed with zinc atoms. The Zn is a very important component that is coordinated by cytosine and histidine. The amino acids that are in between positions 6 and 19 create a finger (diagram). Proteins that interact with the DNA don’t contain only one finger; they have many fingers to make contact with different sequences along large regions of NDA. Like this, our transcription factors can recognize long sequences of DNA. As usual, the radicals of the amino acids make contact with the bases mostly in the major groove of DNA.
6. HOW GENETICS SWITHCES WORK We can differentiate two different types of gene considering the type of transcriptional control: - Negative control: the specific transcriptional factor acts as a repressor. This kind of control is frequent in bacteria (prokaryotes). It is a protein that is able to bind NDA very close to the promoter and by binding DNA it prevents RNA polymerases to do their job. They prevent transcription.
- Positive control: proteins have a positive function. They are activators of specific transcription factors. These are more frequent in eukaryotes. They activate transcription.
Molecular biology – lecture 6 These control systems are not exclusive; genes can be controlled by these two systems at once.
Why do our cells mostly use positive control? For our cell it is cheaper to activate the genes that a particular cell type needs (a hepatocyte for instance) than inhibit all the other genes that the cell doesn’t need to express (the hepatocyte doesn’t need to express these genes specific for neurons, skin cells, blood cells…) Why do bacteria mostly use negative controls? Bacteria are unicellular organisms so one cell has to make all the functions and usually expresses many of its genes. Bacteria have to react in seconds to the environment, so a negative control is quicker because you only need to generate a repressor instead of generating all the transcription factors that activate transcription.
7. EXAMPLE OF NEGATIVE CONTROL SYSTEM IN E.COLI In this case, we study how does “E. coli” reacts in front of lactose (lac operon). The lac operon is a paradigm among negative control systems in prokaryotes. The operon is a group of genes expressed from the same promoter (P) by means of an mRNA that code for all of them. In this case the lac operon codes for the genes that cerate the following proteins: - Beta-galactosidase Permease Transacetylase These proteins are used by the bacteria to convert lactose into glucose to obtain energy. If the bacteria feels that there’s no lactose in the medium, the genes that codify for the proteins used, are not expressed, but when there’s lactose in the medium, the genes that codify for proteins are expressed.
Process: there is a DNA sequence called operator that is bound by the repressor protein (in this case Lacl) and has a symmetry axis (palindromic).
This sequence overlaps with the promoter sequences: Molecular biology – lecture 6 - Repressed state: the repressor acts like a tetramer. This tetramer binds the operator using two monomers. These two monomers block transcription.
- Induced state: the repressor has a domain (cavity) that can bind lactose. When lactose binds to the repressor, it induces a conformational change into the repressor. This change causes the unbinding of the repressor from NDA and consequently RNA polymerases are free to bind the promoter and start transcription.
8. TETRAMER STRUCTURE OD REPRESSORS If the repressor was a monomer the presence of lactose would cause a continuous increase of repressor + lactose complex until saturation. With small quantities of lactose, there will be small quantities of repressor + lactose complex (it would be proportional) and for the cell this would not be efficient because the genes that are implicated in the utilization of lactose would be activated, consuming energy constantly.
To avoid this bacteria have tetramers. Lactose binds to the tetramer structure in a different way, following as sigmoidal graph. This is called allosterism and is a key property. With this system, small quantities will not activate the operon. Like this, bacteria will not spend a lot of energy producing the necessary proteins and will only produce proteins if the lactose concentration is high enough.
Molecular biology – lecture 6 9. EXAMPLE OF POSITIVE CONTROL SYSTEM IN E.COLI The lac operon is not only controlled by this negative control, it is also under the control of a positive control where some other proteins take a role.
In this case, another protein called CAP is able to bind DNA, but only in the presence of cyclic AMP (cAMP).
The presence of glucose will activate a molecule that is able to reduce the cAMP levels. If there is glucose the cell will activate a molecule that reduces the levels of cAMP and in consequence, CAP will not bind DNA. Without CAP, the RNA-polymerase can’t bind the DNA and can’t start transcription We can deduce that in the lac operon the presence of glucose inactivates the expression of the genes and the presence of lactose activates the expression of genes.
10. CELL SITUATIONS a) Presence of glucose and lactose: although lactose inactivates the repressor, cAMP levels are low and CAP is inactivated, so RNA polymerase can’t bind DNA and genes are not expressed. When there is glucose, the cell doesn’t want to spend energy creating the enzymes for lactose so there will not be expression of the genes.
b) Presence of glucose but not lactose: CAP is inactivated (RAN polymerase can’t bind DNA) and the repressor is activated (RNA polymerase can´t bind DNA). There would not be expression of the genes. When there is glucose, the cell can obtain energy from it and as there is no lactose it’s not necessary the production of the enzymes.
c) Absence of both glucose and lactose: CAP is activated (RNA polymerase can bind to DNA) but the repressor is activated (RNA polymerase can’t bind DNA). There would not be expression of genes. In this situation the cell is going to obtain the energy from another source that would require expression of other genes.
d) Presence of lactose but not glucose: CAP is activated (RNA polymerase can bind DNA) and the repressor is inactivated (RNA polymerase can bind DNA). This is the only situation where there would be expression of the genes.
Molecular biology – lecture 6 11. E.COLI SIGMA FACTORS E coli has several sigma factors (proteins capable of recognizing promoters with different consensus sequences), although sigma 17 is the general one.
There are some sigma factors that are only activated in front of a heat shock, so the genes that depend on the activation of sigma factors will only be expressed if there is a heat shock. The heat shock sigma factors at their basal conformation can’t bind DNA but if the temperature increases, they can bind DNA and work as sigma factors.
The group of genes regulated by the sigma factors is called regulons. Regulons are groups of different genes each of them with different promoters but they have the same sigma factor 35 and -10 sequences.