Chapter 4: Clusters and repeats. 4.1 Introduction  Duplication of DNA is a major force in evolution.

Slides:



Advertisements
Similar presentations
Evolution of genomes.
Advertisements

Genomics – The Language of DNA Honors Genetics 2006.
DNA Organization Lec 2. Aims The aims of this lecture is to investigate how cells organize their DNA within the cell nucleus, how is the huge amount of.
The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
The Nucleus Nuclear Organization Nuclear Envelope and Molecular Trafficking Nucleolus and rRNA Processing The nucleus is one of the main features that.
Retroviruses and Retroposons Chapter Introduction Figure 22.1.
Describe the structure of a nucleosome, the basic unit of DNA packaging in eukaryotic cells.
Duplication, rearrangement, and mutation of DNA contribute to genome evolution Chapter 21, Section 5.
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Stability of the Genome Duplication, Deletion, Transposition.
Genome Structure 12 Jan, Nature of DNA Transformation (uptake of foreign DNA) in prokaryotes and eukaryotes has repeatedly shown that DNA is hereditary.
BIO 402/502 Advanced Cell & Developmental Biology I Section IV: Dr. Berezney.
Lecture Outline 12/7/05 The human genome
Chromosome Structure Variations. Causes and Problems Chromosome structure variations result from chromosome breakage. Broken chromosomes tend to re-join;
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
Chapter 2 Genes Encode RNAs and Polypeptides
Chapter 19: Eukaryotic Genomes Most gene expression regulated through transcription/chromatin structure Most gene expression regulated through transcription/chromatin.
Chapter 1 GENE PART IV CLUSTERS AND REPEATS. Introduction Key Terms A gene family consists of a set of genes whose exons are related; the members were.
Chapter 2: From genes to Genomes. 2.1 Introduction.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section B: Genome Organization at the DNA Level 1.Repetitive DNA and other noncoding.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Genomes & their evolution Ch 21.4,5. About 1.2% of the human genome is protein coding exons. In 9/2012, in papers in Nature, the ENCODE group has produced.
Used for detection of genetic diseases, forensics, paternity, evolutionary links Based on the characteristics of mammalian DNA Eukaryotic genome 1000x.
Chapter 21 Eukaryotic Genome Sequences
Eukaryotic Genomes 15 November, 2002 Text Chapter 19.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Genomics and Forensics
Molecular and Genomic Evolution Getting at the Gene Pool.
Lecture 10 Genes, genomes and chromosomes
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Eukaryotic Genomes: The Organization and Control.
Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings PowerPoint ® Lecture Presentations for Biology Eighth Edition Neil Campbell.
Chapter 2 From Genes to Genomes. 2.1 Introduction We can think about mapping genes and genomes at several levels of resolution: A genetic (or linkage)
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Structure of the Genome Denaturation, Renaturation and Complexity.
MPL The DNA Sequence of chimpanzee chromosome 22 and comparative analysis with its human ortholog, chromosome 21 Bioinformatics Dae-Soo Kim.
Evolution at the Molecular Level
Differences in DNA Heterochromatin vs. Euchromatin
Chapter 3 The Interrupted Gene.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
How many genes are there?
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
LECTURE PRESENTATIONS For CAMPBELL BIOLOGY, NINTH EDITION Jane B. Reece, Lisa A. Urry, Michael L. Cain, Steven A. Wasserman, Peter V. Minorsky, Robert.
Chapter 4 Clusters and repeats.
Aim: How is DNA organized in a eukaryotic cell?. Why is the control of gene expression more complex in eukaryotes than prokaryotes ? Eukaryotes have:
Objective: I can explain how genes jumping between chromosomes can lead to evolution. Chapter 21; Sections ; Pgs Genomes: Connecting.
Chapter 2 Genes Code for Proteins. 2.1Introduction Early work measuring recombination frequencies between genes led to the establishment of “linkage groups”:
Fig Prokaryotes and Eukaryotes
Chapter 7 Clusters and Repeats
Evolution of eukaryotic genomes
Chapter 7 Clusters and Repeats Jocelyn E. Krebs.
Chapter 7 Clusters and Repeats
Differences in DNA Heterochromatin vs. Euchromatin
Genomes and Their Evolution
Genomes and Their Evolution
SGN23 The Organization of the Human Genome
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Organization of the human genome
Chapter 9 Organization of the Human Genome
Chapter 4 The Interrupted Gene.
Eukaryotic Genomes: The Organization and Control.
Gene Density and Noncoding DNA
Genes Code for Proteins
Evolutionary genetics
Genes Encode RNAs and Polypeptides
Extra chromosomal Agents Transposable elements
Chapter 6 Clusters and Repeats.
Forensic DNA Sadeq Kaabi
Presentation transcript:

Chapter 4: Clusters and repeats

4.1 Introduction  Duplication of DNA is a major force in evolution

Figure 4.1 After a gene has been duplicated, differences may accumulate between the copies. The genes may acquire different functions or one of the copies may become inactive.

 gene family: A set of genes descended by duplication and variation from some ancestral gene.  Gene clusters :a duplication has generated two adjacent related genes, hundreds of identical genes lie in a tandem array.  Recombination is a key event in evolution of the genome.

Figure 4.2 Chiasma formation is responsible for generating recombinants.

Figure 4.3 Recombination involves pairing between complementary strands of the two parental duplex DNAs.

Figure 4.4 Unequal crossing-over results from pairing between non-equivalent repeats in regions of DNA consisting of repeating units. Here the repeating unit is the sequence ABC, and the third repeat of the red allele has aligned with the first repeat of the blue allele. Throughout the region of pairing, ABC units of one allele are aligned with ABC units of the other allele. Crossing-over generates chromosomes with 10 and 6 repeats each, instead of the 8 repeats of each parent. How does the genome change its content of genes?

4.2 Gene clusters are formed by duplication and divergence

Figure 4.5 All functional globin genes have an interrupted structure with three exons. The lengths indicated in the figure apply to the mammalian  -globin genes.

Figure 4.6 Each of the  - like and  -like globin gene families is organized into a single cluster that includes functional genes and pseudogenes ( . Pseudogenes: inactive but stable components of the genome derived by mutation of an ancestral active gene.

Figure 4.7 Clusters of  -globin genes and pseudogenes are found in vertebrates. Seven mouse genes include 2 early embryonic, 1 late embryonic, 2 adult genes, and 2 pseudogenes. Rabbit and chick each have four genes.

Figure 4.8 All globin genes have evolved by a series of duplications, transpositions, and mutations from a single ancestral gene.

4.3 Sequence divergence is the basis for the evolutionary clock

Figure 4.9 Divergence of DNA sequences depends on evolutionary separation. Each point on the graph represents a pairwise comparison.

Figure 4.10 All globin genes have evolved by a series of duplications, transpositions, and mutations from a single ancestral gene.

Figure 4.11 Replacement site divergences between pairs of  -globin genes allow the history of the human cluster to be reconstructed. This tree accounts for the separation of classes of globin genes.

4.4 Pseudogenes are dead ends of evolution Pseudogenes (Ψ) : Some DNA sequences that are related to those of the functional genes, but that cannot be translated into a functional protein.

Figure 4.12 Many changes have occurred in a β globin gene since it became a pseudogene.

Figure 4.13 Pseudogenes could arise by reverse transcription of RNA to give duplex DNAs that become integrated into the genome.

If pseudogenes are evolutionary dead ends, simply an unwanted accompaniment to the rearrangement of functional genes, why are they still present in the genome?  survived in present populations, in past times, any number of other pseudogenes may have been eliminated.

4.5 Unequal crossing-over rearranges gene clusters

Figure 4.14 Clusters of β -globin genes and pseudogenes are found in vertebrates. Seven mouse genes include 2 early embryonic, 1 late embryonic, 2 adult genes, and 2 pseudogenes. Rabbit and chick each have four genes.

Figure 4.15 Gene number can be changed by unequal crossing- over. If gene 1 of one chromosome pairs with gene 2 of the other chromosome, the other gene copies are excluded from pairing, as indicated by the extruded loops. Recombination between the mispaired genes produces one chromosome with a single (recombinant) copy of the gene and one chromosome with three copies of the gene (one from each parent and one recombinant).

Figure 4.16 Thalassemias result from various deletions in the  -globin gene cluster.

Figure 4.17 Deletions in the  -globin gene cluster cause several types of thalassemia.

4.6 Genes for rRNA form tandem repeats two cases of large gene clusters that contain many identical copies of the same gene or genes: histone proteins: Number: rRNA genes varies from 7 in E. coli, 100~200 in lower eukaryotes, to several hundred in higher eukaryotes. A point of major interest is what mechanism(s) are used to prevent variations from accruing in the individual sequences. ???

Figure 4.18 A tandem gene cluster has an alternation of transcription unit and nontranscribed spacer and generates a circular restriction map.

Figure 4.19 The nucleolar core identifies rDNA under transcription, and the surrounding granular cortex consists of assembling ribosomal subunits. This thin section shows the nucleolus of the newt Notopthalmus viridescens.

Figure 4.20 Transcription of rDNA clusters generates a series of matrices, each corresponding to one transcription unit and separated from the next by the nontranscribed spacer.

Figure 4.21 The nontranscribed spacer of X. laevis rDNA has an internally repetitious structure that is responsible for its variation in length.

4.7 Crossover fixation could maintain identical repeats

crossover fixation model: an entire cluster is subject to continual rearrangement by the mechanism of unequal crossing-over

Figure 4.23 Unequal recombination allows one particular repeating unit to occupy the entire cluster. The numbers indicate the length of the repeating unit at each stage.

4.8 Satellite DNAs often lie in heterochromatin

Figure 4.24 Mouse DNA is separated into a main band and a satellite by centrifugation through a density gradient of CsCl.

Figure 4.25 Individual bands containing particular genes can be identified by in situ hybridization

Figure 4.26 Cytological hybridization shows that mouse satellite DNA is located at the centromeres.

4.9 Arthropod satellites have very short identical repeats

Figure 4.27 Satellite DNAs of D. virilis are related. More than 95% of each satellite consists of a tandem repetition of the predominant sequence

4.10 Mammalian satellites consist of hierarchical repeats

Figure 4.29 The repeating unit of mouse satellite DNA contains two half-repeats, which are aligned to show the identities (in color).

Figure 4.30 The alignment of quarter-repeats identifies homologies between the first and second half of each half- repeat. Positions that are the same in all 4 quarter-repeats are shown in red; identities that extend only through 3 quarter-repeats are indicated by grey letters in the pink area.

Figure 4.31 The alignment of eighth-repeats shows that each quarter-repeat consists of an a and a b half. The consensus sequence gives the most common base at each position. The "ancestral" sequence shows a sequence very closely related to the consensus sequence, which could have been the predecessor to the a and b units. (The satellite sequence is continuous, so that for the purposes of deducing the consensus sequence, we can treat it as a circular permutation, as indicated by joining the last GAA triplet to the first 6 bp.)

G A A A A A C G T G A A A A A T G A G A A A A A A C T Figure 4.32 The existence of an overall consensus sequence is shown by writing the satellite sequence in terms of a 9 bp repeat. AG TC

Figure 4.33 Digestion of mouse satellite DNA with the restriction enzyme EcoRII identifies a series of repeating units (1, 2, 3) that are multimers of 234 bp and also a minor series (½, 1½, 2½) that includes half-repeats (see text later). The band at the far left is a fraction resistant to digestion.

4.11 Minisatellites are useful for genetic mapping minisatellite or VNTR (variable number tandem repeat)

For example, one minisatellite has a repeat length of 64 bp, and is found in the population with the following distribution: 7% 18 repeats 11% 16 repeats 43% 14 repeats 36% 13 repeats 4% 10 repeats

Figure 4.34 Alleles may differ in the number of repeats at a minisatellite locus, so that cleavage on either side generates restriction fragments that differ in length. By using a minisatellite with alleles that differ between parents, the pattern of inheritance can be followed. 1

4.12 Summary  Almost all genes belong to families, defined by the possession of related sequences in the exons of individual members. Families evolve by the duplication of a gene (or genes), followed by divergence between the copies. Some copies suffer inactivating mutations and become pseudogenes that no longer have any function. Pseudogenes also may be generated as DNA copies of the mRNA sequences.

 An evolving set of genes may remain together in a cluster or may be dispersed to new locations by chromosomal rearrangement. The organization of existing clusters can sometimes be used to infer the series of events that has occurred. These events act with regard to sequence rather than function, and therefore include pseudogenes as well as active genes.

 Mutations accumulate more rapidly in silent sites than in replacement sites (which affect the amino acid sequence). The rate of divergence at replacement sites can be used to establish a clock, calibrated in percent divergence per million years. The clock can then be used to calculate the time of divergence between any two members of the family.

 A tandem cluster consists of many copies of a repeating unit that includes the transcribed sequence(s) and a nontranscribed spacer(s). rRNA gene clusters code only for a single rRNA precursor. Maintenance of active genes in clusters depends on mechanisms such as gene conversion or unequal crossing-over that cause mutations to spread through the cluster, so that they become exposed to evolutionary pressure.

 Satellite DNA consists of very short sequences repeated many times in tandem. Its distinct centrifugation properties reflect its biased base composition. Satellite DNA is concentrated in centromeric heterochromatin, but its function (if any) is unknown. The individual repeating units of arthropod satellites are identical. Those of mammalian satellites are related, and can be organized into a hierarchy reflecting the evolution of the satellite by the amplification and divergence of randomly chosen sequences.

 Unequal crossing-over appears to have been a major determinant of satellite DNA organization. Crossover fixation explains the ability of variants to spread through a cluster. Minisatellites have properties similar to satellites, but are much smaller; they are useful for human genomic mapping