Genome Sequences and Gene Number

Slides:



Advertisements
Similar presentations
Introduction to genomes & genome browsers
Advertisements

The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
Genomics, Genetics and Biochemistry
© Wiley Publishing All Rights Reserved. Using Nucleotide Sequence Databases.
Human Genome Project What did they do? Why did they do it? What will it mean for humankind? Animation OverviewAnimation Overview - Click.
Describe the structure of a nucleosome, the basic unit of DNA packaging in eukaryotic cells.
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Genome Browsers UCSC (Santa Cruz, California) and Ensembl (EBI, UK)
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
What is genomics? Study of genomes. What is the genome? Entire genetic compliment of an organism.
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Eukaryotic Gene Expression The “More Complex” Genome.
Chapter 5 Genome Sequences and Gene Numbers. 5.1Introduction  Genome size vary from approximately 470 genes for Mycoplasma genitalium to 25,000 for human.
Eukaryotic Gene Control. Developmental pathways of multicellular organisms: All cells of a multicellular organism start with the same complement of DNA.
TEST 4 review. _____ 1. A prophage is a(n) a. emerging virus. b. type of retrovirus. c. prion that has been integrated into a bacterial cell's chromosome.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
Genomics Lecture 8 By Ms. Shumaila Azam. 2 Genome Evolution “Genomes are more than instruction books for building and maintaining an organism; they also.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
What does the word Promoter mean? It is the place at which RNA Pol II binds. But the word is incorrectly used to describe Enhancers plus Promoter.
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Chapter 21 Eukaryotic Genome Sequences
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Lecture 10 Genes, genomes and chromosomes
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Changes in the Eukaryotic Genome By: Sergio Aguilar.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
MPL The DNA Sequence of chimpanzee chromosome 22 and comparative analysis with its human ortholog, chromosome 21 Bioinformatics Dae-Soo Kim.
Chapter 3 The Interrupted Gene.
How many genes are there?
Genetics and Genomics Forward genetics Reverse genetics
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
Who is smarter and does more tricks you or a bacteria? YouBacteria How does my DNA compare to a prokaryote? Show-off.
Molecular structure of gene and chromosome Gene: In molecular terms, a gene is the entire DNA sequence required for synthesis of functional protein or.
Aim: How is DNA organized in a eukaryotic cell?. Why is the control of gene expression more complex in eukaryotes than prokaryotes ? Eukaryotes have:
Chapter 18.1 Contributors of Genetic Diversity in Bacteria.
The genome of prokaryotes and eukaryotes- nuclear and extranuclear genetic organization.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Gene Structure and Regulation. Gene Expression The expression of genetic information is one of the fundamental activities of all cells. Instruction stored.
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
Ch 12: Genomes.
Genomes and their evolution
Sunday, Tuesday & Thursday 2-3
Chapter 5 The Content of the Genome
Genomes and Their Evolution
EL: To find out what a genome is and how gene expression is regulated
The Role of Recombinant DNA Technology in Biotechnology
Chapter 13: transposable elements
Genomics 20.
Genomes and Their Evolution
Genomes Genes and Alleles
SGN23 The Organization of the Human Genome
Genomes and Their Evolution
Lecture 6 By Ms. Shumaila Azam
Today… Review a few items from last class
Genomes and Their Evolution
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Evolution of eukaryote genomes
Chapter 4 The Interrupted Gene.
AH Biology: Unit 1 Proteomics and Protein Structure 1
Chapter 6 Genome Sequences and Gene Numbers
Extra chromosomal Agents Transposable elements
Evolution of Genomes Chapter 21.
The Content of the Genome
Unit Genomic sequencing
Eukaryotic Gene Regulation
The Content of the Genome
Presentation transcript:

Genome Sequences and Gene Number Chapter 6 Genome Sequences and Gene Number Jocelyn E. Krebs

Figure 06.CO: Mouse and human chromosomes. Photo courtesy of U.S. Department of Energy. Used with permission of Lisa J. Stubbs, University of Illinois at Urbana-Champagne.

Figure 5.01: Minimum gene numbers range from 500 to 30,000.

Figure 06.01_01: The minimum gene number required for any type of organism increases with its complexity. Photo of intracellular bacterium courtesy of Gregory P. Henderson and Grant J. Jensen, California Institute of Technology.

Free-living bacterium - 1,500 genes.(upto 7500) Figure 06.01_02: The minimum gene number required for any type of organism increases with its complexity. Courtesy of Rocky Mountain Laboratories, NIAID, NIH

Figure 06.01_03: The minimum gene number required for any type of organism increases with its complexity. Photo of unicellular eukaryote courtesy of Eishi Noguchi, Drexel University College of Medicine.

Figure 06.01_04: The minimum gene number required for any type of organism increases with its complexity. Photo of multicellular eukaryote courtesy of Carolyn B. Marks and David H. Hall, Albert Einstein College of Medicine, Bronx, NY.

C. elegans; 예쁜 꼬마 선충

Higher plants - 25,000 genes. Figure 06.01_05: The minimum gene number required for any type of organism increases with its complexity. Photo of higher plant courtesy of Keith Weller/USDA.

Mammals - 25,000 genes. Figure 06.01_06: The minimum gene number required for any type of organism increases with its complexity. Photo of mammal © Photodisc.

6.2 Prokaryotic Gene Numbers Range Over an Order of Magnitude Parasites- low genome size (>1.5Mb) Metabolic enzyme-coding genes are reduced Figure 06.02: Genome sizes and gene numbers are known from complete sequences for several organisms. Lethal loci are estimated from genetic data.

In prokaryotic cells, genome and gene show co-relationship Archeae; 1500 gene; mininum requirement for independent living Nitorgen fixation bacteria has large genome (7Mb) and gene (>7500); similar to yeast E.coli; 4288 genes, gene-to gene span-118bp, average gene size; 950bp Pathogenic island Horizontal transfer Figure 06.03: The number of genes in bacterial and archaeal genomes is proportional to genome size.

pathogenicity islands – DNA segments that are present in pathogenic bacterial genomes but absent in their nonpathogenic relatives. horizontal transfer – The transfer of DNA from one cell to another by a process other than cell division, such as bacterial conjugation.

6.3 Total Gene Number Is Known for Several Eukaryotes There are 6000 genes in yeast; 18,500 in a worm; 14,000 in a fly; 25,000 in the small plant Arabidopsis; and probably 20,000 to 25,000 in humans. Eukaryotic genes Yeast; 1.4 kb (ORF), 12.5-13.5 Mb, 5000-6000 genes 70% of genome is coding region Cerevisiae vs S. pombe 5% intron vs 43% intron Figure 06.04: The number of genes in a eukaryote varies from 6000 to 40,000 but does not correlate with the genome size or the complexity of the organism.

Figure 06. 05: The S. cerevisiae genome of 13 Figure 06.05: The S. cerevisiae genome of 13.5 Mb has 6000 genes, almost all uninterrupted. The S. pombe genome of 12.5 Mb has 5000 genes, almost half having introns.

C.Elegans (18500) vs D.melanogaster (14000) monocistronic mRNA – mRNA that codes for one polypeptide. polycistronic mRNA – mRNA that includes coding regions representing more than one gene. Figure 06.06: ~20% of Drosophila genes code for proteins concerned with maintaining or expressing genes, ~20% code for enzymes, <10% code for proteins concerned with the cell cycle. Adapted from Drosophila 12 Consortium, “Evolution of genes and genomes on the Drosophila phylogeny,” Nature 450 (2007): 203-218.

6.4 How Many Different Types of Genes Are There? The sum of the number of unique genes and the number of gene families is an estimate of the number of types of genes. Family gene Figure 06.07: Many genes are duplicated, and as a result the number of different gene families is much less than the total number of genes.

Figure 06.08: The proportion of genes that are present in multiple copies increases with genome size in multicellular eukaryotes.

Figure 06.09: Comparison of fly genes to those of the worm and yeast. orthologous genes (orthologs) – Related genes in different species. The minimum size of the proteome can be estimated from the number of types of genes. Figure 06.09: Comparison of fly genes to those of the worm and yeast.

6.5 The Human Genome Has Fewer Genes Than Originally Expected Only 1% of the human genome consists of exons. The exons comprise ~5% of each gene, so genes (exons plus introns) comprise ~25% of the genome. The human genome has 20,000 to 25,000 genes. Figure 06.10: Genes occupy 25% of the human genome, but protein-coding sequences are only a tiny part of this fraction.

60% gene show alternative splicing Increase of gene in human by alternative splicing, alternative promoter selection, alternative poly-A 60% gene show alternative splicing Up to 80% of the alternative splices change protein sequence, so the proteome has ~50,000 to 60,000 members. Figure 06.11: The average human gene is 27 kb long and has nine exons, usually comprising two longer exons at each end and seven internal exons.

6.6 How Are Genes and Other Sequences Distributed in the Genome? Repeated sequences (present in more than one copy) account for >50% of the human genome. The great bulk of repeated sequences consist of copies of nonfunctional transposons. There are many duplications of large chromosome regions. Figure 06.12: The largest component of the human genome consists of transposons. Other repetitive sequences include large duplications and simple repeats.

Figure 06.12: The largest component of the human genome consists of transposons. Other repetitive sequences include large duplications and simple repeats.

6.7 The Y Chromosome Has Several Male-Specific Genes The Y chromosome has ~60 genes that are expressed specifically in the testis. The male-specific genes are present in multiple copies in repeated chromosomal segments. Gene conversion between multiple copies allows the active genes to be maintained during evolution. Figure 06.13: The Y chromosome consists of X-transposed regions, X-degenerate regions, and amplicons.

Figure 06.B01: Distribution of Y-chromosome haplotypes presumed to have descended from Genghis Khan or close male relatives, among populations near and bordering the ancient Mongols. Adapted from Zerjal, T., Am. J. Hum. Genet. 72 (2003): 717-721

6.8 How Many Genes Are Essential? Not all genes are essential. In yeast and flies, deletions of <50% of the genes have detectable effects. When two or more genes are redundant, a mutation in any one of them may not have detectable effects. We do not fully understand the persistence of genes that are apparently dispensable in the genome.

Figure 06.14: Essential yeast genes are found in all classes.

Figure 06.15: A systematic analysis of loss of function for 86% of worm genes shows that only 10% have detectable effects on the phenotype.

In human, 4000-8000 genes seems to be essential Figure 06.16: Most know genetic defects in human genes are due to point mutations.

When two or more genes are redundant, a mutation in any one of them may not have detectable effects. Redundancy

synthetic lethal – Two mutations that are viable by themselves but cause lethality when combined. synthetic genetic array analysis (SGA) – An automated technique in budding yeast whereby a mutant is crossed to an array of approximately 5000 deletion mutants to determine if the mutations interact to cause a synthetic lethal phenotype. Figure 06.17: All 132 mutant test genes have some combinations that are lethal when they are combined with each of 4700 nonlethal mutations.

6.9 About 10,000 Genes Are Expressed at Widely Differing Levels in a Eukaryotic Cell In any given cell, most genes are expressed at a low level. scarce (complex) mRNA – mRNA that consists of a large number of individual mRNA species, each present in very few copies per cell. This accounts for most of the sequence complexity in RNA. Only a small number of genes, whose products are specialized for the cell type, are highly expressed. abundance – The average number of mRNA molecules per cell. abundant mRNA – Consists of a small number of individual species, each present in a large number of copies per cell.

The abundantly expressed mRNAs are usually specific for the cell type. mRNAs expressed at low levels overlap extensively when different cell types are compared. housekeeping gene – A gene that is (theoretically) expressed in all cells because it provides basic functions needed for sustenance of all cell types. The abundantly expressed mRNAs are usually specific for the cell type. luxury gene – A gene coding for a specialized function, synthesized (usually) in large amounts in particular cell types. ~10,000 expressed genes may be common to most cell types of a multicellular eukaryote.

Figure 06.18: The abundances of yeast mRNAs vary from <1 per cell (meaning that not every cell has a copy of the mRNA) to >100 per cell (coding for the more abundant proteins.)

6.10 Expressed Gene Number Can Be Measured En Masse DNA microarray technology allows a snapshot to be taken of the expression of the entire genome in a yeast cell. ~75% (~4500 genes) of the yeast genome is expressed under normal growth conditions. DNA microarray technology allows detailed comparisons of related animal cells to determine (for example) the differences in expression between a normal cell and a cancer cell.

Figure 06.19: Heat map of 59 invasive breast tumors from women who breastfed ≥6 months (red lines above map) or who never breastfed (blue lines). Different tumor subtypes are denoted by the blue, green, red, and purple bars above the map. In the map, the expression of a number of genes (listed at the right) in the tumor is compared to their expression in normal breast tissue: red = higher expression, blue = lower expression, grey = equal expression.