Kinetics and Components

Slides:



Advertisements
Similar presentations
LINEs and SINEs ….& towards cancer! Presenter: Manindra Singh Course: MCB 720 (Winter Qt.)
Advertisements

Genomics – The Language of DNA Honors Genetics 2006.
DNA Organization Lec 2. Aims The aims of this lecture is to investigate how cells organize their DNA within the cell nucleus, how is the huge amount of.
The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
Chapter 7b - Transposable elements:
Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Retroviruses and Retroposons Chapter Introduction Figure 22.1.
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Stability of the Genome Duplication, Deletion, Transposition.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 18 LECTURE SLIDES.
GENE DUPLICATIONS A.Non-homologous recombination B.Transposition C.Non-disjunction in meiosis.
ECE 501 Introduction to BME
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
Retroviruses and Retroposons
Online Counseling Resource YCMOU ELearning Drive… School of Architecture, Science and Technology Yashwantrao C havan Maharashtra Open University, Nashik.
Genomic Organization at the DNA level! By: Caroline Fowle, Amanda Zink, Ben Whitfield, Farvah Khaja and Danielle Siegert.
Organization of the human genome Genome structure Nuclear vs. mitochondrial genomes Gene families Transposable elements Other repeated sequences.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
GenomesGenomes Chapter 21. Genomes Sequencing of DNA Human Genome Project countries 20 research centers.
Eukaryotic Gene Expression The “More Complex” Genome.
Human Genetics The Human Genome 1.
Transposition Evidence Mechanisms: DNA-mediated RNA-mediated.
Selfish DNA Honors Genetics.
Eukaryotic Genomes Demonstrate Sequence Organization Characterized by Repetitive DNA Honors Genetics Lemon Bay High School
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
GenomesGenomes Chapter 21 Genomes Sequencing of DNA Human Genome Project countries 20 research centers.
Fig Genome = Genic + Intergenic (or non-genic) Eukaryotic genomes: composition of human genome.
Chapter 11: An Introduction to Human Genome.  A linear polynucleotide consisting of four types of monomeric nucleotides  Each nucleotide contains: 
Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section B: Genome Organization at the DNA Level 1.Repetitive DNA and other noncoding.
Genetics: Chromosome Organization. Chromosomes: Structures that contain the genetic material (DNA) Genome – complete set of genetic material in a particular.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Genomes & their evolution Ch 21.4,5. About 1.2% of the human genome is protein coding exons. In 9/2012, in papers in Nature, the ENCODE group has produced.
Used for detection of genetic diseases, forensics, paternity, evolutionary links Based on the characteristics of mammalian DNA Eukaryotic genome 1000x.
Chapter 21 Eukaryotic Genome Sequences
Non-Coding Areas & Mutations Within the human genome the majority of the DNA (~75%) is made up of sequences not involved in coding for proteins, RNA, or.
Lecture 9 Site Specific Recombination and Transposition Quiz 5 due today at 4 PM.
HUMAN GENOME Gene density 1/100 kb (vary widely); Averagely 9 exons per gene 363 exons in titin gene Many genes are intronsless Largest intron is 800.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Control of Eukaryotic Genome
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Structure of the Genome Denaturation, Renaturation and Complexity.
Differences in DNA Heterochromatin vs. Euchromatin
Chromosome, Genome and Cell Cycle
What you need to know: The major goals of the Human Genome Project How prokaryotic genomes compare to eukaryotic genomes. The activity and role of transposable.
 Larger genomes are not generated by increasing the number of copies of the same sequences present in smaller genomes.  It is due to the presence of.
Genomics Chapter 18.
How many genes are there?
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
 DNA- genetic material of eukaryotes.  Are highly variable in size and complexity.  About 3.3 billion bp in humans.  Complexity- due to non coding.
Objective: I can explain how genes jumping between chromosomes can lead to evolution. Chapter 21; Sections ; Pgs Genomes: Connecting.
Chromosome Organization & Molecular Structure. Chromosomes & Genomes Chromosomes complexes of DNA & proteins – chromatin Viral – linear, circular; DNA.
Organization of prokaryotic, eukaryotic and viral genomes
Organization of the human genome
Genomes and Their Evolution
Genomes and their evolution
Transposable Elements
Differences in DNA Heterochromatin vs. Euchromatin
Genomes and Their Evolution
SGN23 The Organization of the Human Genome
Transposable Elements And Transposition
Genomes and Their Evolution
Evolution of eukaryote genomes
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
What kinds of things have been learned?
Evolution of eukaryote genomes
Organization of the human genome
Lecture 11 LTRs Properties of Chromatin Telomeres.
Gene Density and Noncoding DNA
BSC1010: Intro to Biology I K. Maltz Chapter 21.

Evolution of Genomes Chapter 21.
Presentation transcript:

Kinetics and Components Genome Structure Kinetics and Components

Genome The genome is all the DNA in a cell. All the DNA on all the chromosomes Includes genes, intergenic sequences, repeats Specifically, it is all the DNA in an organelle. Eukaryotes can have 2-3 genomes Nuclear genome Mitochondrial genome Plastid genome If not specified, “genome” usually refers to the nuclear genome.

Genomics Genomics is the study of genomes, including large chromosomal segments containing many genes. The initial phase of genomics aims to map and sequence an initial set of entire genomes. Functional genomics aims to deduce information about the function of DNA sequences. Should continue long after the initial genome sequences have been completed.

Genomics vs. Genetics Genetics: study of inherited phenotypes Peter Goodfellow (1997, Nature Genetics 16:209-210): "...I would define genetics as the study of inheritance and genomics as the study of genomes. The latter informs the former and includes the sequencing of genomes. The concept of functional genetics is a tautology (the whole point of genetics is to link genes with phenotypes). Functional genomics is the attachment of information about function to knowledge of DNA sequence' paradoxically, genetics is a major tool for functional genomics."

Human genome 22 autosome pairs + 2 sex chromosomes 3 billion base pairs in the haploid genome Where and what are the 30,000 to 40,000 genes? Is there anything else interesting/important? From NCBI web site, photo from T. Ried, Natl Human Genome Research Institute, NIH

Components of the human Genome Human genome has 3.2 billion base pairs of DNA About 3% codes for proteins About 40-50% is repetitive, made by (retro)transposition What is the function of the remaining 50%?

The Genomics Revolution Know (close to) all the genes in a genome, and the sequence of the proteins they encode. BIOLOGY HAS BECOME A FINITE SCIENCE Hypotheses have to conform to what is present, not what you could imagine could happen. No longer look at just individual genes Examine whole genomes or systems of genes

Genomics, Genetics and Biochemistry Genetics: study of inherited phenotypes Genomics: study of genomes Biochemistry: study of the chemistry of living organisms and/or cells Revolution lauched by full genome sequencing Many biological problems now have finite (albeit complex) solutions. New era will see an even greater interaction among these three disciplines

Finding the function of genes

Genome Structure Distinct components of genomes Abundance and complexity of mRNA Normalized cDNA libraries and ESTs Genome sequences: gene numbers Comparative genomics

Much DNA in large genomes is non-coding Complex genomes have roughly 10x to 30x more DNA than is required to encode all the RNAs or proteins in the organism. Contributors to the non-coding DNA include: Introns in genes Regulatory elements of genes Multiple copies of genes, including pseudogenes Intergenic sequences Interspersed repeats

Distinct components in complex genomes Highly repeated DNA R (repetition frequency) >100,000 Almost no information, low complexity Moderately repeated DNA 10<R<10,000 Little information, moderate complexity “Single copy” DNA R=1 or 2 Much information, high complexity

Reassociation kinetics measure sequence complexity

Sequence complexity is not the same as length Complexity is the number of base pairs of unique, i.e. nonrepeating, DNA. E.g. consider 1000 bp DNA. 500 bp is sequence a, present in a single copy. 500 bp is sequence b (100 bp) repeated 5X a b b b b b |___________|__|__|__|__|__| L = length = 1000 bp = a + 5b N = complexity = 600 bp = a + b

Less complex DNA renatures faster Let a, b, ... z represent a string of base pairs in DNA that can hybridize. For simplicity in arithmetic, we will use 10 bp per letter. DNA 1 = ab. This is very low sequence complexity, 2 letters or 20 bp. DNA 2 = cdefghijklmnopqrstuv. This is 10 times more complex (20 letters or 200 bp). DNA 3 = izyajczkblqfreighttrainrunninsofastelizabethcottonqwftzxvbifyoudontbelieveimleavingyoujustcountthedaysimgonerxcvwpowentdowntothecrossroadstriedtocatchariderobertjohnsonpzvmwcomeonhomeintomykitchentrad. This is 100 times more complex (200 letters or 2000 bp).

Less complex DNA renatures faster, #2 For an equal mass/vol:

Kinetics of renaturation are 2nd order

Equations describing renaturation Let C = concentration of single-stranded DNA at time t (expressed as moles of nucleotides per liter). The rate of loss of single-stranded (ss) DNA during renaturation is given by the following expression for a second-order rate process: Solving the differential equation yields:

Time required for half-renaturation is inversely proportional to the rate constant At half renaturation, k in liters (mole nt)-1 sec-1

Rate constant is inversely proportional to sequence complexity L = length; N = complexity Empirically, the rate constant k has been measured as in 1.0 M Na+ at T = Tm - 25oC

Time required for half-renaturation is directly proportional to sequence complexity (4) For a renaturation measurement, one usually shears DNA to a constant fragment length L (e.g. 400 bp). Then L is no longer a variable, and (5) (6) E.g. E. coli N = 4.639 x 106 bp

Types of DNA in each kinetic component Human genomic DNA Fig. 1.7.5

Clustered repeated sequences Human chromosomes, ideograms G-bands Tandem repeats on every chromosome: Telomeres Centromeres 5 clusters of repeated rRNA genes: Short arms of chromosomes 13, 14, 15, 21, 22

Almost all transposable elements in mammals fall into one of four classes

Short interspersed repetitive elements: SINEs Example: Alu repeats Most abundant repeated DNA in primates Short, about 300 bp About 1 million copies Likely derived from the gene for 7SL RNA Cause new mutations in humans They are retrotranposons DNA segments that move via an RNA intermediate. MIRs: Mammalian interspersed repeats SINES found in all mammals Analogous short retrotransposons found in genomes of all vertebrates.

Long interspersed repetitive elements: LINEs Moderately abundant, long repeats LINE1 family: most abundant Up to 7000 bp long About 50,000 copies Retrotransposons Encode reverse transcriptase and other enzymes required for transposition No long terminal repeats (LTRs) Cause new mutations in humans Homologous repeats found in all mammals and many other animals

Other common interspersed repeated sequences in humans LTR-containing retrotransposons MaLR: mammalian, LTR retrotransposons Endogenous retroviruses MER4 (MEdium Reiterated repeat, family 4) Repeats that resemble DNA transposons MER1 and MER2 Mariner repeats Were active early in mammalian evolution but are now inactive

Finding repeats Compare a sequence to a database of known repeat sequences from the organism of interest RepeatMasker Arian Smit and P. Green, U. Wash. http://ftp.genome.washington.edu/cgi-bin/RepeatMasker Try it on INS gene sequence