BB30055: Genes and genomes Genomes - Dr. MV Hejmadi (bssmvh)

Slides:



Advertisements
Similar presentations
Molecular Biology Fifth Edition
Advertisements

The Human Genome Project Main reference: Nature (2001) 409,
Janice S. Dorman, PhD University of Pittsburgh School of Nursing
Genomics – The Language of DNA Honors Genetics 2006.
Introduction to genomes & genome browsers
MCB 317 Genetics and Genomics Topic 11, part 2 Genomics.
An Introduction to Bioinformatics Finding genes in prokaryotes.
Sequencing a genome. Definition Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
Major insights from the HGP on Nature (2001) 15 th Feb Vol 409 special issue; pgs 814 & )Gene content 2)Proteome content 3)SNP identification.
Genetics: From Genes to Genomes
A Lot More Advanced Biotechnology Tools DNA Sequencing.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 18 LECTURE SLIDES.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
The Human Genome Race. Collins vs. Venter Collins Venter.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Bacterial Physiology (Micr430)
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
Advanced Microbial Physiology
The Human Genome Project Public: International Human Genome Sequencing Consortium (aka HUGO) Private: Celera Genomics, Inc. (aka TIGR)
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Cloning, genomes, and proteomes
Human Genome Project Seminal achievement. Scientific milestone. Scientific implications. Social implications.
Online Counseling Resource YCMOU ELearning Drive… School of Architecture, Science and Technology Yashwantrao C havan Maharashtra Open University, Nashik.
DNA and Chromosome Structure. Chromosomal Structure of the Genetic Material.
Chapter 20: Biotechnology. Essential Knowledge u 3.a.1 – DNA, and in some cases RNA, is the primary source of heritable information (20.1 & 20.2)
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Genomics Chapter 18.
Trends in Biotechnology
CO 10.
Genome Annotation BBSI July 14, 2005 Rita Shiang.
Human Genetics The Human Genome 1.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
GenomesGenomes Chapter 21 Genomes Sequencing of DNA Human Genome Project countries 20 research centers.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
Sequencing a genome. Approximate Molecular Dynamics: New Algorithms with Applications in Protein Folding Author: Qun (Marc) Ma Predicting the 3D native.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
Genome Organization & Evolution. Chromosomes Genes are always in genomic structures (chromosomes) – never ‘free floating’ Bacterial genomes are circular.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Chapter 21 Eukaryotic Genome Sequences
BB30055: Genes and genomes Genomes - Dr. MV Hejmadi Lecture 2 – Repeat elements.
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Biochemistry 412 Overview of Genomics & Proteomics 18 January 2005.
Human Genome.
Lecture 2 – Repeat elements
DNA LIBRARIES Dr. E. What Are DNA Libraries? A DNA library is a collection of DNA fragments that have been cloned into a plasmid and the plasmid is transformed.
1 From Mendel to Genomics Historically –Identify or create mutations, follow inheritance –Determine linkage, create maps Now: Genomics –Not just a gene,
Genomics Chapter 18.
IB Saccharomyces cerevisiae - Jan Major model system for molecular genetics. For example, one can clone the gene encoding a protein if you.
BIOL 433 Plant Genetics Term 2, Instructors: Dr. George Haughn Dr. Ljerka Kunst BioSciences 2239BioSciences Tel
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Gene structure and function
Objective: I can explain how genes jumping between chromosomes can lead to evolution. Chapter 21; Sections ; Pgs Genomes: Connecting.
BB30055: Genes and genomes Genomes - Dr. MV Hejmadi
Javad Jamshidi Fasa University of Medical Sciences, September 2016 Session 2 Medical Genetics The Cellular and Molecular Basis of Inheritance.
The Transcriptional Landscape of the Mammalian Genome
Human Genome Project.
BIOL 433 Plant Genetics Term 2,
Genomes and Their Evolution
Genomes and their evolution
Today… Review a few items from last class
Genomes and Their Evolution
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
BB30055: Genes and genomes Genomes - Dr. MV Hejmadi
BIOL 433 Plant Genetics Term 2,
CSCI 1810 Computational Molecular Biology 2018
From Mendel to Genomics
Introduction to Sequencing
Sequence the 3 billion base pairs of human
The Content of the Genome
Human Genome Project Seminal achievement. Scientific milestone.
Presentation transcript:

BB30055: Genes and genomes Genomes - Dr. MV Hejmadi (bssmvh)

BB30055: Genomes - MVH Recommended texts: 1)Genetics from genes to genomes 2e - Hartwell et al 2)Human Molecular Genetics 3 – Strachan and Read 4) Genomes 2 - TA Brown 5) Genes VII – Benjamin Lewin Special issue Journals: Nature (2001) 15 th Feb Vol 409 Science (2001) Vol 291 No 5507 Full text of both above journals available at

BB30055: Genomes - MVH 3 broad areas (A)Genomes, transcriptomes, proteomes (B)Applications of the human genome project (C) Genome evolution

A) Genomes, transcriptomes, proteomes Genome projects - Human Genome Project (HGP): a history - Other genome projects: why do it - Genome organisation -insights from HGP -Repeat elements -Transposable elements -Mitochondrial genomes -Y chromosome Post-genomics -transcriptomes - proteomes

(A) Genomes, transcriptomes and proteomes genome transcriptome proteome Entire DNA complement of any organism which include organelle DNA All RNA transcribed from genome of a cell or tissue all proteins expressed by a genome, cell or tissue

Why study the genome? 3 main reasons description of sequence of every gene valuable. Includes regulatory regions which help in understanding not only the molecular activities of the cell but also ways in which they are controlled. identify & characterise important inheritable disease genes or bacterial genes (for industrial use) Role of intergenic sequences e.g. satellites, intronic regions etc

History of Human Genome Project (HGP) 1953 – DNA structure (Watson & Crick) 1972 – Recombinant DNA (Paul Berg) 1977 – DNA sequencing (Maxam, Gilbert and Sanger) 1985 – PCR technology (Kary Mullis) 1986 – automated sequencing (Leroy Hood & Lloyd Smith 1988 – IHGSC established (NIH, DOE) Watson leads 1990 – IHGSC scaled up, BLAST published (Lipman+Myers) 1992 – Watson quits, Venter sets up TIGR 1993 – F Collins heads IHGSC, Sanger Centre (Sulston) 1995 – cDNA microarray 1998 – Celera genomics (J Craig Venter) 2001 – Working draft of human genome sequence published 2003 – Finished sequence announced

HGP Goal: Obtain the entire DNA sequence of human genome Players: (A)International Human Genome Sequence Consortium (IHGSC) - public funding, free access to all, started earlier - used mapping overlapping clones method (B) Celera Genomics – private funding, pay to view - started in used whole genome shotgun strategy

Whose genome is it anyway? (A)International Human Genome Sequence Consortium (IHGSC) - composite from several different people generated from primary samples taken from numerous anonymous donors across racial and ethnic groups (B) Celera Genomics – 5 different donors (one of whom was J Craig Venter himself !!!)

Genomicists looked at two basic features of genomes: sequence and polymorphism –How does one sequence a 500 Mb chromosome 600 bp at a time? –How accurate should a genome sequence be? DNA sequencing error rate is about 1% per 600 bp –How does one distinguish sequence errors from polymorphisms? Rate of polymorphism in diploid human genome is about 1 in 500 bp –Repeat sequences may be hard to place –Unclonable DNA cannot be sequenced (30%) Major challenge - to determine sequence of each chromosome in genome and identify polymorphisms

Divide and conquer strategy meets most challenges Chromosomes are broken into small overlapping pieces and cloned Ends of clones sequenced and reassembled into original chromosome strings Each piece is sequenced multiple times to reduce error rate –10-fold sequence coverage achieves a rate of error less than 1/10,000

Strategies for sequencing the human genome

Whole-genome shotgun sequencing Whole genome randomly sheared three times –Plasmid library constructed with ~ 2kb inserts –Plasmid library with ~10 kb inserts –BAC library with ~ 200 kb inserts Computer program assembles sequences into chromosomes No physical map construction Only one BAC library Overcomes problems of repeat sequences Fig Whole genome randomly sheared three times –Plasmid library constructed with ~ 2kb inserts –Plasmid library with ~10 kb inserts –BAC library with ~ 200 kb inserts Computer program assembles sequences into chromosomes No physical map construction Only one BAC library Overcomes problems of repeat sequences Private company Celera used to sequence whole human genome Fig Genetics by Hartwell

sequencing larger genomes Mapping phase Sequencing phase

Other genomes sequenced ,000 genes Sept ,473 human orthologs ,200 genes ,099 genes ,000 genes Science (26 Sep 2003)Vol301(5641)pp

Human genome – size and structure Nuclear genome (3.2 Gbp) 24 types of chromosomes Y- 51Mb and chr1 -279Mbp Base composition – 41% GC Mitochondrial genome

Nuclear genome organisation (human) Genomes 2 by TA Brown pg 23

Nuclear genome organisation (human) 1) Gene and gene related sequences Coding regions – Exons (5%) Non-coding regions RNA genes Introns Pseudogenes Gene fragments

Nuclear genome organisation (human) 16S, 23S, 28S, 18S etc 22 types of mitochondrial & 49 cytoplasmic U1,U2.U4,U5,U6 etc > 100 types RNA genes - rRNA tRNA snRNA snoRNA Major classes of RNA involved in gene expression Other RNA classes microRNA XIST RNA Imprinting associated RNA Nervous system specific Antisense RNA Others

introns Non-coding regions…..

Pseudogenes ( ) A non functional copy of most or all of a gene Inactivated by mutations that may cause either inhibition of signal for initiation or transcription prevent splicing at exon-intron boundary premature termination of translation Human Mol Gen 3 by Strachan & Read pgs Non-coding regions…..

Pseudogenes ( ) Different classes include Non-processed: contain non functional copies of genomic DNA sequence incl exons and introns arise from gene duplication events E.g. rabbit pseudogene 2 Non-coding regions…..

rabbit pseudogene 2 Related to 1 Usual exon and intron organisation Non-coding regions….. 1 2

Pseudogenes - processed Non-coding regions…

Pseudogenes - processed Non-coding regions… non functional copies of exonic sequences of an active gene. Thought to arise by genomic insertion of a cDNA as a result of retroposition Expressed processed: processed pseudogene integrated adjacent to a promoter site Contribute to overall repetitive elements

Gene fragments or truncated genes Gene fragments: small segments of a gene (e.g. single exon from a multiexon gene) Truncated genes: Short components of functional genes (e.g. 5 or 3 end) Thought to arise due to unequal crossover or exchange Non-coding regions…..

Nuclear genome organisation (human)

2) Extragenic (intergenic) DNA (~62% of genome) A) Unique or low copy number sequences B) Repetitive sequences (~ 53%)

A) Unique or low copy number sequences Non –coding, non repetitive and single copy sequences of no known function or significance

B) Repetitive sequences Significance Evolutionary signposts Passive markers for mutation assays Actively reorganise gene organisation by creating, shuffling or modifying existing genes Chromosome structure and dynamics Provide tools for medical, forensic, genetic analysis