Major insights from the HGP, from Nature (2001) 15th Feb, Vol 409 special issue; pgs 814 & 875-914. 1) Gene content 2) Proteome content 3) SNP identification.

Presentation transcript:

Major insights from the HGP
Nature (2001) 15th Feb, Vol 409 special issue; pgs 814 & 875-914
1) Gene content
2) Proteome content
3) SNP identification
4) Distribution of GC content
5) CpG islands
6) Recombination rates
7) Repeat content

1) Gene content
- …,000 protein-coding genes estimated, based on known genes and predictions
- Definite genes: 24,500 (IHGSC), 26,383 (Celera)
- Possible genes: …,000
- Genes encode either protein or noncoding RNAs (rRNA, tRNA, snRNA, snoRNA)
Nature (2001) 15th Feb, Vol 409 special issue; pg … and …

Gene content (continued)
- More genes: twice as many as Drosophila / C. elegans
- Uneven gene distribution: gene-rich and gene-poor regions
- More paralogs: some gene families have expanded their number of paralogs, e.g. the olfactory gene family has 1000 genes
- More alternative transcripts: more RNA splice variants are produced, expanding the primary proteins by 5-fold (e.g. neurexin genes)
Nature (2001) 409: pp 892

Gene content: uneven gene distribution
- Gene-rich regions: e.g. the MHC on chromosome 6 has 60 genes with a GC content of 54%
- Gene-poor regions: 82 gene deserts identified (? large or unidentified genes)
- What is the functional significance of these variations?
Genetics by Hartwell: pp …

2) Proteome content
Proteome more complex than in invertebrates
- Protein domains: sections with identifiable shape/function
- Domain arrangements in humans:
  - largest total number of domains is 130
  - largest number of domain types per protein is 9
  - mostly identical arrangement of domains
- Example: Protein X with domain arrangement AABBCBCCCC
Nature (2001) 15th Feb, Vol 409 special issue; pg 847
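The two measures on this slide (total number of domains and number of domain types per protein) can be illustrated on the example arrangement AABBCBCCCC. This is only a minimal sketch: the function name `domain_summary` and the one-letter-per-domain encoding are assumptions made for illustration, not part of the lecture.

```python
from collections import Counter


def domain_summary(arrangement: str) -> dict:
    """Summarise a domain arrangement written as one letter per domain.

    The one-letter encoding is a hypothetical convention for this sketch.
    """
    counts = Counter(arrangement)
    return {
        "total_domains": len(arrangement),   # every letter is one domain copy
        "domain_types": len(counts),         # distinct letters = distinct types
        "copies_per_type": dict(counts),
    }


# Protein X from the slide
summary = domain_summary("AABBCBCCCC")
print(summary)
```

For Protein X this gives 10 domains of 3 types, which shows how a protein can carry many domains while using few domain types.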

2) Proteome content (continued)
Proteome more complex than in invertebrates
- No huge difference in domain number in humans
- BUT the frequency of domain sharing is very high in human proteins (structural proteins and proteins involved in signal transduction and immune function)
- However, there are only 3 cases where a combination of 3 domain types is shared by human and yeast proteins, e.g. carbamoyl-phosphate synthase (involved in the first 3 steps of de novo pyrimidine biosynthesis) has 7 domain types; it occurs once in human and yeast but twice in Drosophila
Nature (2001) 15th Feb, Vol 409 special issue; pg 847

3) SNPs (single nucleotide polymorphisms)
- More than 1.4 million SNPs identified
- One every 1.9 kb on average
- Densities vary over regions and chromosomes, e.g. the HLA region has a high SNP density, reflecting maintenance of diverse haplotypes over many millions of years
Nature (2001) 15th Feb, Vol 409 special issue; pgs … & 928
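The two figures on this slide are linked by simple arithmetic: 1.4 million SNPs at one per 1.9 kb implies roughly 2.66 Gb of sequence surveyed. The sketch below checks that and shows one way regional density could be tabulated. The function names, the 2 kb window, and the example positions are all hypothetical, chosen only for illustration.

```python
def mean_spacing_bp(span_bp: int, snp_count: int) -> float:
    """Average distance between SNPs over a surveyed span."""
    return span_bp / snp_count


def snps_per_window(positions, window_bp):
    """Count SNPs in consecutive non-overlapping windows.

    positions are 0-based coordinates; keys of the result are window starts.
    """
    counts = {}
    for pos in positions:
        start = (pos // window_bp) * window_bp
        counts[start] = counts.get(start, 0) + 1
    return counts


# Span implied by the slide's figures: 1.4 million SNPs, one per 1.9 kb
implied_span = 1_400_000 * 1_900
print(implied_span, mean_spacing_bp(implied_span, 1_400_000))

# Made-up positions, only to show uneven density between windows
print(snps_per_window([100, 1500, 1900, 4200], 2000))
```

Windows with many SNPs relative to the average would correspond to high-density regions such as HLA.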

How does one distinguish sequence errors from polymorphisms?
Sequence errors
- Each piece of the genome is sequenced at least 10 times to reduce the error rate (0.01%)
Polymorphisms
- Sequence variation between individuals is 0.1%
- To be defined as a polymorphism, the altered sequence must be present in a significant fraction of the population
- Rate of polymorphism in the diploid human genome is about 1 in 500 bp
Nature (2001) 15th Feb, Vol 409 special issue; pgs … & 928
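The reason the error rate matters can be seen by comparing the two rates quoted on this slide. The sketch below treats both as independent per-base probabilities, which is a simplifying assumption of this example and not something the lecture states; the function names are hypothetical.

```python
ERROR_RATE = 0.0001      # 0.01% residual sequencing error rate (from the slide)
VARIATION_RATE = 0.001   # 0.1% sequence variation between individuals (from the slide)


def expected_differences(n_bases, error_rate=ERROR_RATE, variation_rate=VARIATION_RATE):
    """Expected sequencing errors and true variants in n_bases of compared sequence."""
    return n_bases * error_rate, n_bases * variation_rate


def error_fraction(error_rate=ERROR_RATE, variation_rate=VARIATION_RATE):
    """Share of observed differences expected to be errors (assumes independence)."""
    return error_rate / (error_rate + variation_rate)


errors, variants = expected_differences(10_000)
print(errors, variants, error_fraction())
```

Under these assumptions, about one observed difference in eleven would be a sequencing error, which is why a candidate variant must also be seen in a significant part of the population before it is called a polymorphism.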

3) SNPs (continued)
- Sites that result from point mutations in individual base pairs
- Biallelic
- ~60,000 SNPs lie within exons and untranslated regions (85% of exons lie within 5 kb of a SNP)
- May or may not affect the ORF
- Most SNPs may be regulatory
Nature (2001) 15th Feb, Vol 409 special issue; pg 821 & 928

3) SNPs and disease

3) SNPs and risk of disease

3) SNPs and drug prescription

4) Distribution of GC content
- Genome-wide average of 41%
- Huge regional variations exist, e.g. the distal 48 Mb of chromosome 1p has 47% but chromosome 13 has only 36%
- Confirms cytogenetic staining with G-bands (Giemsa):
  - dark G-bands: low GC content (37%)
  - light G-bands: high GC content (45%)
Nature (2001) 15th Feb, Vol 409 special issue; pg …
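GC content is the fraction of bases that are G or C, and regional variation is usually examined by computing it window by window. The sketch below is a minimal illustration; the function names, the non-overlapping window scheme, and the toy sequence are assumptions of this example, not the method used in the paper.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G+C among unambiguous bases (A, C, G, T); 0.0 if there are none."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def windowed_gc(seq: str, window: int) -> list:
    """GC fraction in consecutive non-overlapping windows (the last may be shorter)."""
    return [gc_fraction(seq[i:i + window]) for i in range(0, len(seq), window)]


# Toy sequence: a GC-rich stretch followed by an AT-rich stretch
print(gc_fraction("GGCCAATT"))
print(windowed_gc("GGGGAAAA", 4))
```

Applied to a chromosome with a large window, a profile like this would show the kind of regional differences the slide describes.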

5) CpG islands
Significance of CpG islands
1) Non-methylated CpG islands are associated with the 5' ends of genes
2) Aberrant methylation of CpG islands is one mechanism of inactivating tumor suppressor genes (TSGs) in neoplasia
Diagram: CpG -> (methylated at C) -> methyl-CpG -> (deamination) -> TpG
CpG islands show no methylation

CpG islands (continued)
- CpG dinucleotides are greatly under-represented in the human genome
- ~28,890 CpG islands in number
- Variable density, e.g. chromosome Y has 2.9/Mb but chromosomes 16, 17 & 22 have 19-22/Mb
- Average is 10.5/Mb
Nature (2001) 15th Feb, Vol 409 special issue; pg …
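Under-representation of CpG is commonly quantified as an observed/expected ratio: the number of CG dinucleotides, times the sequence length, divided by the product of the C and G counts. The sketch below computes that ratio. The formula is a widely used convention rather than something stated on the slides, the function name is hypothetical, and no island-calling thresholds are applied.

```python
def cpg_obs_exp(seq: str) -> float:
    """Observed/expected CpG ratio: (#CG * length) / (#C * #G).

    Returns 0.0 when the sequence has no C or no G, since the ratio is undefined.
    """
    seq = seq.upper()
    c_count = seq.count("C")
    g_count = seq.count("G")
    if c_count == 0 or g_count == 0:
        return 0.0
    return seq.count("CG") * len(seq) / (c_count * g_count)


print(cpg_obs_exp("CGCGCGCG"))  # CpG-dense toy sequence
print(cpg_obs_exp("CATG"))      # contains C and G but no CpG
```

A ratio well below 1 across bulk DNA reflects the loss of methylated CpG to TpG by deamination, while unmethylated islands keep a ratio closer to 1.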

6) Recombination rates
2 main observations:
- Recombination rate increases with decreasing arm length
- Recombination rate is suppressed near the centromeres and increases towards the distal 20-35 Mb

7) Repeat content
a) Age distribution
b) Comparison with other genomes
c) Variation in distribution of repeats
d) Distribution by GC content
e) Y chromosome
Nature (2001) 409: pp …