Introduction to genomes Content  the human genome CNVs SNPs Alternative splicing  genome projects Celia van Gelder CMBI UMC Radboud June 2009

Slides:



Advertisements
Similar presentations
Introduction to genomes & genome browsers
Advertisements

The Organization of Cellular Genomes Complexity of Genomes Chromosomes and Chromatin Sequences of Genomes Bioinformatics As we have discussed for the last.
Genomics, Genetics and Biochemistry
Major insights from the HGP on Nature (2001) 15 th Feb Vol 409 special issue; pgs 814 & )Gene content 2)Proteome content 3)SNP identification.
Chap. 6 Problem 2 Protein coding genes are grouped into the classes known as solitary (single) genes, and duplicated or diverged genes in gene families.
Human Genome Project What did they do? Why did they do it? What will it mean for humankind? Animation OverviewAnimation Overview - Click.
Describe the structure of a nucleosome, the basic unit of DNA packaging in eukaryotic cells.
Lecture 2 Molecular Biology Primer Saurabh Sinha.
living organisms According to Presence of cell The non- cellular organism The cellular organisms According to Type the Eukaryotes the prokaryotes human.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. CHAPTER 18 LECTURE SLIDES.
Mutations Georgia Standard:
ECE 501 Introduction to BME
Genes. Outline  Genes: definitions  Molecular genetics - methodology  Genome Content  Molecular structure of mRNA-coding genes  Genetics  Gene regulation.
Genome Browsers Ensembl (EBI, UK) and UCSC (Santa Cruz, California)
Genomes and Genetic Architecture. Life on Earth.
Prepared with lots of help from friends... Metsada Pasmanik-Chor, Zohar Yakhini and NUMEROUS WEB RESOURCES. BioInformatics / Computational Biology Introduction.
Genome Browsers UCSC (Santa Cruz, California) and Ensembl (EBI, UK)
Genomes summary 1.>930 bacterial genomes sequenced. 2.Circular. Genes densely packed Mbases, ,000 genes 4.Genomes of >200 eukaryotes (45.
Goals of the Human Genome Project determine the entire sequence of human DNA identify all the genes in human DNA store this information in databases improve.
Polymorphisms – SNP, InDel, Transposon BMI/IBGP 730 Victor Jin, Ph.D. (Slides from Dr. Kun Huang) Department of Biomedical Informatics Ohio State University.
What is genomics? Study of genomes. What is the genome? Entire genetic compliment of an organism.
Introduction Basic Genetic Mechanisms Eukaryotic Gene Regulation The Human Genome Project Test 1 Genome I - Genes Genome II – Repetitive DNA Genome III.
Elements of Molecular Biology All living things are made of cells All living things are made of cells Prokaryote, Eukaryote Prokaryote, Eukaryote.
DNA Organization.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
1 Genetic Variability. 2 A population is monomorphic at a locus if there exists only one allele at the locus. A population is polymorphic at a locus if.
Eukaryotic Gene Expression The “More Complex” Genome.
Chapter 5 Genome Sequences and Gene Numbers. 5.1Introduction  Genome size vary from approximately 470 genes for Mycoplasma genitalium to 25,000 for human.
Introduction to genomes & genome browsers Content  Introduction  The human genome  Human genetic variation SNPs CNVs Alternative splicing  Browsing.
Introduction to genomes & genome browsers Content  Introduction  The human genome  Human genetic variation SNPs CNVs Alternative splicing  Browsing.
RNA and Protein Synthesis
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
GenomesGenomes Chapter 21 Genomes Sequencing of DNA Human Genome Project countries 20 research centers.
CS177 Lecture 10 SNPs and Human Genetic Variation
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Chapter 21 Eukaryotic Genome Sequences
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Chapter 5 The Content of the Genome 5.1 Introduction genome – The complete set of sequences in the genetic material of an organism. –It includes the.
Introduction to genomes Content  the human genome CNVs SNPs Alternative splicing  genome projects Celia van Gelder CMBI UMC Radboud June 2009
Lecture 10 Genes, genomes and chromosomes
Eukaryotic Genomes: The Organization and Control.
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Bailee Ludwig Quality Management. Before we get started…. ….Let’s see what you know about Genomics.
Changes in the Eukaryotic Genome By: Sergio Aguilar.
Johnson - The Living World: 3rd Ed. - All Rights Reserved - McGraw Hill Companies Genomics Chapter 10 Copyright © McGraw-Hill Companies Permission required.
11 Gene function: genes in action. Sea in the blood Various kinds of haemoglobin are found in red blood cells. Each kind of haemoglobin consists of four.
Genomics Chapter 18.
How many genes are there?
Chapter 19 The Organization & Control of Eukaryotic Genomes.
DNA Organization.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
Who is smarter and does more tricks you or a bacteria? YouBacteria How does my DNA compare to a prokaryote? Show-off.
Aim: How is DNA organized in a eukaryotic cell?. Why is the control of gene expression more complex in eukaryotes than prokaryotes ? Eukaryotes have:
The Organization and Control of Eukaryotic Genomes Ch. 19 AP Biology Ms. Haut.
Chapter 8: Genetic Epidemiology. Introduction Scientific advancement in genetics – Mendel’s laws of inheritance – Charles Darwin’s observations – Discovery.
The genome of prokaryotes and eukaryotes- nuclear and extranuclear genetic organization.
Genomes and Their Evolution
SGN23 The Organization of the Human Genome
Genomes and Their Evolution
Genomes and Their Evolution
Fig Figure 21.1 What genomic information makes a human or chimpanzee?
Evolution of eukaryote genomes
Today: Regulating Gene Expression.
Gene Density and Noncoding DNA
School of Pharmacy, University of Nizwa
Unit Genomic sequencing
Genome Annotation and the Human Genome
SNPs and CNPs By: David Wendel.
Presentation transcript:

Introduction to genomes Content  the human genome CNVs SNPs Alternative splicing  genome projects Celia van Gelder CMBI UMC Radboud June 2009

The human genome Genome: the entire sequence of DNA in a cell 3 billion basepairs (3Gb) 22 chromosome pairs + X en Y chromosomes Chromosome length varies from ~50Mb to ~250Mb About protein-coding genes Human genome is 99.9% identical among individuals

Eukaryotic Genomes: more than collections of genes Protein coding genes RNA genes (rRNA, snRNA, snoRNA, miRNA, tRNA) Structural DNA (centromeres, telomeres) Regulation-related sequences (promoters, enhancers, silencers, insulators) Parasite sequences (transposons) Pseudogenes (non-functional gene-like sequences) Simple sequence repeats

Annotating the genome Genome annotation is the process of attaching biological information to sequences. It consists of two main steps: 1.identifying elements on the genome, a process called Gene Finding, and 2.attaching biological information to these elements. Automatic annotation tools try to perform all this by computer analysis, as opposed to manual annotation which involves human expertise. Ideally, these approaches co-exist and complement each other in the same annotation pipeline.

The human genome cntnd From: Molecular Biology of the Cell (4 th edition) (Alberts et al., 2002) Only 1.2% codes for proteins, 3.5-5% is under selection Long introns, short exons Large spaces between genes More than half consists of repetitive DNA

Eukaryotic Genomes: High fraction non-coding DNA Blue: Prokaryotes Black: Unicellular eukaryotes Other colors: Multicellular eukaryotes (red = vertebrates) From: Mattick, NRG, 2004

Variation along genome sequence Nucleotide usage varies along chromosomes – Protein coding regions tend to have high GC levels Genes are not equally distributed across the chromosomes – Housekeeping generally in gene- dense areas – Gene-poor areas tend to have many tissue specific genes From: Ensembl

Chromosome organisation (1) From: Lodish (4 th edition)

Chromosome organisation (2) From: Lodish (4 th edition) DNA packed in chromatin Non-active genes often in densely packed chromatin (30-nm fiber) Active genes in less dense chromatin (beads-on-a-string) Gene regulation by changing chromatin density, methylation/acetylation of the histones Genes that are OFF Genes that are ON

Today’s focus 1.Copy number variations (CNV) 2.Single Nucleotide Polymorphisms (SNPs) 3.Alternative transcripts

Copy Number Variation People do not only vary at the nucleotide level (SNPs) Copy Number Variations (CNVs): duplications and deletions of pieces of chromosome When there are genes in the CNV areas, this can lead to variations in the number of gene copies between individuals CNVs may either be inherited or caused by de novo mutation

Why study CNVs? CNVs are common in cancer and other diseases. CNVs are also common in normal individuals and contribute to our uniqueness. These changes can also influence the susceptibility to disease. Since CNVs often encompass genes, they can have important roles both in characterizing human disease and discovering drug response targets. Understanding the mechanisms of CNV formation may also help us better understand human genome evolution.

CNV & disease, examples CNVs have been implicated in Cancer EGFR higher copy number in non-small cell lung cancer Low copy number of FCGR3B can increase susceptibility to SLE & other autoimmune disorders Autism Schizophrenia (dept. human genetics) Mental retardation (dept. human genetics)

Single Nucleotide Polymorphisms (SNPs) SNPs are DNA sequence variations that occur when a single nucleotide (A,T,C,or G) in the genome sequence is altered. Similar to mutations, but are simultaneously present in the population, and generally have little effect Are being used as genetic markers (a genetic disease is e.g. associated with a SNP) T T T T T T A A A A A C C CG G G G A T T T T T T A A A A A C C CG G G G A CG GCTA Single Nucleotide- Polymorphism (SNP)

SNP fact sheet For a variation to be considered a SNP, it must occur in at least 1% of the population. SNPs, which make up about 90% of all human genetic variation, occur every 100 to 300 bases along the 3-billion-base human genome. Two of every three SNPs involve the replacement of cytosine (C) with thymine (T). SNPs can occur in coding (gene) and non coding regions of the genome.

SNPs & medicine Although more than 99% of human DNA sequences are the same, variations in DNA sequence can have a major impact on how humans respond to: – disease; – environmental factors such as bacteria, viruses, toxins, and chemicals; – and drugs and other therapies. This makes SNPs valuable for biomedical research and for developing pharmaceutical products or medical diagnostics. SNPs are also evolutionarily stable—not changing much from generation to generation—making them easier to follow in population studies.

SNP & disease, example Alzheimer's disease & apolipoprotein E ApoE contains two SNPs that result in three possible alleles for this gene: E2, E3, and E4. Each allele differs by one DNA base, and the protein product of each gene differs by one amino acid. Each individual inherits one maternal copy of ApoE and one paternal copy of ApoE. Research has shown that a person who inherits at least one E4 allele will have a greater chance of developing Alzheimer's disease.

The HapMap Project is a multi-country effort to identify and catalog genetic similarities and differences in human beings. Using HapMap, researchers will be able to find genes that affect health, disease, and individual responses to medications and environmental factors. HapMap is a collaboration among scientists and funding agencies from Japan, the United Kingdom, Canada, China, Nigeria, and the United States All of the information generated will be released into the public domain. HapMap

Alternative splicing

Alternative splicing (2) ~ 15 % of the mutations that cause genetic diseases affect pre-mRNA splicing

Genome projects, a bit of history

Sequenced genomes 1995Haemophilus influenzae 1.8 Mb 1996Yeast 12 Mb 1998C. elegans100 Mb 1999Fruit fly125 Mb 2000Arabidopsis115 Mb 2001Human (draft) 2002Mouse 2.6 Gb 2002 Rice 2004Human (“finished”) 3 Gb 2006 Sea urchin 2007 Grapevine 2008 Platypus (draft) 2009 Cow

Some genome sizes OrganismGenome size (base pairs) Virus, Phage Φ-X174;5387 First sequenced genome Virus, Phage λ5×10 4 Bacterium, Escherichia coli4×10 6 Plant, Fritillary assyrica13×10 10 Largest known genome Fungus,Saccharomyces cerevisiae2×10 7 Nematode, Caenorhabditis elegans8×10 7 Insect, Drosophila melanogaster2×10 8 Mammal, Homo sapiens3×10 9

Genome browsers can be used to examine …. –Genomic sequence conservation –Duplications en deletions of pieces chromosome (Copy Number Variations, CNVs) –Single Nucleotide Polymorphisms (SNPs) –Alternative splicing –And much more…. LET’S GO BROWSE GENOMES!

Alternative Transcripts Source: Wikipedia (