Introduction to genomes & genome browsers

Presentation transcript:

Introduction to genomes & genome browsers
Celia van Gelder, CMBI, UMC Radboud, December 2014
Celia.vanGelder@radboudumc.nl

Content
- Introduction to genomes
- The human genome
- Human genetic variation
  - SNPs
  - CNVs
  - Alternative splicing
- Browsing the human genome

Exponential Growth in Genomic Sequence Data
[Figure: number of genomes sequenced, with milestones marked: first 2 bacterial genomes complete; first eukaryote complete (yeast); first metazoan complete (roundworm)]

http://www.genomesonline.org/

Ebola

The human genome
- Genome: the entire sequence of DNA in a cell
- 3 billion base pairs (3 Gb)
- 22 chromosome pairs + the X and Y chromosomes
- Chromosome length varies from ~50 Mb to ~250 Mb
- About 20,000 protein-coding genes (average gene length 3,000 bases; the largest known gene, dystrophin, is 2.4 Mb)
- The human genome is 99.9% identical among individuals
- This means that any two people differ at about 3 million nucleotides
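The last two points on this slide are a single multiplication. A minimal sketch of that arithmetic, using only the genome size and identity figure quoted above (the function name is just for illustration):

```python
GENOME_SIZE_BP = 3_000_000_000  # 3 Gb, as stated on the slide
IDENTITY = 0.999                # 99.9% identical among individuals


def expected_differences(genome_size_bp: int, identity: float) -> int:
    """Number of positions expected to differ between two genomes."""
    return round(genome_size_bp * (1.0 - identity))


print(expected_differences(GENOME_SIZE_BP, IDENTITY))
```

The 0.1% that differs sounds small, but applied to 3 billion positions it comes to roughly 3 million nucleotides.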

Eukaryotic Genomes: more than collections of genes
- Genes & regulatory sequences make up 5% of the genome
- Protein-coding genes
- RNA genes (rRNA, snRNA, snoRNA, miRNA, tRNA)
- Structural DNA (centromeres, telomeres)
- Regulation-related sequences (promoters, enhancers, silencers, insulators)
- Parasite sequences (transposons)
- Pseudogenes (non-functional gene-like sequences)
- Simple sequence repeats

The human genome (continued)
- Only 1.2% codes for proteins
- Long introns, short exons
- Large spaces between genes
- More than half consists of repetitive DNA
- Alu repeat: ~300 bp, more than a million copies
From: Molecular Biology of the Cell (4th edition) (Alberts et al., 2002)
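To put these percentages in base pairs, here is a rough back-of-the-envelope sketch. It assumes a 3 Gb genome (from the earlier slide) and, as a simplifying assumption, exactly one million Alu copies of 300 bp each; the slide only gives approximate figures.

```python
GENOME_BP = 3_000_000_000   # 3 Gb genome size

# 1.2% of the genome codes for proteins (integer arithmetic avoids float noise).
coding_bp = GENOME_BP * 12 // 1000

# Assumption for illustration: 1,000,000 Alu copies of ~300 bp each.
ALU_LENGTH_BP = 300
ALU_COPIES = 1_000_000
alu_bp = ALU_LENGTH_BP * ALU_COPIES
alu_fraction = alu_bp / GENOME_BP

print(f"protein-coding: {coding_bp / 1e6:.0f} Mb")
print(f"Alu repeats:    {alu_bp / 1e6:.0f} Mb ({alu_fraction:.0%} of the genome)")
```

Under these assumptions a single repeat family takes up several times more sequence than all protein-coding exons together.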

Non-coding DNA

Human Genetic Variation
Genetic variation explains some of the differences among people, such as:
- Blood group
- Eye color, skin color, hair color
- Height
- Higher or lower risk of getting particular diseases: cystic fibrosis, sickle cell disease, diabetes, cancer, arthritis, asthma, etc.

Variations in the Genome
Common sequence variations:
- Polymorphisms
- Deletions
- Insertions
- Chromosome translocations

Today’s focus
- Single Nucleotide Polymorphisms (SNPs)
- Copy number variations (CNVs)
- Alternative transcripts

Single Nucleotide Polymorphisms (SNPs)
- SNPs are DNA sequence variations that occur when a single nucleotide (A, T, C, or G) in the genome sequence is altered.
- For a variation to be considered a SNP, it must occur in at least 1% of the population.
- SNPs make up about 90% of all human genetic variation and occur every 100 to 300 bases.
- SNPs can occur in coding (gene) and non-coding regions of the genome; <1% alter the protein sequence.
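The 1% rule can be illustrated with a small sketch. It assumes we have the allele observed at one genomic position in a sample of chromosomes, and it interprets "occurs in at least 1% of the population" as the frequency of the second most common allele; both the function names and that interpretation are assumptions made for illustration.

```python
from collections import Counter

SNP_MIN_FREQUENCY = 0.01  # the 1% threshold from the slide


def minor_allele_frequency(observed_alleles):
    """Frequency of the second most common allele at one position."""
    counts = Counter(observed_alleles)
    if len(counts) < 2:
        return 0.0  # only one allele seen: no variation at this position
    total = sum(counts.values())
    second_most_common_count = counts.most_common(2)[1][1]
    return second_most_common_count / total


def is_snp(observed_alleles, threshold=SNP_MIN_FREQUENCY):
    """True if the variant allele is common enough to count as a SNP."""
    return minor_allele_frequency(observed_alleles) >= threshold


common_variant = ["A"] * 900 + ["G"] * 100   # G seen in 10% of the sample
rare_variant = ["A"] * 995 + ["G"] * 5       # G seen in 0.5% of the sample
print(is_snp(common_variant), is_snp(rare_variant))
```

A variant below the threshold is still a real sequence difference; by the definition used on this slide it is simply too rare to be called a SNP.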

SNPs:
- determine properties like eye color, hair (curly or straight), or whether you can taste bitterness
- are used for identification and forensics
- are used for estimating predisposition to disease
- can cause drug side effects and/or non-responsiveness to a drug
- have an impact on how humans respond to environmental factors like bacteria, viruses, toxins and chemicals
- are used to predict specific genetic traits
- are used for classifying patients in clinical trials
- are used for mapping and genome-wide association studies of complex diseases

SNP - Bitter tasting, TAS2R38

SNP & disease, Alzheimer
Alzheimer's disease (AD) & apolipoprotein E (APOE)
- Apolipoprotein E is a cholesterol carrier that is found in the brain and other organs.
- APOE is suspected to be involved in amyloid beta aggregation and clearance, influencing the onset of amyloid beta deposition.
- APOE contains 2 SNPs that result in 3 possible alleles: E2, E3, E4.

Variant   rs429358   rs7412
E2        T          T
E3        T          C
E4        C          C

A person who inherits at least one E4 allele will have a greater chance of developing AD.
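The allele table above is effectively a lookup from two SNP bases to an APOE allele. A minimal sketch of that lookup follows; the function names are hypothetical, and a combination not listed on the slide simply returns None.

```python
# (rs429358 base, rs7412 base) -> APOE allele, exactly as listed on the slide.
APOE_ALLELES = {
    ("T", "T"): "E2",
    ("T", "C"): "E3",
    ("C", "C"): "E4",
}


def apoe_allele(rs429358, rs7412):
    """Return E2, E3 or E4 for one chromosome, or None if not in the table."""
    return APOE_ALLELES.get((rs429358.upper(), rs7412.upper()))


def carries_e4(allele_a, allele_b):
    """True if at least one of a person's two APOE alleles is E4."""
    return "E4" in (allele_a, allele_b)


maternal = apoe_allele("T", "C")   # E3
paternal = apoe_allele("C", "C")   # E4
print(maternal, paternal, carries_e4(maternal, paternal))
```

Each person has two copies of APOE, so the lookup is applied once per chromosome before checking for E4.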

Today’s focus
- Single Nucleotide Polymorphisms (SNPs)
- Copy number variations (CNVs)
- Alternative transcripts

Copy Number Variation
- Copy Number Variations (CNVs): a segment of DNA (> 1 kb) that is present at variable copy number in two or more genomes.
- When there are genes in the CNV areas, this can lead to variations in the number of gene copies between individuals.
- CNVs contribute to our uniqueness.
- CNVs can also influence the susceptibility to disease.
- CNVs may either be inherited or caused by de novo mutation.

Copy Number Variation
[Figure: normal cell, CN=2; deletion, CN=0 or CN=1; amplification, CN=3 or CN=4]
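The copy number (CN) labels in this figure can be read as a simple classification rule. A minimal sketch, assuming a diploid locus where two copies is the normal state (the function name is hypothetical):

```python
def classify_copy_number(cn, normal=2):
    """Classify a segment's copy number relative to the normal state."""
    if cn < 0:
        raise ValueError("copy number cannot be negative")
    if cn < normal:
        return "deletion"        # CN=0 or CN=1 in the figure
    if cn > normal:
        return "amplification"   # CN=3, CN=4, ... in the figure
    return "normal"              # CN=2


for cn in (0, 1, 2, 3, 4):
    print(cn, classify_copy_number(cn))
```

The `normal` parameter is there because two copies is not always the baseline, for example for genes on the X chromosome in males.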

CNVs and their possible effects on gene expression. Cabianca DS, Gabellini D. J Cell Biol 2010;191:1049-1060. © 2010 Cabianca and Gabellini

CNVs & disease
- Many inherited genetic diseases result from CNVs; gene copy number can be elevated in cancer cells
  - Autism
  - Schizophrenia (dept. human genetics)
  - Mental retardation (dept. human genetics)
  - Parkinson's disease
- There are CNVs that protect against HIV infection and malaria.
- The contribution of CNVs to common, complex diseases, such as diabetes and heart disease, is currently less well understood.

Today’s focus
- Copy number variations (CNVs)
- Single Nucleotide Polymorphisms (SNPs)
- Alternative transcripts

Alternative splicing

Alternative splicing
Defects in alternative splicing have been implicated in many diseases, including:
- neuropathological conditions such as Alzheimer disease
- cystic fibrosis
- those involving growth and developmental defects
- many human cancers, e.g. BRCA1 in breast cancer
- beta-globin in beta-thalassemia
- Parkinson's disease

Annotating & Browsing the Human Genome

Annotating the genome
Annotation: attaching biological information to sequences. Two main steps:
- identifying elements on the genome
- attaching biological information to these elements

Basic & Advanced Genome Annotation
Basic:
- Genomic location
- Gene features: exons, introns, UTRs
- Transcript(s)
- Pseudogenes, non-coding RNA
- Protein(s)
- Links to other sources of information
Advanced:
- Cytogenetic bands
- Polymorphic markers
- Genetic variation, including SNPs & CNVs
- Repetitive sequences
- cDNAs or mRNAs from related species
- Genomic sequence variation
- Regulatory sequences (enhancers, silencers, insulators)

[Human] Genome Browsers
Not limited to only human data
- EBI Ensembl
- NCBI Map Viewer
- UCSC Genome Browser

Ensembl ©EMBL-EBI

Other Ensembl Installations ©EMBL-EBI (2013)

Organized Data Based on Chromosome Location
Gene X:
- Description
- Transcript data
- Structure
- Gene Ontology
- Pathway data
- Homologous genes
- Expression data
- Etc.
Tracks:
- genes & predictions
- variations & repeats
- cross-species comparative data
- & many more types of data, from expression & regulation to mRNA and ESTs

ENSG###   Ensembl Gene ID
ENST###   Ensembl Transcript ID
ENSP###   Ensembl Peptide ID
ENSE###   Ensembl Exon ID
HGNC: a unique name and symbol for every human gene (http://www.genenames.org/)
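The four prefixes above are enough to tell what kind of object an Ensembl stable ID refers to. A minimal sketch, assuming human-style IDs of the form prefix plus digits (IDs for other species carry an extra species code, which this sketch does not handle); the function name and example IDs are for illustration only.

```python
import re

# Prefix -> feature type, as listed on the slide.
ENSEMBL_PREFIXES = {
    "ENSG": "gene",
    "ENST": "transcript",
    "ENSP": "peptide",
    "ENSE": "exon",
}

# Assumption: human-style ID, i.e. one of the four prefixes followed by digits.
_ID_PATTERN = re.compile(r"^(ENS[GTPE])(\d+)$")


def ensembl_id_type(stable_id):
    """Return 'gene', 'transcript', 'peptide' or 'exon', or None if unrecognised."""
    match = _ID_PATTERN.match(stable_id.strip().upper())
    if match is None:
        return None
    return ENSEMBL_PREFIXES[match.group(1)]


for example in ("ENSG00000139618", "ENST00000000001", "BRCA2"):
    print(example, ensembl_id_type(example))
```

A gene symbol such as an HGNC name is not an Ensembl stable ID, so it falls through to None here.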

Ensembl: An Example
[Screenshot labels: click for more details; tracks]

Direction of transcription Above blue line: forward strand Below blue line: reverse strand

Ensembl Transcripts
- A red transcript comes from Ensembl or VEGA/Havana.
  - A transcript from the Ensembl annotation pipeline has a number starting with 2 (MYO6-201).
  - A transcript with VEGA/Havana manual curation has a number starting with 0 (MYO6-001).
- A gold, or merged, transcript is identical between Ensembl automated annotation and VEGA/Havana manual curation. It can be thought of as stable (unlikely to change) and is assigned a number beginning with 0. Only human, mouse, and zebrafish have gold transcripts.
- A blue, pink or grey transcript is non-coding. See the 'NON-CODING TRANSCRIPTS' section below for more.
©EMBL-EBI
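The numbering rule on this slide can be sketched as a small helper. It follows the convention exactly as stated here (the number after the hyphen starts with 2 or 0); the function name is hypothetical, and the naming scheme in current Ensembl releases may differ from this 2014 description.

```python
def transcript_source(transcript_name):
    """Guess a transcript's annotation source from a name like 'MYO6-201'."""
    symbol, separator, number = transcript_name.rpartition("-")
    if not separator or not symbol or not number.isdigit():
        raise ValueError(f"not a SYMBOL-NNN transcript name: {transcript_name!r}")
    if number.startswith("2"):
        return "Ensembl automated annotation"
    if number.startswith("0"):
        # The slide uses a leading 0 for both manual and merged (gold) transcripts.
        return "VEGA/Havana manual curation or merged"
    return "unknown"


print(transcript_source("MYO6-201"))
print(transcript_source("MYO6-001"))
```

Using `rpartition` splits on the last hyphen, so gene symbols that themselves contain a hyphen are still handled.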

Synopsis: What can I do with Ensembl?
- View, examine & explore annotated information for any chromosomal region:
  - Genes, ESTs, mRNAs, alternative transcripts
  - Proteins
  - SNPs, and SNPs across strains (rat, mouse), populations (human), or even breeds (dog)
  - Homologues and phylogenetic trees across more than 40 species
  - Whole genome alignments
  - Conserved regions across species
  - Gene expression profiles
- Upload your own data and use BLAST/BLAT against any Ensembl genome
- Export sequence, or create a table of gene information

- Help & Documentation -> Tutorials
- Glossary
- FAQ
- Save configuration
- "Share this link" functionality
- "Share this image" functionality