Who's in charge here? Jim Kent, ENCODE Data Coordinating Center (DCC), University of California Santa Cruz. Finding and characterizing regulatory regions.


Who's in charge here?
Finding and characterizing regulatory regions in the human genome.
Jim Kent, ENCODE Data Coordinating Center (DCC), University of California Santa Cruz

The Paradox of the Genome
How does a long, static, one-dimensional string of DNA turn into the remarkably complex, dynamic, and three-dimensional human body?
GTTTGCCATCTTTTG CTGCTCTAGGGAATC CAGCAGCTGTCACCA TGTAAACAAGCCCAG GCTAGACCAGTTACC CTCATCATCTTAGCT GATAGCCAGCCAGCC ACCACAGGCATGAGT

Early explanations of development
A little man in the sperm is in charge of making the baby. This begs the question of what makes the little man. The theory was later disproved by better microscopes.

More modern thinking
An organism is created by the cooperative and competitive actions of the cells that make it up. Though all cells (save some specialized blood cells) share the same DNA, which parts of the DNA are used varies from cell to cell. As cells divide, they differentiate into different cell types based on signals from other cells, the environment, a bit of randomness, and the cell's internal state. Most of the differentiation decisions ultimately take place in the cell nucleus.

Nucleus Used to Appear Simple
Cheek cells stained with basic dyes. Nuclei are readily visible.

Mammalian nuclei stained in various ways reveal additional structure within the nucleus. Image from the Tom Misteli lab.

Focusing on Chromatin

Turning on/off a gene
– Opening/closing chromatin.
– Binding expressive/inhibitory transcription factors.
– mRNA transcription (or not).
Additional regulation occurs after transcription, but that is beyond the scope of this talk.

ENCODE Project
Not to be confused with the ENCODE pilot project, which covered just 1% of the genome.
23 biology labs organized into 8 grants, plus an Analysis Working Group and a Data Coordinating Center (DCC). I'm the principal investigator of the DCC.
ENCODE's overall goal is to identify and characterize all functional elements of the genome.
The ENCODE DCC's job is to make the data accessible and clear, to put it in the UCSC Genome Browser, and to help other databases at NCBI, EBI, and elsewhere import ENCODE data as well.

ENCODE assays on regulation of transcription
Opening/closing chromatin
– DNase hypersensitivity
– Chromatin immunoprecipitation & sequencing (ChIP-seq) of histone marks
Binding expressive/inhibitory transcription factors
– ChIP-seq of various transcription factors
RNA transcription (or not)
– mRNA sequencing of ENCODE cell lines
– Exotic RNA sequencing also (see Tom Gingeras' talk)

ENCODE DNase Hypersensitivity
Several genome-wide, high-throughput methods are being used in ENCODE. All involve DNA sequencing.
Data are currently available for >50 cell lines, with plans for >300 cell lines.
Main artifacts to watch for:
– DNA present in the cell in multiple copies: mitochondria, centromeric repeats, other repeats. Generally such regions are ignored except in "raw" data.
– Sequencing biases (highly GC-rich regions, etc.)
– In general, these artifacts are easier to work around than those associated with DNA-chip-based assays.
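As an illustration of ignoring multi-copy regions, here is a minimal Python sketch. It is not the ENCODE or UCSC pipeline: the `Peak` type, the `filter_multicopy` function, the choice of `chrM` as the mitochondrial chromosome name, and all coordinates are assumptions made up for this example.

```python
from typing import Iterable, List, NamedTuple, Tuple


class Peak(NamedTuple):
    """A hypothetical peak record with half-open coordinates [start, end)."""
    chrom: str
    start: int
    end: int
    score: float


def overlaps(a_start: int, a_end: int, b_start: int, b_end: int) -> bool:
    """True if two half-open intervals share at least one base."""
    return a_start < b_end and b_start < a_end


def filter_multicopy(
    peaks: Iterable[Peak],
    excluded_chroms: frozenset = frozenset({"chrM"}),
    repeat_regions: Iterable[Tuple[str, int, int]] = (),
) -> List[Peak]:
    """Drop peaks on excluded chromosomes or overlapping a listed repeat region."""
    repeats = list(repeat_regions)
    kept = []
    for p in peaks:
        if p.chrom in excluded_chroms:
            continue  # e.g. mitochondrial DNA, present in many copies per cell
        if any(p.chrom == c and overlaps(p.start, p.end, s, e) for c, s, e in repeats):
            continue  # e.g. centromeric or other repeats
        kept.append(p)
    return kept


# Made-up example data; the repeat region is a placeholder, not a real annotation.
example_peaks = [
    Peak("chr11", 5_246_000, 5_246_400, 812.0),
    Peak("chrM", 100, 600, 9000.0),
    Peak("chr1", 121_500_000, 121_500_300, 450.0),
]
example_repeats = [("chr1", 121_000_000, 125_000_000)]
filtered = filter_multicopy(example_peaks, repeat_regions=example_repeats)
```

A linear scan over the repeat list is fine for a sketch; a real annotation with many regions would call for an interval index.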

UW DNaseI at Hemoglobin Beta
The top track shows genes in the hemoglobin beta (HBB) locus. The next track shows RNA levels in the GM12878 and K562 cell lines. The last track shows density plots of DNase hypersensitivity in many cell lines. K562, a cell line similar to a red blood cell precursor, shows high RNA levels and strong DNase hypersensitivity.

A more typical locus: PICALM
DNase patterns are typically less specific to a single cell type, as seen here.

Histone Mark and related ChIP-seq
Various histone marks give a broad picture of promoters, enhancers, repressed regions, and transcribed regions.
ENCODE data sets currently include 9 histone marks + CTCF (insulator mark) in 9 cell lines. More are planned.

Histone marks on 2 cell lines
Histone mark data at the same locus in two cell lines, GM12878 (red) and K562 (blue). Different marks are associated with promoters, transcribed regions, silencers, enhancers, etc. Most marks are darker in K562, which is more actively transcribing this region.

Transcription Factor ChIP-seq
ENCODE has data on 57 factors, most in several cell lines where they are expressed. More are coming.

Making data fit on a single screen
All of the ENCODE data is excellent, but there is so much of it that it can be hard to know whether you've seen everything relevant. The problem is most acute in transcription factor ChIP-seq, but it is really a problem everywhere. Lately UCSC has developed several ways of visually summarizing the data.

Integrating DNase across cell lines
[Figure: HBB gene, with tracks for DNaseI signal, peaks, and clustered peaks.]
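One simple way to picture clustering peaks across cell lines is to merge overlapping intervals and record which cell lines contributed to each merged region. The Python sketch below does only that; it is an assumed illustration, not the method UCSC used to build the clustered track, and the function name `cluster_peaks` and all coordinates are invented for the example.

```python
from typing import Dict, List, Tuple

Interval = Tuple[str, int, int]  # (chrom, start, end), half-open coordinates


def cluster_peaks(peaks_by_cell_line: Dict[str, List[Interval]]) -> List[dict]:
    """Merge overlapping peaks from all cell lines into clusters.

    Each cluster records its span and the set of cell lines with a peak in it.
    """
    # Flatten to (chrom, start, end, cell_line) and sort by position.
    flat = sorted(
        (chrom, start, end, cell)
        for cell, peaks in peaks_by_cell_line.items()
        for chrom, start, end in peaks
    )
    clusters: List[dict] = []
    for chrom, start, end, cell in flat:
        last = clusters[-1] if clusters else None
        if last and last["chrom"] == chrom and start < last["end"]:
            # Overlaps the current cluster: extend it and note the cell line.
            last["end"] = max(last["end"], end)
            last["cell_lines"].add(cell)
        else:
            clusters.append(
                {"chrom": chrom, "start": start, "end": end, "cell_lines": {cell}}
            )
    return clusters


# Made-up peaks for three ENCODE cell line names; coordinates are illustrative only.
example = {
    "K562": [("chr11", 100, 250), ("chr11", 900, 1000)],
    "GM12878": [("chr11", 200, 300)],
    "HepG2": [("chr11", 240, 320), ("chr12", 50, 80)],
}
clusters = cluster_peaks(example)
```

The number of cell lines per cluster (`len(c["cell_lines"])`) gives a rough sense of how cell-type-specific a region is, in the spirit of the HBB versus PICALM comparison earlier in the talk.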

Rainbow overlay for histone marks

Integrated regulatory tracks in context with other genomics information at UCSC

Acknowledgements
Programming – Tim Dreszer, Brian Raney, Galt Barber
Wrangling – Cricket Sloan, Venkat Malladi, Melissa Cline
Testing – Katrina Learned and colleagues
Systems – Erich Weiler, Victoria Lin, Jorge Garcia
Cat Herding – Kate Rosenbloom, Jim Kent
Funding – NHGRI, HHMI, QB3

The End