Molecular Biology of Cancer AND Cancer Informatics (omics) David Boone.

Slides:



Advertisements
Similar presentations
Cancer & Mutations Powerpoint
Advertisements

Cancer—Principles and overview By Robert A. Weinberg
Alterations in the Cell Cycle and Gene Mutations that Cause Cancer
Chapter 19 Lecture Concepts of Genetics Tenth Edition Cancer and Regulation of the Cell Cycle.
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis Yan Guo.
RNA and PROTEIN SYNTHESIS
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
 2.e.1 – Timing and coordination of specific events are necessary for the normal development of an organism, and these events are regulated by a variety.
Chapter 13- RNA and Protein Synthesis
The origin of metastatic disease: clues from genomics 7/13/2011.
Canadian Bioinformatics Workshops
1. What is the Central Dogma? 2. How does prokaryotic DNA compare to eukaryotic DNA? 3. How is DNA organized in eukaryotic cells?
34 Cancer.
RNA = RiboNucleic Acid Synthesis: to build
High Throughput Sequencing
RNA Ribonucleic Acid.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Tumor Supressor Gene Non-functional TSG Mutations increasing risk of cancer “Loss of function” mutation Proto-oncogene Oncogene (Hyperactive or unregulated.
Tumor genetics Minna Thullberg
NOTES: CH 18 part 2 - The Molecular Biology of Cancer
REGULATION OF GENE EXPRESSION Chapter 18. Gene expression A gene that is expressed is “turned on”. It is actively making a product (protein or RNA). Gene.
CHAPTER 18 Molecular Biology and Medicine
Inquiry into Life Twelfth Edition Chapter 25 Lecture PowerPoint to accompany Sylvia S. Mader Copyright © The McGraw-Hill Companies, Inc. Permission required.
Genetics-multistep tumorigenesis genomic integrity & cancer Sections from Weinberg’s ‘the biology of Cancer’ Cancer genetics and genomics Selected.
Group Number: 2 Britney Porter, Sandra Nguyen, Eduardo Vargas and Samender Singh Randhawa.
RNA and Protein Synthesis
RNA AND PROTEIN SYNTHESIS RNA vs DNA RNADNA 1. 5 – Carbon sugar (ribose) 5 – Carbon sugar (deoxyribose) 2. Phosphate group Phosphate group 3. Nitrogenous.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Chapter 11 Regulation of Gene Expression. Regulation of Gene Expression u Important for cellular control and differentiation. u Understanding “expression”
Biotechnology and Genetic Engineering. Human Cloning-The Science In The News.
GENE REGULATION ch 18 CH18 Bicoid is a protein that is involved in determining the formation of the head and thorax of Drosophila.
The Biology and Genetic Base of Cancer. 2 (Mutation)
Microarrays and Their Uses Brad Windle, Ph.D
Mutations.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
Cancer When cell division goes wrong……. Growing out of control, cancer cells produce malignant tumors Cancer is a general term for many diseases in.
Gene Expression. Cell Differentiation Cell types are different because genes are expressed differently in them. Causes:  Changes in chromatin structure.
Gene Regulations and Mutations
Genetics of Cancer Genetic Mutations that Lead to Uncontrolled Cell Growth.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
ROLE OF GENE EXPRESSION:  Activation of a gene that results in a protein  Cells DO NOT need to produce proteins for every code. GENOME:  Complete genetic.
Benign Versus Malignant Tumors
Cancer By: Aujan M., Zach J., Aditya P.. * Genetic disease that results in uncontrolled growth. * Mutation in genetic code results in failure of cell.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
CHAPTER 19 THE ORGANIZATION AND CONTROL OF EUKARYOTIC GENOMES Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Section D: The.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Genetics of Cancer Genetic Mutations that Lead to Uncontrolled Cell Growth.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Cancer Bioinformatics Tom Doman Bioinformatics Scientist Eli Lilly & Company Informatics 519 guest lecture IU Bloomington Sept
1. What is the Central Dogma? 2. How does prokaryotic DNA compare to eukaryotic DNA? 3. How is DNA organized in eukaryotic cells?
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
The Problem of Cancer. What are cancer cells ? Cancerous growth involves unrestrained proliferation (malignancy) and spread (metastasis). Caused by: mutations.
The Organization and Control of Eukaryotic Genomes Ch. 19 AP Biology Ms. Haut.
Chapter 13- RNA and Protein Synthesis
Molecular Genetics: Part 2B Regulation of metabolic pathways:
GENETIC BASIS OF CANCER
Gene Expression.
Microarray Technology and Applications
Inquiry into Life Twelfth Edition
Alterations in the Cell Cycle and Gene Mutations that Cause Cancer
Last Week’s Reading Assignments
What makes a mutant?.
Concept 18.5: Cancer results from genetic changes that affect cell cycle control The gene regulation systems that go wrong during cancer are the very same.
Inquiry into Life Twelfth Edition
Genetics of Cancer.
Chapter 18: Regulation of Gene Expression
Review Warm-Up What is the Central Dogma?
Presentation transcript:

Molecular Biology of Cancer AND Cancer Informatics (omics) David Boone

Outline What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

Goals Understand why cancer treatment is difficult. Learn how changes in genome and/or expression can affect cell proliferation/death. – looking at signaling schematic be able to identify what a certain mutation might do. Define oncogenes and tumor suppressors. Define two ways of gathering ‘omics’ data Be able to analyze heat maps and Kaplan Meier Curve Describe how informatics is impacting personalized medicine.

Outline What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

How is cancer different from a bacterial infection? Term used to describe 100s or 1000s of distinct neoplastic disorders caused by your own cells growing out of control.

Normal

Cancer

Metastasis Spreading of cancer from one organ to another

The many steps of Metastasis occur through clonal selection and the accumulation of different mutations Fidler 2003 nature reviews cancer

Outline What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

Structure = function DNA RNAPROTEIN

The Central Dogma of Molecular Biology Structure is very important – replication – transcription/translation info to create proteins DNA RNA Protein 23 chromosomes 3.2 billion base pairs ~20000 genes (1.5% of genome) DNA

Question If all of our cells (mostly) have identical DNA sequences that are densely packed in nuclei then how do we have the different types of cells and organs necessary for human life? Regulation of gene expression

Central Dogma No T’s replaced with U’s Translation start Translation stop

Important genes in cancer Oncogenes- genes that encode for proteins that have the potential to cause cancer. – like a gas pedal they make the cells divide more frequently or survive when they shouldn’t. – turned on by activating mutations, amplifications, and overexpression. – ex. c-Myc, IGF1R, Ras (growth factors, signaling molecules, transcription factors)

IGF1R is an important oncogene in BC – Can you pick out any other potential oncogenes? Survival

Important genes in cancer Tumor Suppressors- genes that encode for proteins that prevent tumor development. – like a break pedal they prevent proliferation and initiate cell death if there are problems like DNA damage. They keep proliferation and oncogenes in check. – Stop the cell cycle, induce apoptosis, DNA repair, etc. – turned off by inactivating mutations, deletions, and lack of expression. – ex. p53, BRCA1/2, PTEN, RB

Myc (oncogene) is one of the most frequent amplifications in BC. p53 (tumor suppressor) is one of the most frequently mutated or lost genes in BC. too much

Outline What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

Genetic alterations (All mutations are not equal) Point mutations

Frameshift mutation K A* DNA RNA Protein

Genetic alterations Point mutations Duplications or amplifications Myc

Genomic alterations Point mutations Duplications or amplifications Deletions p53

Genomic alterations Point mutations Duplications or amplifications Deletions Insertions/ Inversions

Genomic alterations Point mutations Duplications or amplifications Deletions Insertions/ Inversions Translocations

Expression alterations outside of the genome Overexpression. – ex. Myc (high mitogenic signaling results in high expression of unmutated Myc. Epigenetic silencing – ex. methylation

Outline/Summary Molecular Biology of Cancer What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

Outline What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

Now we know a lot about individual genes, but how do we study global genetic or expression changes? Microarrays – Hybridization based (complementary base pairing) – Comparative Genome Hybridization (CGH) or SNP arrays for known DNA variants and copy number changes. – mRNA Expression arrays for RNA expression. Next-Gen sequencing – Sequence based – can be used for DNA or RNA – can detect mutations, copy number changes, and expression differences.

Microarrays Used for: 1)Gene expression 2)Copy number changes whole transcript expression array covers 28,869 well- annotated genes with 764,885 distinct probes. affy 6.0 SNP chip- 906,000 SNPs and ~ 1,000,000 ADVANTAGES: Relatively cheap Analyze all known genes Analysis is relatively easy DISADVANTAGES Cannot detect novel genes, mutations, or copy number changes

Next-gen sequencing Advantages- -No limitation on novel detections -simultaneously discover expression or copy number changes AND mutations. Disadvantages: -expensive -analysis is complex and difficult (but not for iBRIC scholars!) Human Genome Project took ~13 years and cost ~3 billion dollars. 3billion bases Now in a few weeks for less than a few thousand dollars, we can sequence a genome.

The sequencing revolution $1000 genome Human genome project 15 years $3,000,000, genomes 1 day $45,000

The Cancer Genome Atlas (TCGA) The Cancer Genome Atlas (TCGA) is a comprehensive and coordinated effort to accelerate the understanding of the molecular basis of cancer through the application of genome analysis technologies, including large- scale genome sequencing. 33 cancers 275 million dollars patients 2700 publications since started in 2006 RNA expression (RNAseq and microarray) Exome sequencing (mutation) Whole genome (for some) Copy number Methylation miRNA expression

ENCODE (encyclopedia of DNA elements) – The human genome project or our generation Build a comprehensive parts list of functional elements in the human genome.

Outline What is cancer and how is it different from other diseases? Major players in tumor initiation (Understand one before many) – Oncogenes – Tumor suppressors Types of mutations The many – informatics – arrays – sequencing – How to read a heatmap Personalized Medicine/cancer genomics WORKS – examples from breast cancer. Summary Molecular biology of cancer Cancer Informatics (omics)

HOW DO WE MAKE SENSE OF IT ALL?!?!

Pattern recognition/Logic Problems

Personalized medicine Bayer

Breast Cancer- IT WORKS!! Historically classified based on tumor size, nodal involvement, invasion, histology, etc. More recently genomics and transcriptomics have provided clues to what drives tumor initiation and progression and has the potential to divide patients into separate groups that might respond to different therapies. Sørlie T et al. PNAS 2001;98:

Gene expression patterns of 85 experimental samples representing 78 carcinomas, three benign tumors, and four normal tissues, analyzed by hierarchical clustering using the 476 cDNA intrinsic clone set. ER- ER+ basalHER2+Normal basal like luminal A luminal B Finding Patterns!!! Gene Expression Patterns Reveal Novel Breast Cancer Sub-types patient gene highlow

Overall and relapse-free survival analysis of the 49 breast cancer patients, uniformly treated in a prospective study, based on different gene expression classification. Sørlie T et al. PNAS 2001;98: ©2001 by National Academy of Sciences Personalized medicine WORKS!!!!

Gene expression use in the clinic Mammaprint – approved by FDA – 70 gene signature for patients with node negative and ER+ tumors – compared to conventional classification in ~300 patients 87 would have been treated differently. 67 were determined high risk by conventional methods and were given chemo but by Mammaprint were classified low risk. Followed patients for 10 yrs and Mammaprint was more accurate Oncotype PAM50 All good for deciding chemo vs. endocrine therapy, but still need classifications for ER- tumors

Summary Cancer is different than other diseases because it is really 100s or 1000s of diseases and is your own cells gone bad. Oncogenes – cause cancer – hyperactive – mutation, amplification Tumor Suppressors- prevent cancer – Prevent cell cycle progression – Induce cell death – DNA repair – lost or turned off – mutation, deletion, methylation Different types of alterations – Somatic mutation – amplification – deletion – methylation Global analysis (omics) – Microarray – Sequencing Personalized Medicine – Biomarker – How to read a heatmap and KM curve

12 highlow 345 A

Sample Preparation Lyse cells DNA RNA cDNA Histone modifications/TF Binding ChIPseq RNAseq WGS ChIPseq

Library Preparation Or cDNA

Cluster Generation and Sequencing Illumina Sequencing by synthesis

Data Analysis RNAseq Additionally may require 1) transcript assembly and 2) estimation of abundance

Problem After performing RNAseq analysis on a matched pair of tumor and normal tissue from a single patient you find that a novel gene is expressed 4 times higher in the tumor sample. Your collaborator conclude that this gene is transcribed at higher rates in tumors. – What are alternative interpretations? Think of caveats to RNAseq.

RNAseq caveats Rnaseq is a steady state measurement – RNA abundance is a result of transcription and degradation RNAseq is the measurement of the average expression in the population. – Doesn’t tell you expression in individual cells. Instead demonstrates expression in the pool of cells and perhaps different cell types. RNA temporally and spatially regulated.

Other sequencing caveats. Aligning to the ‘reference genome’ 38 th main alignment 20 th main release Still have regions of unknown location

Up to 50% of human genome is non-unique Problem for aligning Many isoforms share exons Other sequencing caveats..Repetitive regions and shared exons