Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.

Slides:



Advertisements
Similar presentations
The Diagnostic Laboratory ……the ideal system……. Molecular Genetics Diagnostic Laboratory Exciting area of medical pathology Need to continually up-date.
Advertisements

LS-SNP: Large-scale annotation of coding non- synonymous SNPs based on multiple information sources -Bioinformatics April 2005.
Yan Guo Assistant Professor Department of Cancer Biology Vanderbilt University USA.
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Wrapup. NHGRI strategic plan What does the NIH think genomics should be for the next 10 years? [Nature, Feb. 2011]
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Transcriptomics Breakout. Topics Discussed Transcriptomics Applications and Challenges For Each Systems Biology Project –Host and Pathogen Bacteria Viruses.
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
CHAPTER 15 Microbial Genomics Genomic Cloning Techniques Vectors for Genomic Cloning and Sequencing MS2, RNA virus nt sequenced in 1976 X17, ssDNA.
Human Genetics Overview.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
 MicroRNAs (miRNAs) are a class of small RNA molecules, about ~21 nucleotide (nt) long.  MicroRNA are small non coding RNAs (ncRNAs) that regulate.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Give me your DNA and I tell you where you come from - and maybe more! Lausanne, Genopode 21 April 2010 Sven Bergmann University of Lausanne & Swiss Institute.
Introduction to Molecular Epidemiology Jan Dorman, PhD University of Pittsburgh School of Nursing
Michael Cummings David Reisman University of South Carolina Genomes and Genomics Chapter 15.
Bioinformatics Ayesha M. Khan Spring Phylogenetic software PHYLIP l 2.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Control of Gene Expression Eukaryotes. Eukaryotic Gene Expression Some genes are expressed in all cells all the time. These so-called housekeeping genes.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Development of Bioinformatics and its application on Biotechnology
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Presented by: Andrew McMurry Boston University Bioinformatics Children’s Hospital Informatics Program Harvard Medical School Center for BioMedical Informatics.
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Gene Hunting Natália F. Martins. Resumo Motivação Estratégia Automatização (?) Exemplos Referências.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Precision Medicine A New Initiative. The Concept of Precision Medicine (PM) The prevention and treatment strategies that take individual variability into.
Data Analysis Summary. Elephant in the room General Comments General understanding that informatics is integral in medical sequencing and other –omics.
Genetics-multistep tumorigenesis genomic integrity & cancer Sections from Weinberg’s ‘the biology of Cancer’ Cancer genetics and genomics Selected.
Genomes and Genomics.
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
Overview of Bioinformatics 1 Module Denis Manley..
Using Predictive Classifiers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
OMICS International welcomes submissions that are original and technically so as to serve both the developing world and developed countries in the best.
Structural Models Lecture 11. Structural Models: Introduction Structural models display relationships among entities and have a variety of uses, such.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
Central dogma: the story of life RNA DNA Protein.
Bioinformatics and Computational Biology
Lecture 11. Topics in Omic Studies (Cancer Genomics, Transcriptomics and Epignomics) The Chinese University of Hong Kong CSCI5050 Bioinformatics and Computational.
HW2: exome sequencing and complex disease Jacquemin Jonathan de Bournonville Sébastien.
The International Consortium. The International HapMap Project.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Finding genes in the genome
Starter What do you know about DNA and gene expression?
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
A high-resolution map of human evolutionary constraints using 29 mammals Kerstin Lindblad-Toh et al Presentation by Robert Lewis and Kaylee Wells.
Human Genomics Higher Human Biology. Learning Intentions Explain what is meant by human genomics State that bioinformatics can be used to identify DNA.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
ARCH/VCDE F2F BoF And the Presentation Subtitle Goes Here Ravi Madduri December 2008.
Sungkyunkwan University, School of Medicine.
Dept of Biomedical Informatics University of Pittsburgh
Introduction to bioinformatics lecture 11 SNP by Ms.Shumaila Azam
Content and Labeling of Tests Marketed as Clinical “Whole-Exome Sequencing” Perspectives from a cancer genetics clinician and clinical lab director Allen.
Genetics: From Genes to Genomes
The Study of Biological Information
The Genetic Basis for Cancer Treatment Decisions
Volume 58, Issue 4, Pages (May 2015)
Pan-cancer genome and transcriptome analyses of 1,699 paediatric leukaemias and solid tumours By: Anh Pham.
Presentation transcript:

Presented by Karen Xu

Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but it is thought to be primarily caused by mutations in specific tissues that accumulate over time

Difference between cancer genome analysis and GWAS GWAS use large cohorts of cases to analyze the relationship between the disease and thousands or millions of mutations across the entire genome The study of cancer genome is different. During the lifetime of the organism variants only accumulate in the tumor or the affected tissue and they are not transmitted from generation to generation-----somatic mutations

Types of cancer genome analysis May focus on the cancer type or the patient 1. examining a cohort of patients suffering from a particular type of cancer and is used to identify biomarkers, characterize cancer subtypes with clinical or therapeutic implications or to simply advance our understanding of the tumorigenic process 2. examining the genome of a particular cancer patient in the search for specific alterations that may be susceptible to tailored therapy

Figure 1. Idealized cancer analysis pipeline. Vazquez M, de la Torre V, Valencia A (2012) Chapter 14: Cancer Genome Analysis. PLoS Comput Biol 8(12): e doi: /journal.pcbi

Sequencing, Alignment and Variant calling After samples are sequenced, sequencing reads are aligned to a reference genome and all differences are identified through a process known as variant calling. The output of the variant calling is a list of genomic variations that is organized according to their genomic location (chromosome and position) and the variant allele. They may be accompanied by scores measuring the sequencing quality over that region or the prevalence of the variant allele in the samples. The workflow employed for this type of analysis is commonly known as a primary analysis.

Consequence, Recurrence analysis and candidate drivers DNA mutations are translated into mutations in RNA transcripts, and from RNA into proteins, potentially altering their amino acid sequence. The impact of these amino acid alterations on protein function can range from largely irrelevant to highly deleterious Severity of these alterations can be assessed using specialized software tools known as protein mutation pathogenicity predictors Mutations are also examined to identify recurrence, which may point to key genes and mutational hotspots Not all mutations that have deleterious consequences for protein function are necessarily involved in cancer

Pathways and Functional Analysis Genes recurrently mutated in cancer tend to be easily identifiable. Examples, TP53 and KRAS However, most often mutations are more widely distributed and the probability of finding the same gene mutated in several cases is low, making it difficult to identify common functional features associated with a given cancer Pathway analysis offers a means to overcome this challenge by associating mutated genes with known signaling pathways Cancer is not only a disease of the genes but also a disease of the pathways

Integration, Visualization and Interpretation Gene expression and alterations in the copy number of each gene, a very common phenomenon in cancer Mutations in promoters and enhancers Variation in the affinity of transcription factors and DNA binding proteins Dysregulation of epigenetic control

Current Challenges 1. The heterogeneity of the data to be analyzed, which ranges from genomic mutations in coding regions to alterations in gene expression or epigenetic marks 2. The range of databases software resources required to analyse and interpret the results 3. The comprehensive expertise required to understand the implications of such varied experimental data

Critical Bioinformatics Tasks in Cancer Genome Analysis

4 Critical Bioinformatics Tasks in Cancer Genome Analysis Mapping between coordinate systems Driver Mutations and Pathogenicity Prediction Functional Interpretation Actionable results: patient stratification and drug targets

Mapping between Coordinate systems Translating mutational information derived from genomic coordinates to other data types is the first step. Example: point mutations in coding regions can be mapped to different transcripts by finding the exon affected, the offset of the mutation inside that exon and the position of the exon inside the transcript

Driver Mutations and Pathogenicity Prediction “driver”----mutations that drive cancer onset and progesssion “passenger”----mutations that play little or no role in tumorigenic process but are propagated by their co- existence with driver mutations Experimental assays of activity are one means of testing the tumorigenic potential of mutations, although such assays are difficult to perform to scale. Statistical approaches seek to identify traces of mutation selection during tumor formation by looking at the prevalence of mutations in particular genes in sample cohorts, or the ratios of synonymous versus non- synonymous mutations in particular candidate genes.

Functional Intrepretation Frequently genomic data reveals the presence of mutated genes that are far less prevalent, and the significance of these genes must be considered in the context of the functional units they are part of. The involvement of genes in specific biological, metabolic and signaling pathways is the type of functional annotation most commonly considered and thus, functional analysis is often termed ‘pathway analysis’. The current systems for functional interpretation have been derived from the systems previously developed to analyze expression arrays, and they have been adapted to analyze lists of cancer-related genes.

Applicable results: diagnosis, patient stratification and drug therapies For clinical applications, the results of cancer genome analysis need to be translated into practical advice for clinicians, providing potential drug therapies, better tumor classification or early diagnostic markers.

Resources for Genome Analysis in Cancer Databases Some databases describe entities and their properties, such as: proteins and the drugs that target them; germline variations and the diseases with which they are associated; or genes along with the factors that regulate their transcription. Other databases are repositories of experimental data, such as the Gene Expression Omnibus and ArrayExpress, which contain data from microarray experiments on a wide range of samples and under a variety of experimental conditions. Software In cancer analysis pipelines, several tasks must be performed that require supporting software. These range from simple database searches to cross-check lists of germline mutations with lists of known SNPs, to running complex computational methods to identify protein- protein interaction sub-networks affected by mutations.

Workflow Enactment Tools and Visual Interfaces Given the complexity of cancer genome analysis, it is worth discussing how to design and execute (enact) workflows, which may become very elaborate. Workflows can be thought of as analysis recipes, whereby each analysis entails enacting that workflow using new data. Ideally a workflow should be comprehensive and cover the complete analysis process from the raw data to the final results. ---Improve Efficiency Limitations of Visual interfaces: overly complex, inflexible, and limited utility compared w/ general purpose programming language

Videos