National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC Berkeley UC San Diego UIUC Biology Thrust.

Slides:



Advertisements
Similar presentations
1 Inference of Regulatory Networks via Systems Genetics Ina Hoeschele.
Advertisements

Biological pathway and systems analysis An introduction.
1 Genetics The Study of Biological Information. 2 Chapter Outline DNA molecules encode the biological information fundamental to all life forms DNA molecules.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
COG and GO tutorial.
Basic Biology for CS262 OMKAR DESHPANDE (TA) Overview Structures of biomolecules How does DNA function? What is a gene? How are genes regulated?
Introduction to Genomics, Bioinformatics & Proteomics Brian Rybarczyk, PhD PMABS Department of Biology University of North Carolina Chapel Hill.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Bioinformatics Student host Chris Johnston Speaker Dr Kate McCain.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Bioinformatics Jan Taylor. A bit about me Biochemistry and Molecular Biology Computer Science, Computational Biology Multivariate statistics Machine learning.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC Berkeley UC San Diego UIUC Biology Thrust.
Paper in “Evolution and Development” The Evolution of Signaling Pathways In Animal Development Andr é Pires-daSilva & Ralf J. Sommer Nature Reviews Genetics,
Epigenome 1. 2 Background: GWAS Genome-Wide Association Studies 3.
Shankar Subramaniam University of California at San Diego Data to Biology.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Biology 1060 Chapter 17 From Gene to Protein. Genetic Information Important: Fig Describe how genes control phenotype –E.g., explain dwarfism in.
Synthetic biology: New engineering rules for emerging discipline Andrianantoandro E; Basu S; Karig D K; Weiss R. Molecular Systems Biology 2006.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Science & Technology Centers Program Center for Science of Information National Science Foundation Science & Technology Centers Program Bryn Mawr Howard.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard University MIT Princeton Purdue.
Genomes and Their Evolution. GenomicsThe study of whole sets of genes and their interactions. Bioinformatics The use of computer modeling and computational.
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
The Cancer Systems Biology Consortium (CSBC)
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
DNA to Protein – 12 Part one AP Biology. What is a Gene? A gene is a sequence of DNA that contains the information or the code for a protein or an RNA.
Claim, Evidence, Reasoning and Experimental Design Review.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Genes and Genomic Datasets. DNA compositional biases Base composition of genomes: E. coli: 25% A, 25% C, 25% G, 25% T P. falciparum (Malaria parasite):
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Announcements Urban Forestry data and photos due next week after the break. Reading. Writing assignment due Oct 18. Choose one of the characteristics out.
OMICS International welcomes submissions that are original and technically so as to serve both the developing world and developed countries in the best.
Decoding the Network Footprint of Diseases With increasing availability of data, there is significant activity directed towards correlating genomic, proteomic,
Recombination breakpoints Family Inheritance Me vs. my brother My dad (my Y)Mom’s dad (uncle’s Y) Human ancestry Disease risk Genomics: Regions  mechanisms.
Stephanie J. Culler, Kevin G. Hoff, Christina D. Smolke
Bioinformatics and Computational Biology
Personalized Lung Cancer Treatment: Targeting Stem Cell Pathways David M. Jablons, M.D. Professor and Chief Thoracic Surgery Ada Distinguished Professor.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
Shankar Subramaniam University of California at San Diego Data to Biology.
Chap 18 The Genetics of Viruses and Bacteria. Structure of Virus Approximately 20 nm in diameter Their genome can contain DNA or RNA. Enclosed by a.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Finding genes in the genome
TSC1/Hamartin and Facial Angiofibromas Biology 169 Ann Hau.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
KEY CONCEPT 8.5 Translation converts an mRNA message into a polypeptide, or protein.
Transcriptional Enhancers Looking out for the genes and each other Sridhar Hannenhalli Department of Cell Biology and Molecular Genetics Center for Bioinformatics.
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Sungkyunkwan University, School of Medicine.
University of California at San Diego
Emerging Challenges in Information Theory for Life Sciences
Genomes and Their Evolution
University of California at San Diego
University of California at San Diego
Cell Signaling.
THE NEW MEDICINE AND BIOLOGY Will they be Information Sciences?
Relationship between Genotype and Phenotype
Psychiatric Disorders: Diagnosis to Therapy
Psychiatric Disorders: Diagnosis to Therapy
The Content of the Genome
Deep Learning in Bioinformatics
Presentation transcript:

National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC Berkeley UC San Diego UIUC Biology Thrust

Science & Technology Centers Program Biology has rapidly become a data rich science While broad disciplines within biology, over the past five decades have taken a deconstructive view, there is tremendous activity in an integrated systems view of bio-systems. Traditional concepts in Information Theory have been critical for traditional analyses and modeling and bioinformatics. 2

Science & Technology Centers Program Information Source Transmitter ReceiverDestination Noise Source MessageSignal Received Signal Message A generalized communication system, from Shannon (1948) 3

Science & Technology Centers Program RNA Polymerase Transmitter RNA Polymerase Receiver RNA Destination Ribosome Noise Source Transcription Error, Mutation Message Sequence Signal RNA Sequence Received Signal Completed RNA Sequence Message RNA Sequence Information Source DNA 4

Science & Technology Centers Program Information Theory and Life Sciences: Renaissance Initial efforts focused on sequence conservation, gene finding, motifs, their structural and functional implications, evolution, and phylogeny. Complemented by phenotype databases, significant advances have been made in understanding the genetic basis of disease through information theoretic methods and formalisms.

Science & Technology Centers Program Information Theory and Life Sciences: Some Examples Allikmets et al., Gene A G/C mutation at location 366 in the ABCR gene is implicated in macular degeneration (glycene to alanine in exon 17). This was identified through information theoretic analysis of splice acceptors.

Science & Technology Centers Program Information Theory and Life Sciences: Some Examples Rogan et al., Human Mutation, Splicing varies among 3 common alleles that differ in length in the polymorphic polythymidine tract of the IVS 8 acceptor of the gene encoding the cystic fibrosis transmembrane regulator

Science & Technology Centers Program Long block code, discrete alphabet, extensive redundancy, perhaps to control against the infiltration of errors. DNA also controls gene expression, an intra-organism process, so a comprehensive theory of intra-organism communication, i.e. a channel theory is needed. DNA enables two organisms to communicate; it’s designed for inter-organism communication. 8

Science & Technology Centers Program For genetic information, the context includes –Impact of cellular environment –Impact of the context within the sequences themselves; are there larger patterns within the genetic code? –Impact of multiple reading frames Beyond cells, there is context for tissue-specific development, at coarser levels, organs, organisms, ecosystems, and beyond 9

Science & Technology Centers Program Information Theory and Life Sciences: Scratching the Surface Fatima et al. Cancer Epidemiol Biomarkers Prev 2008 Enriched functional categories and pathways in colorectal cancer cell lines following treatment

Science & Technology Centers Program Information Theory and Life Sciences: Emerging Frontiers Sun et al., JCI 2007 Hedgehog (HH), Notch, and Wnt signaling are key stem cell self-renewal pathways that are deregulated in lung cancer and thus represent potential therapeutic targets

Science & Technology Centers Program Key Outstanding Challenges Information in spatio-temporal data Scaling from molecular processes within the cell to entire populations Timescales ranging from femtosecond-scale ligand binding to eons

Science & Technology Centers Program Key Outstanding Challenges Information in systems/networks Modularity and function-based information measures Comparative/ discriminant analysis Methods and validation

Science & Technology Centers Program Key Outstanding Challenges Information and context Tissue specific pathways Normal physiology versus pathology Data transformation, reduction, and abstraction Data complexity, noise Signal transduction Models, manifestation, and granularity

Science & Technology Centers Program Information in Systems: Near-Term Challenges Information Theoretic measures and methods for modularity in biochemical networks Models and methods for conservation in large networks Methods for in-silico network inference Integration of tools into the BioPathways WorkBench Identification/ Curation of data sources for phenotype-characterized data in support of discriminant analysis

Science & Technology Centers Program Information in Systems: Medium-Term (years 2/3) Challenges Role of spatial compartmentalization in function (spatio-temporal information flow) Characterization of phenotype-implicated data Models and methods for discriminant and discriminating sub-networks Relationship between information content/ flow, network stability, and biological function Scaling up from cellular to intra-cellular networks

Science & Technology Centers Program Frameworks and Portals Over a million sessions and counting!