Genome Informatics 2005 ~ 220 participants 1 keynote speaker: David Haussler 47 talks 121 posters.

Slides:



Advertisements
Similar presentations
Periodic clusters. Non periodic clusters That was only the beginning…
Advertisements

Gene regulation /function card Anatomical network card Tassy et al., Figure S1: Navigation diagram of ANISEED Anatomical structure card Expression card.
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Combined analysis of ChIP- chip data and sequence data Harbison et al. CS 466 Saurabh Sinha.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Basic Genomic Characteristic  AIM: to collect as much general information as possible about your gene: Nucleotide sequence Databases ○ NCBI GenBank ○
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Bioinformatics Needs for the post-genomic era Dr. Erik Bongcam-Rudloff The Linnaeus Centre for Bioinformatics.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Predicting RNA Structure and Function
Gene expression analysis summary Where are we now?
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Introduction to BioInformatics GCB/CIS535
Data visualization in the post-genomics era Carol Morita Genentech, Inc.
Displaying associations, improving alignments and gene sets at UCSC Jim Kent and the UCSC Genome Bioinformatics Group.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Alternative Splicing As an introduction to microarrays.
Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Predicting RNA Structure and Function. Nobel prize 1989 Nobel prize 2009 Ribozyme Ribosome.
UCSC Known Genes Version 3 Take 10. Overall Pipeline Get alignments etc. from database Remove antibody fragments Clean alignments, project to genome Cluster.
Gene Finding Genome Annotation. Gene finding is a cornerstone of genomic analysis Genome content and organization Differential expression analysis Epigenomics.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Computational Molecular Biology Biochem 218 – BioMedical Informatics Gene Regulatory.
Genome organization Eukaryotic genomes are complex and DNA amounts and organization vary widely between species.
A number of slides taken/modified from:
Ch10. Intermolecular Interactions and Biological Pathways
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Ultraconserved Elements in the Human Genome Bejerano, G., et.al. Katie Allen & Megan Mosher.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
From Structure to Function. Given a protein structure can we predict the function of a protein when we do not have a known homolog in the database ?
ModENCODE August 20-21, 2007 Drosophila Transcriptome: Aim 2.2.
Verna Vu & Timothy Abreo
1 RNA Bioinformatics Genes and Secondary Structure Anne Haake Rhys Price Jones & Tex Thompson.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
© Wiley Publishing All Rights Reserved. RNA Analysis.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Exploring Alternative Splicing Features using Support Vector Machines Feature for Alternative Splicing Alternative splicing is a mechanism for generating.
Computational Genomics and Proteomics Lecture 8 Motif Discovery C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
From Genomes to Genes Rui Alves.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
Eukaryotic Gene Prediction Rui Alves. How are eukaryotic genes different? DNA RNA Pol mRNA Ryb Protein.
Central dogma: the story of life RNA DNA Protein.
Bioinformatics and Computational Biology
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
Pre-mRNA secondary structures influence exon recognition Michael Hiller Bioinformatics Group University of Freiburg, Germany.
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Biological Networks.
Gene Structure and Identification III BIO520 BioinformaticsJim Lund Previous reading: 1.3, , 10.4,
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Motif Search and RNA Structure Prediction Lesson 9.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
Visualization of genomic data Genome browsers. How many have used a genome browser ? UCSC browser ? Ensembl browser ? Others ? survey.
Using public resources to understand associations Dr Luke Jostins Wellcome Trust Advanced Courses; Genomic Epidemiology in Africa, 21 st – 26 th June 2015.
bacteria and eukaryotes
The Transcriptional Landscape of the Mammalian Genome
Networks and Interactions
High-throughput Biological Data The data deluge
Presentation transcript:

Genome Informatics 2005 ~ 220 participants 1 keynote speaker: David Haussler 47 talks 121 posters

Rodger Voelker:Two classes of splice junctions Search for 5-7 base motifs in exonic and intronic flanking sequences of known splice junctions Computational analysis of collocations between different motifs Many collocations between exonic and intronic sequences Known ESEs display collocations with intronic sequences (including ISEs) Nearly all introns (89%) can be classified into 2 classes

Chip Lawrence: futility of optima in inferences The strong focus in bioinformatics on optimal solutions is fundamentally flawed, because the asymptotic underpinnings of these solutions, such as consistency, do not apply The curse of dimensionality can render optimal solutions very unlikely and misleading Example: minimum free energy predictions of RNA structures Reason: incomplete energy function used, only sec structure considered, no tertiary

Minimum free energy predictions of RNA structures Assumption: –molecule folds into lowest energy state –unique solution to folding problem (optimum) Many programs (e.g. Zuker's Mfold) use the Boltzmann probability function –Most include calculations of suboptimal structures –but not all structures are computed –PPV of MFE: 48 %

Alternative prediction of RNA structures Sample the ensemble of sec structures in proportion to their Boltzmann weights Cluster the structures Use centroid structure in predictions –Improved PPV compared to MFE Srna module of Sfold ( )

A.tumefaciens 5S rRNA energy landscape

Alternative prediction of RNA structures Improved PPV compared to MFE: –Ensemble centroid + 30 % –Largest cluster centroid +18 % –Best centroid + 47 %

Data mining Geneseer – searchable name-translation database ( ) Access to genomic information through gene names Mapping sequences to gene names Identification of homologs across several species for a given gene Used in RNAi Codex ( )

Data mining Ulysses – annotate human genes based on gene interactions in model organisms ( ) Interologs: conserved protein-protein interactions Regulogs: conserved protein-DNA interactions Almost no overlap between data in interaction databases BIND  DIP: 984 refs; BIND  5 DB's: 3 refs

Data mining Integrated Genome Browser (IGB) – visualize: – Genomic annotations from multiple data resources – Experimental data from Affymetrix arrays ( tools/download_igb.affx ) tools/download_igb.affx

Gene expression and pathways Skypainter tool in Reactome database: –allows overlay of gene expression data on pathway graphs –allows generation of a "movie" of a time series ( )

Gene expression ArrayBlast: Compares gene expression signatures generated on different platforms Uses public microarray data sets (GEO) Used to create conserved cancer-related expression signature ( )

Gene expression C. elegans Gene Expression Consortium: SAGE data from specific stages, tissues and cell types Database of gene expression data/pictures/movies of transgenic worms with promoter::GFP fusions for 2000 genes with human orthologs ( )

Michael Caudy: Whole genome analysis of combinatorial and architectural transcription codes Search for TFBS in known neural pathway genes Determine architecture: number, type, order, orientation and spacing of TFBS Compare architecture of activated and repressed genes Determine activity of promoters with TFBS mutations Architecture is critical for differential response to Notch signalling

Regulatory sequence identification Evoprinter: highlights multi-species conserved sequences within orthologous DNAs in the context of a single species of interest ( )

Regulatory sequence identification NestedMICA: –method for discovering many over-represented short motifs in large sets of strings in a single run –candidate transcription factor binding sites ( mica/ ) mica/