eQTL Tools a collaboration in progress

Slides:



Advertisements
Similar presentations
Bioinformatics Platform Three-tier Architecture Object-based Relational Database implemented using Oracle Middleware implemented using Entity-Class Operations,
Advertisements

Copyright © 2008, SAS Institute Inc. All rights reserved. Discovering Meaningful Patterns in Genomics Data with JMP Genomics Jordan Hiller JMP Genomics.
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Gene Set Enrichment Analysis (GSEA)
EBI Proteomics Services Team – Standards, Data, and Tools for Proteomics Henning Hermjakob European Bioinformatics Institute SME forum 2009 Vienna.
A Systematic approach to the Large-Scale Analysis of Genotype- Phenotype correlations Paul Fisher Dr. Robert Stevens Prof. Andrew Brass.
Inferring Causal Phenotype Networks Elias Chaibub Neto & Brian S. Yandell UW-Madison June 2010 QTL 2: NetworksSeattle SISG: Yandell ©
Computational Infrastructure for Systems Genetics Analysis Brian Yandell, UW-Madison high-throughput analysis of systems data enable biologists & analysts.
The Center for Computational Genomics and Bioinformatics Christopher Dwan Mike Karo Tim Kunau.
Teresa Przytycka NIH / NLM / NCBI RECOMB 2010 Bridging the genotype and phenotype.
Cornell University Library Instruction Statistics Reporting System Members: Patrick Chen (pyc7) Soo-Yung Cho (sc444) Gregg Herlacher (gah24) Wilson Muyenzi.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Informatics Support for Vaccine Projects Using and extending the UCSC bioinformatics infrastructure.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
Demonstration Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Computational Infrastructure for Systems Genetics Analysis Brian Yandell, UW-Madison high-throughput analysis of systems data enable biologists & analysts.
Characterizing the role of miRNAs within gene regulatory networks using integrative genomics techniques Min Wenwen
Detecting enriched regions (Chip- seq, RIP-seq) Statistical evaluation of enriched regions Data displayed in Genome Browser Detection of enriched motifs.
Quantile-based Permutation Thresholds for QTL Hotspots Brian S Yandell and Elias Chaibub Neto 17 March © YandellMSRC5.
Copyright OpenHelix. No use or reproduction without express written consent1.
Professor Brian S. Yandell joint faculty appointment across colleges: –50% Horticulture (CALS) –50% Statistics (Letters & Sciences) Biometry Program –MS.
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
Nobody’s Unpredictable Ipsos Portals. © 2009 Ipsos Agenda 2 Knowledge Manager Archway Summary Portal Definition & Benefits.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
MMAP: mouse Metabolomics Analysis Platform Preeti Bais 09/09/2014.
Breaking Up is Hard to Do: Migrating R Project to CHTC Brian S. Yandell UW-Madison 22 November 2013.
Introduction to caArray caBIG ® Molecular Analysis Tools Knowledge Center April 3, 2011.
Project of CZ5225 Zhang Jingxian:
Taverna Workflows for Systems Biology Katy Wolstencroft School of Computer Science University of Manchester.
CERN-PH-SFT-SPI August Ernesto Rivera Contents Context Automation Results To Do…
Data provenance in biomedical discovery Donald Dunbar Queen’s Medical Research Institute University of Edinburgh Workshop on Principles of Provenance in.
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Bioinformatics lectures at Rice University Li Zhang Lecture 11: Networks and integrative genomic analysis-3 Genomic data
A collaborative tool for sequence annotation. Contact:
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Yandell © June Bayesian analysis of microarray traits Arabidopsis Microarray Workshop Brian S. Yandell University of Wisconsin-Madison
October 2008BMI Chair Talk © Brian S. Yandell1 networking in biochemistry: building a mouse model of diabetes Brian S. Yandell, UW-Madison October 2008.
13 October 2004Statistics: Yandell © Inferring Genetic Architecture of Complex Biological Processes Brian S. Yandell 12, Christina Kendziorski 13,
Gene Mapping for Correlated Traits Brian S. Yandell University of Wisconsin-Madison Correlated TraitsUCLA Networks (c)
Konstantin Okonechnikov Qualimap v2: advanced quality control of
Summon® 2.0 Discovery Reinvented
Quantile-based Permutation Thresholds for QTL Hotspots
Quantile-based Permutation Thresholds for QTL Hotspots
Professor Brian S. Yandell
Operating System Concepts
Computational Infrastructure for Systems Genetics Analysis
Invest. Ophthalmol. Vis. Sci ;52(6): doi: /iovs Figure Legend:
Attie Bioinformatics Server Redesign
Data challenges in the pharmaceutical industry
ENCODE Pseudogenes and Transcription
Multiple Traits & Microarrays
Graphical Diagnostics for Multiple QTL Investigation Brian S
Inferring Genetic Architecture of Complex Biological Processes BioPharmaceutical Technology Center Institute (BTCI) Brian S. Yandell University of Wisconsin-Madison.
Inferring Genetic Architecture of Complex Biological Processes Brian S
Inferring Causal Phenotype Networks
High Throughput Gene Mapping Brian S. Yandell
Inferring Causal Phenotype Networks Driven by Expression Gene Mapping
Seattle SISG: Yandell © 2008
Schedule for the Afternoon
Inferring Genetic Architecture of Complex Biological Processes Brian S
Seattle SISG: Yandell © 2006
R/qtlbim Software Demo
Note on Outbred Studies
RecTech - Associated Recreation Council
SDLC Phases Systems Design.
Membership Login/sign in
A systems view of genetics in chronic kidney disease
The first two principal components for the islet gene expression data for the 181 microarray probes that map to the chromosome 6 trans-eQTL hotspot with.
FaceBase Hub Years 1 through 5
Presentation transcript:

eQTL Tools a collaboration in progress Brian Yandell & Bioinformatics Team Attie Lab, UW-Madison 1 jul 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 experimental context B6 x BTBR obese mouse cross model for diabetes and obesity 500+ mice from intercross (F2) collaboration with Rosetta/Merck genotypes 5K SNP Affymetrix mouse chip care in curating genotypes! (map version, errors, …) phenotypes clinical phenotypes (>100 / mouse) gene expression traits (>40,000 / mouse / tissue) other molecular phenotypes eQTL Tools Seattle SISG: Yandell © 2009

how does one filter traits? want to reduce to “manageable” set 10/100/1000: depends on needs/tools How many can the biologist handle? how can we create such sets? data-driven procedures correlation-based modules Zhang & Horvath 2005 SAGMB, Keller et al. 2008 Genome Res Li et al. 2006 Hum Mol Gen mapping-based focus on genome region function-driven selection with database tools GO, KEGG, etc Incomplete knowledge leads to bias random sample eQTL Tools Seattle SISG: Yandell © 2009

why build Web eQTL tools? common storage/maintainence of data one well-curated copy central repository reduce errors, ensure analysis on same data automate commonly used methods biologist gets immediate feedback statistician can focus on new methods codify standard choices eQTL Tools Seattle SISG: Yandell © 2009

how does one build tools? no one solution for all situations use existing tools wherever possible new tools take time and care to build! downloaded databases must be updated regularly human component is key need informatics expertise need continual dialog with biologists build bridges (interfaces) between tools Web interface uses PHP commands are created dynamically for R continually rethink & redesign organization eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

steps in using Web tools user enters data on Web page PHP tool interprets user data PHP builds R script R run on script creates plots, summaries, warnings PHP grabs results & displays on page user examines, saves user modifies data and reruns eQTL Tools Seattle SISG: Yandell © 2009

raw data or fancy results? raw data flexible but slow LOD profiles for 100 (1000) traits? fancy results from sophisticated analysis IM, MIM, BIM, MOM analysis too complicated to put in biologists’ hands? methods are unrefined, state-of-art, research tools use of methods involved many subtle choices batch computation over weeks compute once, save, display many times eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 automated R script library('B6BTBR07') out <- multtrait(cross.name='B6BTBR07', filename = 'scanone_1214952578.csv', category = 'islet', chr = c(17), threshold.level = 0.05, sex = 'both',) sink('scanone_1214952578.txt') print(summary(out)) sink() bitmap('scanone_1214952578%03d.bmp', height = 12, width = 16, res = 72, pointsize = 20) plot(out, use.cM = TRUE) dev.off() eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 eQTL Tools Seattle SISG: Yandell © 2009

Seattle SISG: Yandell © 2009 Swertz & Jansen (2007) eQTL Tools Seattle SISG: Yandell © 2009