GridQTL High Performance QTL analysis via the Grid/Cloud.

Slides:



Advertisements
Similar presentations
Chapter 14~ Mendel & The Gene Idea
Advertisements

Linkage and Genetic Mapping
Genetic research designs in the real world Vishwajit L Nimgaonkar MD, PhD University of Pittsburgh
Tutorial #1 by Ma’ayan Fishelson
ASSOCIATION MAPPING WITH TASSEL Presenter: VG SHOBHANA PhD Student CPMB.
6.6 Meiosis and Genetic Variation KEY CONCEPT Independent assortment and crossing over during meiosis result in genetic diversity.
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
Bob Weaber, Ph.D. Cow-Calf Extension Specialist Assistant Professor Dept. of Animal Sciences and Industry
Meiosis and Genetic Variation
The Inheritance of Complex Traits
Searching for interacting QTL Sverker Holmgren, Kajsa Ljungberg, Scientific Computing, UU Örjan Carlborg, Roslin Institute, Scotland Leif Andersson, Animal.
Fokkerij in genomics tijdperk Johan van Arendonk Animal Breeding and Genomics Centre Wageningen University.
Simulation/theory With modest marker spacing in a human study, LOD of 3 is 9% likely to be a false positive.
Genome-Wide Association Studies
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Human Genetics Unit.
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
Rare and common variants: twenty arguments G.Gibson Homework 3 Mylène Champs Marine Flechet Mathieu Stifkens 1 Bioinformatics - GBIO K.Van Steen.
Introduction to Medical Genetics
6.6 Meiosis and Genetic Variation KEY CONCEPT Independent assortment and crossing over during meiosis result in genetic diversity.
Modes of selection on quantitative traits. Directional selection The population responds to selection when the mean value changes in one direction Here,
Methods of Genome Mapping linkage maps, physical maps, QTL analysis The focus of the course should be on analytical (bioinformatic) tools for genome mapping,
Chapter 9 – Patterns of Inheritance
Introduction to BST775: Statistical Methods for Genetic Analysis I Course master: Degui Zhi, Ph.D. Assistant professor Section on Statistical Genetics.
ConceptS and Connections
Multifactorial Traits
Thursday from QTL to candidate genes Xidan Li Xiaodong Liu DJ de Koning.
Computational research for medical discovery at Boston College Biology Gabor T. Marth Boston College Department of Biology
6.6 Meiosis and Genetic Variation Let’s Visit the Nursery! What did we learn from our little Potato Lab darlings? What were some examples of genetic variations.
Non-Mendelian Genetics
The medical relevance of genome variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen,
Jeopardy Genes and Chromosomes Basics
Gene Hunting: Linkage and Association
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 7-1 Chapter 7 Multifactorial Traits.
Announcements: Proposal resubmission deadline 4/23 (Thursday).
Polygenic and Multifactorial Inheritance
Experimental Design and Data Structure Supplement to Lecture 8 Fall
Discovery of a rare arboreal forest-dwelling flying reptile (Pterosauria, Pterodactyloidea) from China Wang et al. PNAS Feb. 11, 2008.
FINE SCALE MAPPING ANDREW MORRIS Wellcome Trust Centre for Human Genetics March 7, 2003.
Lecture 24: Quantitative Traits IV Date: 11/14/02  Sources of genetic variation additive dominance epistatic.
What’s the Difference? Genetic and Common Diseases.
6.6 Meiosis and Genetic Variation Independent assortment and crossing over during meiosis result in genetic diversity.
Www. geocities.com/ResearchTriangle/Forum/4463/anigenetics.gif.
P.M. VanRaden and D.M. Bickhart Animal Genomics and Improvement Laboratory, Agricultural Research Service, USDA, Beltsville, MD, USA
An quick overview of human genetic linkage analysis Terry Speed Genetics & Bioinformatics, WEHI Statistics, UCB NWO/IOP Genomics Winterschool Mathematics.
Computational Biology and Genomics at Boston College Biology Gabor T. Marth Department of Biology, Boston College
Chapter 22 - Quantitative genetics: Traits with a continuous distribution of phenotypes are called continuous traits (e.g., height, weight, growth rate,
Genetics of Gene Expression BIOS Statistics for Systems Biology Spring 2008.
Sub 1 유전자를 벼에 넣었을 때의 효과 (127일을 40초 동안 압축).
1 Finding disease genes: A challenge for Medicine, Mathematics and Computer Science Andrew Collins, Professor of Genetic Epidemiology and Bioinformatics.
Difference between a monohybrid cross and a dihybrid cross
MULTIPLE GENES AND QUANTITATIVE TRAITS
Statistical Applications in Biology and Genetics
Harnessing the Power of Condor for Human Genetics
Mendel & the Gene Idea.
Quantitative traits Lecture 13 By Ms. Shumaila Azam
Quantitative genetics
Gene mapping in mice Karl W Broman Department of Biostatistics
Recombination (Crossing Over)
Jeopardy Genes and Chromosomes
MULTIPLE GENES AND QUANTITATIVE TRAITS
what are autosomes? What are sex chromosomes?
Linking Genetic Variation to Important Phenotypes
Meiosis and Sexual Life Cycles
Mendelian Inheritance
Linkage analysis and genetic mapping
Lecture # 6 Date _________
Chapter 7 Beyond alleles: Quantitative Genetics
10.2 Inheritance Skills: Calculation of the predicted genotypic and phenotypic ratio of offspring of dihybrid crosses involving unlinked autosomal genes.
Modes of selection.
Presentation transcript:

GridQTL High Performance QTL analysis via the Grid/Cloud

GridQTL BBSRC funded 5 Years initially, then 3 years (£1.5M, then £750K) Part of "Integrative Biology" vision - "Allow prediction from gene sequence to consequence” Institute of Evolutionary Biology (IEB), Edinburgh University Roslin Institute, Edinburgh National e-Science Centre, Edinburgh EPCC, Edinburgh Information Services, Edinburgh University

QTLs Quantitative Trait Loci Positions along a chromosome that have an influence on a continuously varying physical trait. Traits (Phenotypes) Weight, Height, Eye Colour, Hypertension, Cancer... Influenced by many loci and environmental factors - "Multifactorial". NOT looking for single position effects 70% Cystic Fibrosis cases. Huntington's Disease.

Genomic Data Look at structure of chromosome pair. Discover positions that differ from norm. Locate alleles SNPs. Deletions. Insertions.

Phenotypic Data Keep record of the trait for each sample. Roslin Institute uses Pigs. Easy to create pedigrees. Similar genome to humans. Many studies in short time.

Statistical Process Genetic information mixed during reproduction (Meiosis). Positions close on chromosome tend to be crossed together. A statistical process that needs mathematical modelling.

QTLs - Calculation Genomic Data Known markers on chromosomes or other regions. determine alleles (variants) of these markers. Phenotypic Data variation of trait data over pedigrees recorded. Pedigree Data Build up pedigrees to model inheritance of chosen markers and their variants. which pedigree can best identify QTLs?

QTLExpress 2001 Web tool using Java servlets evolved from Fortran applications Simple statistical models employed. Data sets of size KBytes Running time minutes on 2GHz Pentium

GridQTL Ramp up in Data Size and Processing Time Data sets MBytes Processing times hours/days on 2GHz Intel Pentium More users expected More advanced models. Variance, principal, independent components analyses. Bayesian statistics. Random Walk Monte Carlo (MCMC). So more computing resources Clusters - UK National Grid Service, ECDF HPC - investigate parallelism and optimisation of algorithms. Hector

Complex QTL models Need more complex models that need more data so that: Effect of QTL interactions can be modelled. Epistasis - how genes interact Effect of QTL on more than one trait. Pleiotropy Managing data from DNA chips (many markers and traits at once. eQTL Fine mapping of QTL loci. Linkage Disequilibrium (LD) Variance Component Analysis (VCA)

GridQTL Local machine– tomcat web server Portal Technologies – GridSphere Grid – NGS and ECDF Grid middleware (globus) Now qsub Digital Certificates - authentication Now ssh key pair

EPCC Sub contract programming work General system programming Queuing system for local and grid jobs Portal work Memory and parallel issues Cloud work

Usage Released Autumn users use portal in a month. 40 analyses/day local server. 4 cpu hours/day local server. 50 analyses/day Grid. 40 cpu hours/day on Grid. 500 users and 70 citations summer 2012.

Demo

User Count

Analyses & CPU count

User Studies User Projects Sheep – birth weight, milk & fleece quality Cattle, Sheep, Pigs & Chickens – growth, quality Horses – airway obstructions for racehorses Fish harvest traits Crocodiles – scale quality Eucalyptus Trees – wood quality Mouse – obesity Foxes - domesticity

CloudQTL Solution to long term sustainability of service. No infrastructure cost. Guaranteed analyse in time. Pay as you go model. Google, Microsoft, Amazon offer routes. Amazon preferred. EPCC route to ECDF