Www.eu-eela.org E-science grid facility for Europe and Latin America E2GRIS1 Rolando Navarro Jara Omar Palomino Huamaní International Potato Center Itacuruça.

Slides:



Advertisements
Similar presentations
Building Portals to access Grid Middleware National Technical University of Athens Konstantinos Dolkas, On behalf of Andreas Menychtas.
Advertisements

ISecurity GUI User-Friendly Interface. Features Full support of all green-screen functionality Simultaneous views of multiple iSecurity screens and activities.
SPAGeDi a program for Spatial Pattern Analysis of Genetic Diversity
Genetic Heterogeneity Taken from: Advanced Topics in Linkage Analysis. Ch. 27 Presented by: Natalie Aizenberg Assaf Chen.
Evaluation of a new tool for use in association mapping Structure Reinhard Simon, 2002/10/29.
SALSA HPC Group School of Informatics and Computing Indiana University.
E-science grid facility for Europe and Latin America E2GRIS1 Jaime Parada, Edgar Perdomo – UCV Itacuruça (Brazil), 2-15 November 2008 CATIVIC.
Admixture Mapping Qunyuan Zhang Division of Statistical Genomics GEMS Course M Computational Statistical Genetics Computational Statistical Genetics.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
A pilot application 12/9/2008Microsoft eScience Workshop 2008 Robert Bukowski and Jarek Pillardy Computational Biology Service Unit Cornell University.
Sun Grid Engine Grid Computing Assignment – Fall 2005 James Ruff Senior Department of Mathematics and Computer Science Western Carolina University.
High Performance Computing (HPC) at Center for Information Communication and Technology in UTM.
Genetic Diversity of the Phaseolus acutifolius A. Gray Collection of the USDA National Plant Germplasm System Using Targeted Region Amplified Polymorphism.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
Polymorphism and Variant Analysis Lab
E-science grid facility for Europe and Latin America WAM Final Report Yassine LASSOUED & Ali Al Othman Coastal and Marine Resources Centre.
Tools and Utilities for parallel and serial codes in ENEA-GRID environment CRESCO Project: Salvatore Raia SubProject I.2 C.R. ENEA-Portici. 11/12/2007.
LOGO Scheduling system for distributed MPD data processing Gertsenberger K. V. Joint Institute for Nuclear Research, Dubna.
E-science grid facility for Europe and Latin America gRREEMM Status Report-3 Nov 13, 2008 E2GRIS1 Alina Roig Rassi Maikel Dominguez Garcia.
E-science grid facility for Europe and Latin America Installation and configuration of a top BDII Gianni M. Ricciardi – Consorzio COMETA.
E-science grid facility for Europe and Latin America OurGrid E2GRIS1 Rafael Silva Universidade Federal de Campina.
E-science grid facility for Europe and Latin America Watchdog: A job monitoring solution inside the EELA-2 Infrastructure Riccardo Bruno,
Polymorphism & Variant Analysis Lab Saurabh Sinha Polymorphism and Variant Analysis Lab v1 | Saurabh Sinha 1 Powerpoint by Casey Hanson.
E-science grid facility for Europe and Latin America Marcelo Risk y Juan Francisco García Eijó Laboratorio de Sistemas Complejos Departamento.
EZYFLO. Aim of EZYFLO To draw simple flowcharts To reduce the memory size of the flowchart To create a software which runs in DOS environment also.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
E-science grid facility for Europe and Latin America E2GRIS1 Raúl Priego Martínez – CETA-CIEMAT (Spain)‏ Itacuruça (Brazil), 2-15 November.
E-science grid facility for Europe and Latin America E2GRIS1 André A. S. T. Ribeiro – UFRJ (Brazil) Itacuruça (Brazil), 2-15 November 2008.
MATRIX MULTIPLY WITH DRYAD B649 Course Project Introduction.
Resource Brokering in the PROGRESS Project Juliusz Pukacki Grid Resource Management Workshop, October 2003.
MStruct: A New Admixture Model for Inference of Population Structure in Light of Both Genetic Admixing and Allele Mutations Suyash Shringarpure and Eric.
1 Catania, 4 th EEGE User Forum/OGF 25, OurGrid integration with gLite based grids in EELA-2 Francisco Brasileiro Universidade.
E-science grid facility for Europe and Latin America E2GRIS1 Claudio Baeza Retamal and Rodrigo Delgado Urzúa SAEMC Project (
E-science grid facility for Europe and Latin America E2GRIS1 Alina Roig Rassi Maikel Dominguez Garcia CUBAENERGIA Itacuruça (Brazil), 2-15.
E-science grid facility for Europe and Latin America E2GRIS1 Gustavo Miranda Teixeira Ricardo Silva Campos Laboratório de Fisiologia Computacional.
E-science grid facility for Europe and Latin America Bridging the High Performance Computing Gap with OurGrid Francisco Brasileiro Universidade.
E-science grid facility for Europe and Latin America GridwWin: porting gLite to run under Windows Fabio Scibilia – Consorzio COMETA 30/06/2008.
LOGO Development of the distributed computing system for the MPD at the NICA collider, analytical estimations Mathematical Modeling and Computational Physics.
Lab 7. Estimating Population Structure. Goals 1.Estimate and interpret statistics (AMOVA + Bayesian) that characterize population structure. 2.Demonstrate.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Worker Node installation & configuration.
Youngil Kim Awalin Sopan Sonia Ng Zeng.  Introduction  Concept of the Project  System architecture  Implementation – HDFS  Implementation – System.
California Pacific Medical Center
CPSC 171 Introduction to Computer Science System Software and Virtual Machines.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Enabling the use of e-Infrastructures with.
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid2Win: Porting of gLite middleware to.
Populations: defining and identifying. Two major paradigms for defining populations Ecological paradigm A group of individuals of the same species that.
Lab 7. Estimating Population Structure
Copyright © 2012, SAS Institute Inc. All rights reserved. SAS ® GRID AT PHAC SAS OTTAWA PLATFORM USERS SOCIETY, NOVEMBER 2012.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Cuban Grid for e-Learning CuGfL FINAL REPORT.
1 Circuitscape Capstone Presentation Team Circuitscape Katie Rankin Mike Schulte Carl Reniker Sean Collins.
BIO1130 LAB 4 MICROEVOLUTION. Objectives of the lab: Understand various concepts of microevolution using simulated populations: Allelic and genotypic.
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
Introduction to Computer Programming Concepts M. Uyguroğlu R. Uyguroğlu.
Mainframe – Control-M Architecture.
Genetic mapping and QTL analysis - JoinMap and QTLNetwork -
Tracking, Computing & other Stuff. Correlation of detector hits The track segments of inner and outer MDCs are matched on Cluster level The track segments.
E-science grid facility for Europe and Latin America gRREEMM Report-1 Nov 7, 2008 E2GRIS1 Alina Roig Rassi Maikel Dominguez Garcia CUBAENERGIA.
PRISM: PROCESSING AND REVIEW INTERFACE FOR STRONG MOTION DATA SOFTWARE
CRESCO Project: Salvatore Raia
CMU Access via Launch Cluster Management Utility GUI.
Bruce Pullig Solution Architect
Linux: A Product of the Internet
Basic concepts on population genetics
Bruce Pullig Solution Architect
Extensive admixture in Brazilian sickle cell patients: implications for the mapping of genetic modifiers by Maria Clara F. da Silva, Luciana W. Zuccherato,
Linkage analysis and genetic mapping
Genotyping Results Each person was typed for 3 unlinked Short Tandem Repeat loci (STR) vWFII – chromosome 12, intron 40 of the vWF gene UT 2203 – chromosome.
Presentation transcript:

E-science grid facility for Europe and Latin America E2GRIS1 Rolando Navarro Jara Omar Palomino Huamaní International Potato Center Itacuruça (Brazil), 2-15 November 2008 GCP HPC Structure

E-science grid facility for Europe and Latin America OVERVIEW –What is CIP? –CGIAR – Consultative Group on International Agricultural Research –GCP - Generation Challenge Programme Subprogram 4 – Bioinformatics

E-science grid facility for Europe and Latin America Structure STRUCTURE software: - Structure is a free software to genetics analysis developed by Pritchard, Stephens & Donnelly (2000). - From website: “structure is a free software package for using multi-locus genotype data to investigate population structure. Its uses include inferring the presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed.” - Structure command line is easy to install upon Linux – Windows – Sun Operative Systems.

E-science grid facility for Europe and Latin America How does it work?

E-science grid facility for Europe and Latin America INPUT file: The input file is the data that will be processed by Structure. It has the following format: Number of Individuals: 3 Number of locus: 10 Missing input: -9 Ploid: 4

E-science grid facility for Europe and Latin America Mainparams(1): #define INFILE testdata1 // (str) name of input data file #define OUTFILE results //(str) name of output data file #define NUMINDS 3 // (int) number of diploid individuals in data file #define NUMLOCI 10 // (int) number of loci in data file #define LABEL 1 // (B) Input file contains individual labels #define POPDATA 1 // (B) Input file contains a population identifier #define POPFLAG 0 // (B) Input file contains a flag which says whether to use popinfo when USEPOPINFO==1 #define PHENOTYPE 1 // (B) Input file contains phenotype information #define EXTRACOLS 0 // (int) Number of additional columns of data before the genotype data start.

E-science grid facility for Europe and Latin America Mainparams(2): #define PHASEINFO 0 // (B) the data for each individual contains a line indicating phase #define MARKOVPHASE 0 // (B) the phase info follows a Markov model. #define MISSING -9 // (int) value given to missing genotype data #define PLOIDY 4 // (int) ploidy of data #define ONEROWPERIND 0 // (B) store data for individuals in a single line #define MARKERNAMES 0 // (B) data file contains row of marker names #define MAPDISTANCES 0 // (B) data file contains row of map distances // between loci Program Parameters #define MAXPOPS 2 // (int) number of populations assumed #define BURNIN 2000 // (int) length of burnin period #define NUMREPS 2000 // (int) number of MCMC reps after burnin

E-science grid facility for Europe and Latin America Extraparams #define FREQSCORR 1 // (B) allele frequencies are correlated among pops #define ONEFST 0 // (B) assume same value of Fst for all subpopulations. #define INFERALPHA 1 // (B) Infer ALPHA (the admixture parameter)‏ #define POPALPHAS 0 // (B) Individual alpha for each population #define INFERLAMBDA 0 // (B) Infer LAMBDA (the allele frequencies parameter)‏ #define POPSPECIFICLAMBDA 0 //(B) infer a separate lambda for each pop (only if INFERLAMBDA=1). #define NOADMIX 0 (B) Use no admixture model #define LINKAGE 0 // (B) Use the linkage model model #define PHASED 0 // (B) Data are in correct phase (required unless data are diploid)‏

E-science grid facility for Europe and Latin America PLATFORM LSF: - GCP HPC Structure is an application implemented to work for High Performance Computing environment using a management software (LSF) that permit run jobs in parallel way inside the cluster. - Platform LSF is software for managing and accelerating batch workload processing for compute-and data-intensive applications taking maximum advantage of modern multi-core and multi-threaded architectures with advanced new scheduling controls for both sequential and parallel jobs. - Checking LSF status: ~]$ lsload HOST_NAME status r15s r1m r15m ut pg ls it tmp swp mem hpc-cip.cgiar.o ok % M 3176M 1490M compute-0-2.cgi ok % M 1000M 3690M compute-0-1.cgi ok % M 1000M 3660M compute-0-0.cgi ok % M 996M 3644M

E-science grid facility for Europe and Latin America PLATFORM LSF and STRUCTURE

E-science grid facility for Europe and Latin America GCP HPC STRUCTURE – GUI - Developed in CIP by Luis Avila and Reinhard Simmon from Research Informatics Unit

E-science grid facility for Europe and Latin America GOALS Break GCP HPC STRUCTURE dependence from Platform LSF. Run upon GRID environment, for several populations and more than one number of run for each analysis. Getting a friendly interface to users.

Itacuruça (Brazil), E2GRIS1, – Questions … 13

Itacuruça (Brazil), E2GRIS1, – THANK YOU