Aimee Stewart (KU) Nadya Williams (UCSD)

Slides:



Advertisements
Similar presentations
Virtualizing Lifemapper for PRAGMA: Step 2 - The Computational Tier By Aimee Stewart, Cindy Zheng, Phil Papadopoulos, C.J. Grady University of Kansas Biodiversity.
Advertisements

System Center 2012 R2 Overview
Using Specimen Data in Scientific Workflow Environments to Connect to Metadata Archive and Discovery Services in Environmental Biology CJ Grady, J.H. Beach,
Lifemapper Provenance Virtualization
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
DevOps and Private Cloud Automation 23 April 2015 Hal Clark.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Astrophysics, Biology, Climate, Combustion, Fusion, Nanoscience Working Group on Simulation-Driven Applications 10 CS, 10 Sim, 1 VR.
IT Administrator Lifecycle Lifecycle Services Dashboard & CustomerSource Roles Developer Business Analyst Information Tools/Service s Project.
Lower costs and improve predictability Automation Enable service owners to focus on work that adds business value Reduce error-prone manual activities.
Integrate into existing systems with PowerShell integration modules Extend by building PS modules to enable integrating into other systems Optimize.
BizTalk Deployment using Visual Studio Release Management
Agenda Overcome flat budgets Coping with relentless growth Meeting increasing business demands Managing escalating complexity Maintaining service levels.
WMU GNL Automation How to make my IT life easier CHRISTOPHER KEYAERT CONSULTANT AT INOVATIV CLOUD AND DATACENTER MANAGEMENT MVP.
© 2006, Cognizant Technology Solutions. All Rights Reserved. The information contained herein is subject to change without notice. Automation – How to.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
Web-Based Tool and Why Cross Platform Support Multi-User No special software to install… just a browser Offload real work to server No worrying about versions.
StudioSysAdmins 2 nd Annual SIGGRAPH Birds-of-a-Feather John Hickson - 08/09/2011 StudioSysAdmins 2 nd Annual SIGGRAPH Birds-of-a-Feather John Hickson.
©2013 Lavastorm Analytics. All rights reserved.1 Lavastorm Analytics Engine 5.0 New Feature Overview.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
EC Grant Agreement no GEOSS Interoperability for Weather Ocean and Water Enhancing the GEOSS Infrastructure for all the Stakeholders.
Bridging Species Niche Modeling and Multispecies Ecological Modeling and Analysis Jeffery Cavner, J.H. Beach, Aimee Stewart, CJ Grady
Specify Software Project – Quick Facts
Cluster Reliability Project ISIS Vanderbilt University.
material assembled from the web pages at
Geospatial Technical Support Module 2 California Department of Water Resources Geospatial Technical Support Module 2 Architecture overview and Data Promotion.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
DORII Joint Research Activities DORII Joint Research Activities Status and Progress 6 th All-Hands-Meeting (AHM) Alexey Cheptsov on.
COMS E Cloud Computing and Data Center Networking Sambit Sahu
We have developed a GUI-based user interface for Chandra data processing automation, data quality evaluation, and control of the system. This system, known.
1 Geospatial and Business Intelligence Jean-Sébastien Turcotte Executive VP San Francisco - April 2007 Streamlining web mapping applications.
IBM Express Runtime © 2007 IBM Corporation 1 Cognos needed to supply customers with additional choices and complete flexibility as they design and deploy.
SONIC-3: Creating Large Scale Installations & Deployments Andrew S. Neumann Principal Engineer, Progress Sonic.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EVGM081 Multi-Site Virtual Cluster: A User-Oriented, Distributed Deployment and Management Mechanism for Grid Computing Environments Takahiro Hirofuchi,
A Personal Cloud Controller Yuan Luo School of Informatics and Computing, Indiana University Bloomington, USA PRAGMA 26 Workshop.
Microsoft’s System Center
SONIC-3: Creating Large Scale Installations & Deployments Andrew S. Neumann Principal Engineer Progress Sonic.
SQL Server 2008 R2 Manageability. Challenges facing database administrators today: Scaling management to multiple data centers Proactively monitoring.
Biodiversity Data Exchange Using PRAGMA Cloud Umashanthi Pavalanathan, Aimee Stewart, Reed Beaman, Shahir Shamsir C. J. Grady, Beth Plale Mount Kinabalu.
Managing and Monitoring the Microsoft Application Platform Damir Bersinic Ruth Morton IT Pro Advisor Microsoft Canada
Lifemapper II: Finding the Good Life Aimee Stewart James H. Beach, C.J. Grady, David A. Vieglias Biodiversity Institute, KU.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
EGEE is a project funded by the European Union under contract IST Installation and configuration of gLite services Robert Harakaly, CERN,
PRAGMA 25 Working Group Updates Resources Working Group Yoshio Tanaka (AIST) Phil Papadopoulos (UCSD) Most slides by courtesy of Peter, Nadya, Luca, and.
Aimee Stewart (KU) Nadya Williams (UCSD) 1.
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
Software tools for digital LLRF system integration at CERN 04/11/2015 LLRF15, Software tools2 Andy Butterworth Tom Levens, Andrey Pashnin, Anthony Rey.
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
DECTRIS Ltd Baden-Daettwil Switzerland Continuous Integration and Automatic Testing for the FLUKA release using Jenkins (and Docker)
Microsoft Virtual Academy. Microsoft Virtual Academy First HalfSecond Half (01) Introduction to Microsoft Virtualization(05) Hyper-V Management (02) Hyper-V.
MAUS Status A. Dobbs CM43 29 th October Contents MAUS Overview Infrastructure Geometry and CDB Detector Updates CKOV EMR KL TOF Tracker Global Tracking.
Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach.
Microsoft SharePoint Server 2016
Expanding and Scaling Lifemapper Computations Using CCTools
Performance Testing Methodology for Cloud Based Applications
StratusLab Sustainability
XSEDE’s Campus Bridging Project
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
DAT381 Team Development with SQL Server 2005
JOINED AT THE HIP: DEVSECOPS AND CLOUD-BASED ASSETS
Configuration management suite
Introduction to Portal for ArcGIS
DBOS DecisionBrain Optimization Server
PyWBEM Python WBEM Client: Overview #2
What is UiPATH? For more details visit this link online-training.
Presentation transcript:

Aimee Stewart (KU) Nadya Williams (UCSD)

Lifemapper Data library – Climate Observed IPCC Predicted Future Climate – Species Occurrence Points Potential habitat maps Tools – LmSDM: Species Distribution Modeling – LmRAD: Range and Diversity GBIF

LmSDM: Species Distribution Modeling Species Occurrence Data Potential Habitat Environmental Data

LmRAD: Range and Diversity Species Habitat Data Presence Absence Matrix (PAM) Range and Diversity Quantifications Multi-species analyses

External Clients QGIS, Browser, Python Client Visualize, Explore, Analyze LmDbServer Pipeline Data updater Job tracker Climate Predicted habitat PAM assembly Maps & Models LmWebServer Website REST Web Services Submit job Request data Post result LmCompute Actual/Virtual cluster LmSDM LmRAD LmCompute Actual/Virtual cluster LmSDM LmRAD LmCompute Actual/Virtual cluster LmSDM LmRAD Species KU SDSC UF

Increase availability and flexibility of Lifemapper Server as a complete system – reduce cost of installing/configuring/replicating and ease burden of integrating hardware and software Enable a fast “workflow” from software update to server availability: – Minimize time spent on software build and configuration – Automate most hands-on tasks. – Essential: have test cases for all installed components and their configuration Prepare for greater quantity and quality of data and complexity of operations – From low resolution climate data to high resolution satellite imagery for Mt. Kinabalu – From simple single-species SDM experiments to multi- species macro-ecology experiments with more species This work is a part of PRAGMA’s “Resources and Data” working group Build production server Development Production Lifemapper Rocks Pragmagrid GitHub

Manual hands-on tasks: Software packages build Custom scripts User work-around Repetitive tedious functions Tracking errors, exceptions and problems Automation is no longer nice to have it is a must have that allows to: Know your system real-time status: what is installed, what version, configuration, data population Robust system: can reliably build and rebuild Address complexity of configuring – no more manual settings What issues need to be documented? Facilitate easy-to-use solution to provide seamless integration of hw/sw Allow to virtualize infrastructure to improve hw utilization and scalability Enable operational efficiency and flexibility: deployment and operation of virtual servers Enables to clone multiple servers Time

Lifemapper LM code Hdf44/hdf5 Subversion Cmake Byacc Libraries Gdal Geos Mod_python SpatialIndex Tiff LM data Dependencies Postgresql Pgdg repo Server Client Devel Contrib Openssl Postgis Geos Proj Pgbouncer Python modules Cheetah Cherrypy Cython Psycopg2 Pylucene Rtree Mapserver Elgis repo Vera fonts Fribidi Total: 56 RPMS Configuration Pgbouncer Postgresql Postgis LM components

Scripts for hands-on tasks (previously done only once): Software build Data population Configuration Monitoring Troubleshooting For build and post-install stages for installed components and their configuration during the. Datasets to emulate application’s run-time workflow Large changes in the build process Differences in dependencies and configuration between Linux flavors: Ubuntu vs. CentOS Simplify, harden and streamline code What are SW installation defaults and configuration Application “minimal resources” requirements (i.e. what memory/disk/network/other is needed for application to work) changed with different hardware and data Can not make any assumptions about the system Software refactoring Need automation Need unit tests

Data integration – generalize more to use heterogeneous datasets from different sources Cloud approach – Extend and leverage with PCC and PRAGMA_Boot Virtualized environment testing – Storage system – Performance estimates – What I/O volume can we handle without degraded performance? – Check for “storage I/O blender effect” Need to define requirements for – performance – data storage – data management Need to “translate” application/data requirements when application is moved from physical to virtual servers – Is there a bottleneck? – What is needed for IO-intensive application (database)?

Acknowledgements This work is funded in part by National Science Foundation and NASA grants PRAGMA US NSF Lifemapper US NSF EPSCoR US NSF EPSCoR US NSF EHR/DRL US NSF BIO/DBI US NSF OCI/CI-TEAM US NASA NNX12AF45A Rocks US NSF OCI US NSF OCI

Aimee Stewart (KU) Nadya Williams (UCSD)