Installation - Plus Loretta Auvil National Center for Supercomputing Applications University of Illinois at Urbana-Champaign

Slides:



Advertisements
Similar presentations
Implementing Tableau Server in an Enterprise Environment
Advertisements

HATHI TRUST A Shared Digital Repository Delivering Data For New Generations of Research Strategies and Challenges Jeremy York NISO/BISG Forum ALA 2010.
University of Illinois Visualizing Text Loretta Auvil UIUC February 25, 2011.
Goals Give you a feeling of what Eclipse is.
1 Configuring Internet- related services (April 22, 2015) © Abdou Illia, Spring 2015.
University of Illinois Role of Mashups, Cloud Computing, and Parallelism for Visual Analytics Loretta Auvil.
SEASR Overview Loretta Auvil, Boris Capitanu National Center for Supercomputing Applications University of Illinois at Urbana-Champaign
VMware vCenter Server Module 4.
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
Understanding and Managing WebSphere V5
Platform as a Service (PaaS)
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
Amazon EC2 Quick Start adapted from EC2_GetStarted.html.
Integrating HADOOP with Eclipse on a Virtual Machine Moheeb Alwarsh January 26, 2012 Kent State University.
SEEM4570: XAMPP, Eclipse, Summary of Html Kangfei Zhao Room 711,ERB
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
OM. Brad Gall Senior Consultant

Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
C Copyright © 2009, Oracle. All rights reserved. Appendix C: Service-Oriented Architectures.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation SEASR Overview Loretta Auvil and Bernie Acs National.
PrimoGENI Tutorial Miguel Erazo, Neil Goldman, Nathanael Van Vorst, and Jason Liu Florida International University Other project participants: Julio Ibarra.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
The Pipeline Processing Framework LSST Applications Meeting IPAC Feb. 19, 2008 Raymond Plante National Center for Supercomputing Applications.
PHP With Oracle 11g XE By Shyam Gurram Eastern Illinois University.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
SURENDER SARA 10GAS Building Corporate KPI’s
SEASR Applications and Future Work University of Illinois at Urbana-Champaign.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Installation and Development Tools National Center for Supercomputing Applications University of Illinois at Urbana-Champaign The SEASR project and its.
SEASR Analytics for Zotero Loretta Auvil Automated Learning Group Data-Intensive Technologies and Applications, National Center for.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Module 10 Administering and Configuring SharePoint Search.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Meandre Workbench National Center for Supercomputing Applications University of Illinois at Urbana-Champaign.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 14 Database Connectivity and Web Technologies.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The SEASR project and its Meandre infrastructure are sponsored by The Andrew W. Mellon Foundation Meandre Workbench National Center for Supercomputing.
Microsoft Management Seminar Series SMS 2003 Change Management.
Tools and Deployment University of Illinois at Urbana-Champaign.
CMPE 226 Database Systems October 28 Class Meeting
Windows Azure. Azure Application platform for the public cloud. Windows Azure is an operating system You can: – build a web application that runs.
Paperless Timesheet Management Project Anant Pednekar.
Module 9 User Profiles and Social Networking. Module Overview Configuring User Profiles Implementing SharePoint 2010 Social Networking Features.
SEASR Overview Loretta Auvil, Boris Capitanu University of Illinois at Urbana-Champaign
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
Creating Zotero Flows Data-Intensive Technologies and Applications, National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Software Architecture in Practice Mandatory project in performance engineering.
Managing deployment and activation of Web Applications in a distributed e-Infrastructure EGI Technical Forum September 2011 Lyon
Selenium server By, Kartikeya Rastogi Mayur Sapre Mosheca. R
Test Automation Using Selenium Presented by: Shambo Ghosh Ankit Sachan Samapti Sinhamahapatra Akshay Kotawala.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
V7 Foundation Series Vignette Education Services.
Aaron Corso COSC Spring What is LAMP?  A ‘solution stack’, or package of an OS and software consisting of:  Linux  Apache  MySQL  PHP.
 Project Team: Suzana Vaserman David Fleish Moran Zafir Tzvika Stein  Academic adviser: Dr. Mayer Goldberg  Technical adviser: Mr. Guy Wiener.
Configuring SQL Server for a successful SharePoint Server Deployment Haaron Gonzalez Solution Architect & Consultant Microsoft MVP SharePoint Server
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
Platform as a Service (PaaS)
Nithyamoorthy S Core Mind Technologies
Deployment of Flows Loretta Auvil
Platform as a Service (PaaS)
Platform as a Service (PaaS)
SEASR & Meandre for Second Generation Digital Libraries
Installation - Plus Loretta Auvil
Module 01 ETICS Overview ETICS Online Tutorials
Intro to PHP.
Distributing META-pipe on ELIXIR compute resources
Web Application Development Using PHP
Presentation transcript:

Installation - Plus Loretta Auvil National Center for Supercomputing Applications University of Illinois at Urbana-Champaign

Outline Installation Meandre servers and clusters Development Tools: Eclipse Plugin Hands-On

Considerations Do you want to use SEASR-powered services? –May not need to install anything (besides a browser) Do you want to run analytics on your laptop? –Quick 3 step process Do you want to provide SEASR-powered services? –Start simple –Scale as needed Deploying all the extra goodies

Using SEASR-Powered Services SEASR provides some demo services Requires a browser You can access them from –Community Hub to execute a flow –Meandre Server Client to execute a flow; or tune properties and execute a flow Hosted at –Meandre Workbench to execute a flow; or tune properties and execute a flow; or create a flow Hosted at –Zotero to analyze your collections with existing flows

I Need To Run SEASR on my laptop I want to run on my laptop (server) –I have copyrighted information –I have collection for analysis that is too big to be moved –I just want to test it and have fun with it Getting a Meandre server up and running in 3 steps 1.Install Java 2.Download the Meandre server jar into a new directory Use the “Start-Infrastructure” or type “java –jar meandre-server jar” Access your new installation at –

Specialized Downloadable Bundles On the SEASR/Meandre download site – Installation bundles available for: –Mac OS –Linux –Windows Bundles contain: –Zip file that includes executable files –Set of demo components and flows Requires Java (1.5 or greater) to be installed

Bundles Include The bundle comes with –Meandre Server –ZigZag console/compiler/runtime –Meandre Workbench (also provided as a war file) Provides simple scripts to –Start/stop the Meandre server –Start/stop the Meandre Workbench

What About Setting Up My Own Server? You can also deploy the bundles on a server using the same approach.

Customization of My Server This will support –Moderated traffic ??we don’t know what this means?? –Persistent web services can be provided using this server Application Server –Workbench can be deployed alongside the Meadre Server in the embedded Jetty Application Server –Workbench can be deployed using your favorite application server using the.war file Database Options –Meandre uses an embedded Derby as the database –Meandre can be also be setup to use Mysql for the database

Backend Using Derby Database default meandre-config-store.xml org.apache.derby.jdbc.EmbeddedDriver Derby jdbc:derby:./MeandreStore;create=true;logDevice=./DerbyLog

Backend Using MySQL meandre-config-store.xml to USERNAME com.mysql.jdbc.Driver MySQL PASSWORD Changing from Derby to MySQL –Stop the server –Change the meandre-config-store.xml file –Restart the server –Now your server is backend on MySQL

Scaling Up Two possible routes –Deploy a farm of self-contained services (via zigzag) –Use the Meandre Cluster solution Both require your sysadmin/netadmin to provide a highly available load balancer (some virtual appliances available) To create a cluster –Use the previous MySQL set up –Point all the servers to the same database –The server interface pages will allow you to monitor of the servers

Installing The Workbench Use the installation bundles Use the war file –Install your favorite application server –Deploy the war file against the application server

Installing the Community Hub The community hub is a Wordpress plugin Allows to point to a Meandre server Makes all the flows available for execution Pages and posts can add the tag –[meandre-desc SERVER_REPOSITORY_URL FLOW_URI] –E.g. [meandre-desc Renders the description of the flow information and provides a simple execute button to allow visitors to run the flow Deploy the zip file into Wordpress plugins directory

Eclipse Plugin for Developers On the SEASR/Meandre download site – Steps for installation Exit Eclipse Download zip file into Eclipse/dropins directory Unzip file Restart Eclipse

Developers: Eclipse Plugin Uploads components to the Meandre Server Lists components installed Allows for removal of components Shows additional data of interest to a programmer

Meandre: The Architecture The design of the Meandre architecture follows three directives: –provide a robust and transparent scalable solution from a laptop to large-scale clusters –create an unified solution for batch and interactive tasks –encourage reusing and sharing components To ensure such goals, the designed architecture relies on four stacked layers and builds on top of service-oriented architectures (SOA)

Meandre: Basic Single Server

Meandre MDX: Cloud Computing Servers can be –instantiated on demand –disposed when done or on demand A cluster is formed by at least one server The Meandre Distributed Exchange (MDX) –Orchestrates operational integrity by managing cluster configuration and membership using a shared database resource.

Meandre MDX: The Picture MDX Backbone

Meandre MDX: The Architecture Virtualization infrastructure –Provide a uniform access to the underlying execution environment. It relies on virtualization of machines and the usage of Java for hardware abstraction. IO standardization –A unified layer provides access to shared data stores, distributed file-system, specialized metadata stores, and access to other service-oriented architecture gateways.

Meandre MDX: The Architecture Data-intensive flow infrastructure –Provide the basic Meandre execution engine for data-intensive flows, component repositories and discovery mechanisms, extensible plugins and web user interfaces (webUIs). Interaction layer –Can provide self-contained applications via webUIs, create plugins for third-party services, interact with the embedding application that relies on the Meandre engine, or provide services to the cloud.

Meandre: ZigZag Script Language ZigZag is a simple language for describing data- intensive flows – Modeled on Python for simplicity. –ZigZag is declarative language for expressing the directed graphs that describe flows. Command-line tools allow ZigZag files to compile and execute. –A compiler is provided to transform a ZigZag program (.zz) into Meandre archive unit (.mau). –Mau(s) can then be executed by a Meandre engine.

Meandre: ZigZag Script Language As an example the Flow Diagram –The flow below pushes two strings that get concatenated and printed to the console –

ZigZag code that represents example flow: Meandre: ZigZag Script Language # # Imports the three required components and creates the component aliases # import alias as PUSH alias as CONCAT alias as PRINT # # Creates four instances for the flow # push_hello, push_world, concat, print = PUSH(), PUSH(), CONCAT(), PRINT() # # Sets up the properties of the instances # push_hello.message, push_world.message = "Hello ", "world!" # # Describes the data-intensive = push_hello(), = concat( string_one: phres.string; string_two: pwres.string ) print( object: cres.concatenated_string ) #

Meandre: ZigZag Script Language # # Describes the data-intensive flow = = pass( string:pu.string ) print( object:pt.string ) Automatic Parallelization –Multiple instances of a component could be run in parallel to boost throughput. – Specialized operator available in ZigZag Scripting to cause multiple instances of a given component to used Consider a simple flow example show in the diagram The dataflow declaration would look like

Automatic Parallelization –Adding the operator [+AUTO] to middle component –[+AUTO] tells the ZigZag compiler to parallelize the “pass component instance” by the number of cores available on system. –[+AUTO] may also be written [+N] where N is an numeric value to use for example [+10]. Meandre: ZigZag Script Language # Describes the data-intensive flow = = pass( string:pu.string ) [+AUTO] print( object:pt.string )

Automatic Parallelization –Adding the operator [+4] would result in a directed grap Meandre: ZigZag Script Language # Describes the data-intensive flow = = pass( string:pu.string ) [+4] print( object:pt.string ) # Describes the data-intensive flow = = pass( string:pu.string ) [+4!] print( object:pt.string )

Scaling Genetic Algorithms with Meandre Intel 2.8Ghz QuadCore, 4Gb RAM. Average of 20 runs.

And Beyond with Hadoop 60 Dual Quad Core Xeons with 8GB RAM. GB Ethernet Resources exhaustion

Meandre: Flows to MAU Flows can be executed using their RDF descriptors Flows can be compiled into MAU MAU is: –Self-contained representation –Ready for execution –Portable –The base of flow execution in grid environments

Demonstration Installation of Meandre Meandre Eclipse Plugin JIRA, Confluence, Bamboo - what they are and what we use them for Usage of ZigZag –Compiling and executing flows using ZigZag –Usage of ZigZag for Zotero-enabled flows –Usage of ZigZag for Fedora flows

Learning Exercises Open an existing ZigZag flow Convert your flow from yesterday to ZigZag Compile the script Execute the script Have participants download and install SEASR on their personal computers Have participants sign up for accounts to access the SEASR suite of Atlassian tools Use JIRA to log a support request

Discussion Questions What challenges (if any) would scholars have installing the SEASR software? Do you see your institution's IT department running the SEASR environment or would it be your research group? Which environment would you most likely use, the Meandre Workbench or the ZigZag scripting language?