Published by Loraine Rogers. Modified over 9 years ago.
1
QuakeSim Project: Portals and Web Services for Geo-Sciences
Marlon Pierce
Indiana University
mpierce@cs.indiana.edu
2
QuakeSim Project Summary
Goal is to provide a distributed environment for connecting scientific computing and data resources with Web based user interfaces.
QuakeSim’s IT development includes
  Portals for user interfaces
  Web Services for running remote applications and accessing databases
  Databases for semantic fault models and InSAR data (USC)
3
Some QuakeSim Applications and Their Data
Disloc, Simplex
  Fault models are used to calculate surface displacements (Disloc) using the Okada method. Simplex is the inverse.
  These help researchers refine their fault models from observed displacements, or model displacements associated with faults.
GeoFEST
  Finite element code for detailed modeling of fault stresses and seismic displacements; uses fault models as input.
  Coupled to mesh generation tools.
  Can (for example) calculate post- and co-seismic displacements.
RDAHMM
  Time series analysis code; can be applied to GPS and seismic archives.
  Identifies signal components (possibly associated with underlying physical causes) with no fixed parameters.
4
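[Architecture diagram: Portlets + Client Stubs talk over SOAP/HTTP to WSDL-described services on three hosts. Host 1 (QT or GRWS): DB Service, using JDBC to reach the DB. Host 2 (Comp Grid): Job Sub/Mon and File Services on top of Operating and Queuing Systems. Host 3 (GIS): Visualization or Map Service backed by DB, etc. Browser access is over HTTP(S).]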
5
Daily RDAHMM Updates Daily analysis and event classification of GPS data from REASoN’s GRWS.
6
April 21, 2006: What Happened?
7
We can also analyze real-time GPS data from the CRTN.
8
Real time state changes?
9
Disloc model of Northridge fault. Disloc used in Gerry Simila’s geophysics classes (CSUN).
10
Integrating QuakeSim and UAVSAR
July 29, 2008 M 5.4 Chino Hills Earthquake
  Used QuakeSim to model expected surface displacements from the event
  Passed on KML file to UAVSAR program/project
  Overlaid displacements with UAVSAR image
Will continue to merge projects using the Los Angeles ShakeOut in mid-November as a testbed
11
TeraGrid Supercomputing Resources (GPIR)
12
GeoFEST Finite Element Modeling portlet and plotting tools
13
QuakeSim and Web 2.0
Export all observations and computational results as KML, GeoRSS.
Use Social Networks to share projects, results, papers, proposals, etc.
  Facebook and OpenSocial have open APIs.
Use social (Google) gadgets to deliver your Web components to everyone.
Use Google’s APIs to integrate your services with Calendar, Blogspot, YouTube, etc.
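The KML export mentioned on this slide can be illustrated with a minimal sketch. It is not QuakeSim's own exporter: the function name, the station name, and the sample displacement values are hypothetical, and only the Python standard library is used. Each observation becomes a KML Placemark that Google Earth or Google Maps could load.

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def displacements_to_kml(points):
    """Build a KML document with one Placemark per (name, lon, lat, description) tuple."""
    ET.register_namespace("", KML_NS)  # serialize KML as the default namespace
    kml = ET.Element(f"{{{KML_NS}}}kml")
    doc = ET.SubElement(kml, f"{{{KML_NS}}}Document")
    for name, lon, lat, desc in points:
        placemark = ET.SubElement(doc, f"{{{KML_NS}}}Placemark")
        ET.SubElement(placemark, f"{{{KML_NS}}}name").text = name
        ET.SubElement(placemark, f"{{{KML_NS}}}description").text = desc
        point = ET.SubElement(placemark, f"{{{KML_NS}}}Point")
        # KML orders coordinates as longitude,latitude,altitude.
        ET.SubElement(point, f"{{{KML_NS}}}coordinates").text = f"{lon},{lat},0"
    return ET.tostring(kml, encoding="unicode")


# Hypothetical sample: one station with an illustrative displacement string.
sample = [("SITE1", -117.76, 33.95, "dE=1.2 mm, dN=-0.8 mm")]
kml_text = displacements_to_kml(sample)
```

Writing `kml_text` to a `.kml` file is enough for a map client to display the placemark.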
14
More Information
Email: mpierce@cs.indiana.edu
QuakeSim Web Site: www.quakesim.org
Portal URL: http://gf7.ucs.indiana.edu:8080/gridsphere
Portal SourceForge Page: https://sourceforge.net/projects/crisisgrid
Code SVN: http://crisisgrid.svn.sourceforge.net/viewvc/crisisgrid/
15
Acknowledgments
QuakeSim work is funded by NASA AIST (A. Donnellan, PI) and ACCESS (Y. Bock, PI) programs.
Indiana University developers: Galip Aydin, Xiaoming Gao, Zhigang Qi
Robert Granat (JPL), Jay Parker (JPL), Maggi Glasscoe (JPL), John Rundle (UC-Davis), Harout Nazerian (JPL), Rami Al-Ghanmi (USC), Dennis Mcleod (USC), Paul Jamason (Scripps), Ruey-Juin Chang (Scripps), Gerry Simila (CSUN)
16
Stop Talking Now, Champ
17
Enterprise Approach | Web 2.0 Approach
JSR 168 Portlets | Gadgets, Widgets
Server-side integration and processing | AJAX, client-side integration and processing, JavaScript
SOAP | RSS, Atom, JSON
WSDL | REST (GET, PUT, DELETE, POST)
Portlet Containers | Open Social Containers (Orkut, LinkedIn, Shindig); Facebook; StartPages
User Centric Gateways | Social Networking Portals
Workflow managers (Taverna, Kepler, etc) | Mash-ups
Grid computing: Globus, Condor, etc | Cloud computing: Amazon WS Suite, Xen Virtualization
18
Q: What Is Web 2.0? A: IT for everyone
Too much of “Enterprise” computing requires specialized knowledge and specialized tools.
  Result: specialization of tasks within teams like QuakeSim.
  Waste of talent: scientists can write code; they just don’t have time to waste on difficult operating environments.
What then is Web 2.0 in detail?
19
What Is a Gadget? Simple gadgets for getting a Grid proxy credential and running remote commands. Both run on my own Web server.
20
Google Reader and GeoRSS
21
Google Maps and GeoRSS
22
Google Earth and KML
23
Cloud Computing and Gateways
Cloud computing is the combination of virtualization (Xen, VMware, OpenVZ, …) with Web Services.
  Web Services control the life cycle of the virtual machines.
  The virtual machines are under the control of the application developer.
  UC-D can distribute the VC Service VM, for example.
  Examples include Amazon EC2, Eucalyptus (UCSB), and Virtual Workspace/Nimbus (UChicago).
Data clouds focus on data virtualization.
  Google’s BigTable, Facebook’s Cassandra
  Apache’s Hadoop and related projects (HBase, HDFS)
Challenges
  MPI on clouds
  Mounting high performance file systems
24
Why Would You Want a Cloud?
Application Developers: Reproducible operating environments
  Develop your application and be sure it will be deployed under the same conditions.
  Distribute reproducible results.
  Have control of your operating environment.
Move applications closer to data.
  Data replication built-in
  Assume vast amounts of cheap disk space
25
Simplex refines fault models from GPS displacements
26
UCSB’s Queue Prediction Service (QBETS)
Forecasts the time you will wait in the queue on various TG supercomputers.
Inherited from OGCE project.
27
OGCE’s XBaya Workflow Composer
28
Some Design Choices
Build portals out of portlets (Java Standard).
  Reuse capabilities from our Open Grid Computing Environments (OGCE) project, the REASoN GPS Explorer project, and many TeraGrid Science Gateways.
  Decorate with Google Maps, Yahoo UI gadgets, etc.
Use Java Server Faces to build individual component portlets.
  Build standalone tools, then convert to portlets at the very end.
Use simple Web Services for accessing codes and data.
  Keep It Stateless …
Use Condor-G and Globus job and file management services for interacting with high performance computers.
  TeraGrid
Favor Google Maps and Google Earth for their simplicity, interactivity and open APIs.
  Generate KML and GeoRSS
Use Apache Maven based build and compile system, SVN on SourceForge
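To make the "Generate KML and GeoRSS" choice concrete, here is a minimal sketch of a GeoRSS-Simple feed built on Atom, the kind of output Google Reader or Google Maps can consume. It is an illustration and not the project's service code: the function name, the entry fields, and the sample event are assumptions, and only the Python standard library is used.

```python
import xml.etree.ElementTree as ET

ATOM = "http://www.w3.org/2005/Atom"
GEORSS = "http://www.georss.org/georss"


def events_to_georss(feed_title, events):
    """Return an Atom feed string with one georss:point per event dict."""
    ET.register_namespace("", ATOM)
    ET.register_namespace("georss", GEORSS)
    feed = ET.Element(f"{{{ATOM}}}feed")
    ET.SubElement(feed, f"{{{ATOM}}}title").text = feed_title
    for ev in events:
        entry = ET.SubElement(feed, f"{{{ATOM}}}entry")
        ET.SubElement(entry, f"{{{ATOM}}}id").text = ev["id"]
        ET.SubElement(entry, f"{{{ATOM}}}title").text = ev["title"]
        ET.SubElement(entry, f"{{{ATOM}}}updated").text = ev["updated"]
        # GeoRSS-Simple orders a point as "latitude longitude".
        ET.SubElement(entry, f"{{{GEORSS}}}point").text = f'{ev["lat"]} {ev["lon"]}'
    return ET.tostring(feed, encoding="unicode")


# Hypothetical event; the id, title, and coordinates are illustrative only.
events = [{
    "id": "urn:example:event:1",
    "title": "State change detected at SITE1",
    "updated": "2008-07-29T18:42:00Z",
    "lat": 33.95,
    "lon": -117.76,
}]
feed_text = events_to_georss("Example GPS events", events)
```

Note the coordinate order: GeoRSS-Simple puts latitude first, while KML puts longitude first.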
29
QuakeSim, Version 1 | Reason to Revise | QuakeSim, Version 2

Version 1: Application Web Service for wrapping a.out executables. Execution management service built with Apache Ant.
Reason to revise: Services too coupled to portal; no simple WSDL programming interface; could not be used in workflow engines; not self contained.
Version 2: Give each code a proper service interface. Retain Apache Ant core but extend. Keep WSDL message structure simple (Strings, ints, doubles, URLs), wrapped as Java Beans.

Version 1: File Management Service.
Reason to revise: Unnecessary, too coupled to Apache Axis 1.0.
Version 2: HTTP GET, URLs.

Version 1: Context Management Service manages persistent portal sessions using recursive XML structure.
Reason to revise: Too slow (file system); didn’t scale; XML databases didn’t mature; Object-Relational Mappings (ORM) not efficient.
Version 2: Using DB40; all services communicate with easily XML serializable JavaBeans.

Version 1: OGC-compatible map and data services.
Reason to revise: Too complicated; ORM is a big overhead.
Version 2: Google Maps, KML generating services.

Version 1: Serial job submission.
Reason to revise: NSF TeraGrid and Open Science Grid run full time production Grids for HPC.
Version 2: Condor-G/Birdbath based job management extensions to GeoFEST service.
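The Version 2 rule of keeping service messages to simple types (strings, ints, doubles, URLs) in easily XML-serializable beans can be sketched as follows. The project uses JavaBeans; this is a Python analogue using a dataclass, and the class name and all field names are hypothetical.

```python
from dataclasses import dataclass, fields
import xml.etree.ElementTree as ET


@dataclass
class JobRequest:
    # Hypothetical message: only flat, simple-typed fields, no nested objects.
    project_name: str
    input_url: str
    iterations: int
    tolerance: float


def to_xml(bean):
    """Serialize a flat dataclass to XML, one child element per field."""
    root = ET.Element(type(bean).__name__)
    for f in fields(bean):
        ET.SubElement(root, f.name).text = str(getattr(bean, f.name))
    return ET.tostring(root, encoding="unicode")


def from_xml(cls, text):
    """Rebuild the dataclass, converting each element with the field's type."""
    root = ET.fromstring(text)
    return cls(**{f.name: f.type(root.findtext(f.name)) for f in fields(cls)})


request = JobRequest("demo", "http://example.org/input.dat", 100, 0.001)
xml_text = to_xml(request)
restored = from_xml(JobRequest, xml_text)
```

Because every field is a primitive, the round trip needs no custom mapping layer, which is the point of the design choice.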
30
Grid Job Submission
Globus provides a universal queuing system interface.
  PBS, LoadLeveler, Sun Grid Engine, LSF
We chose Condor-G as our job management software for submitting jobs to HPC queuing systems.
  University of Wisconsin
  Works with Globus, Matlab DCE, Unicore, etc.
We co-locate Condor-G with our GeoFEST Web Service.
  Communication is through Birdbath, Condor’s Web Service interface.
  So the GeoFEST service API is more or less the same, just now Grid enabled.
  We also plan to release a general version of this service.
Condor command line and Birdbath have different names for job description parameters.
  Big Easter Egg hunt to find this, but now we know.
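For orientation, a Condor-G job on the command-line side is described by a submit description file. The sketch below only generates such a file as text; it does not call Birdbath, whose parameter names differ as noted above. The gatekeeper host, executable, and file names are hypothetical, and the use of a GT2 (`gt2`) resource with a PBS jobmanager is an assumption, not something the slides state.

```python
def condor_g_submit_description(executable, arguments, gatekeeper,
                                jobmanager="jobmanager-pbs"):
    """Return the text of a Condor-G submit description file (command-line names)."""
    lines = [
        "universe = grid",
        # Assumption: a pre-WS Globus (GT2) gatekeeper fronting a PBS queue.
        f"grid_resource = gt2 {gatekeeper}/{jobmanager}",
        f"executable = {executable}",
        f"arguments = {arguments}",
        "output = job.out",
        "error = job.err",
        "log = job.log",
        "queue",
    ]
    return "\n".join(lines) + "\n"


# Hypothetical host and input file, for illustration only.
submit_text = condor_g_submit_description(
    executable="geofest",
    arguments="input.dat",
    gatekeeper="gatekeeper.example.org",
)
```

The resulting text would be saved to a file and passed to `condor_submit`.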
31
Portlet | Summary
RDAHMM: Set up and run RDAHMM, query Scripps GRWS GPS Service, maintain persistent user sessions.
ST_Filter: Similar to RDAHMM portlet; ST_Filter has much more input.
Station Monitor: Shows GPS stations on a Google Map, displays last 10 minutes of data.
Real Time RDAHMM: Displays RDAHMM results of last 10 minutes of GPS data in a Google map.
Daily RDAHMM: Calculates, updates RDAHMM event classifications with daily updated GPS data from SOPAC’s GRWS service (14 day delay, but uses all the data).
GeoFEST: Create input geometries, generate FE meshes, run parallel FEM solvers.
Disloc, Simplex: Calculate surface displacements from fault models.
32
Security Concerns They’ll see the Big Board!
36
QuakeSim Distributed Environment for Modeling Observations
37
Managing Real Time GPS Data Slides from Galip Aydin
39
California Real Time Network
Network Data Rates by Message Format

Time | RYO | ASCII | GML
CRTN GPS Site Positions (9 Stations)
1 second | 1.5KB | 4.03KB | 48.7KB
1 hour | 5.31MB | 14.18MB | 171.31MB
1 day | 127.44MB | 340.38MB | 4.01GB
1 month | 3.8GB | 9.97GB | 123.3GB
1 year | 45.8GB | 119.67GB | 1.41TB
Entire SCIGN Network (250 stations)
1 year | 1.23TB | 16.18TB | 160TB

Continuous GPS Stations (CGPS) are depicted as triangles while the Real-Time stations are represented as circles. Image is obtained from SOPAC GPS Explorer at http://sopac.ucsd.edu/projects/realtime
How does one manage all the data generated by the 85 stations?
How can you get just the data you want?
Note this is fundamentally different from traditional request/response style Web Services.
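The hourly and daily figures in the table follow from the per-second rates by straightforward scaling. A minimal sketch of that arithmetic, assuming 1024-based units and taking the 1-second row as input (the function names are illustrative):

```python
# Per-second message sizes in KB for the 9 CRTN stations, from the table.
KB_PER_SECOND = {"RYO": 1.5, "ASCII": 4.03, "GML": 48.7}

HOUR = 3600
DAY = 24 * HOUR


def volume_kb(rate_kb_per_s, seconds):
    """Total data volume in KB for a constant rate over a period."""
    return rate_kb_per_s * seconds


def human(kb):
    """Format a KB count using 1024-based units up to TB."""
    units = ["KB", "MB", "GB", "TB"]
    value = float(kb)
    for unit in units:
        if value < 1024 or unit == units[-1]:
            return f"{value:.2f}{unit}"
        value /= 1024


for fmt, rate in KB_PER_SECOND.items():
    print(fmt, human(volume_kb(rate, HOUR)), human(volume_kb(rate, DAY)))
```

The results land close to the table's values, with small differences because the 1-second figures are rounded. The GML column is roughly 30 times larger than RYO at every time scale, which is the storage and bandwidth cost of the XML encoding.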
40
Processing Real-Time GPS Streams
[Diagram: A Complete Sensor Message Processing Path, including a data analysis application. Components shown: GPS Networks; Scripps RTD Server; Raw Data on RYO Ports 7010, 7011, 7012; NB Server; filters ryo2nb, ryo2ascii, ascii2gml, ascii2pos; Single Station Displacement Filter; Station Health Filter; RDAHMM Filter; Single Station RDAHMM Filter. Topics: /SOPAC/GPS/CRTN01/RYO, /SOPAC/GPS/CRTN01/ASCII, /SOPAC/GPS/CRTN01/POS, /SOPAC/GPS/CRTN01/DSME.]
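The chaining of filters over topics can be sketched with a small in-memory publish/subscribe broker. This is a stand-in for illustration, not the NaradaBrokering API: the `TopicBroker` class, the `attach_filter` helper, and both converter functions are hypothetical, and the sample message is plain text, whereas real RYO data is a binary format.

```python
from collections import defaultdict


class TopicBroker:
    """Minimal in-memory topic broker: subscribers receive messages per topic."""

    def __init__(self):
        self._subscribers = defaultdict(list)

    def subscribe(self, topic, callback):
        self._subscribers[topic].append(callback)

    def publish(self, topic, message):
        for callback in list(self._subscribers[topic]):
            callback(message)


def attach_filter(broker, in_topic, out_topic, transform):
    """A filter subscribes to one topic and republishes its output on another."""
    broker.subscribe(in_topic, lambda msg: broker.publish(out_topic, transform(msg)))


broker = TopicBroker()
BASE = "/SOPAC/GPS/CRTN01"

# Hypothetical stand-ins for the ryo2ascii and ascii2pos filters.
attach_filter(broker, f"{BASE}/RYO", f"{BASE}/ASCII", lambda raw: raw.decode("ascii"))
attach_filter(broker, f"{BASE}/ASCII", f"{BASE}/POS",
              lambda line: tuple(float(v) for v in line.split()[1:4]))

received = []
broker.subscribe(f"{BASE}/POS", received.append)
broker.publish(f"{BASE}/RYO", b"SITE1 1.0 2.0 3.0")
```

A client interested only in positions subscribes to the POS topic and never sees the raw stream, which is how this style differs from request/response services.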
41
Application Integration with Real-Time Filters
Station Monitor Filter records real-time positions for 10 minutes and calculates position changes.
  Graph Plotter Application creates visual representation of the positions.
RDAHMM Filter records real-time positions for 10 minutes and invokes RDAHMM application, which determines state changes in the XYZ signal.
  Graph Plotter Application creates visual representation of the RDAHMM output.
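The 10-minute recording behavior of the Station Monitor Filter can be approximated with a sliding time window. The sketch below is an assumption about how such a filter might be structured, not the project's implementation; the class name and the choice to measure change as last-minus-first position in the window are illustrative.

```python
from collections import deque


class StationMonitor:
    """Keep the most recent window of (timestamp, (x, y, z)) samples for one station."""

    def __init__(self, window_seconds=600):
        self.window_seconds = window_seconds
        self._samples = deque()

    def __len__(self):
        return len(self._samples)

    def add(self, timestamp, position):
        """Record a sample and discard any older than the window."""
        self._samples.append((timestamp, position))
        while self._samples and timestamp - self._samples[0][0] > self.window_seconds:
            self._samples.popleft()

    def position_change(self):
        """Displacement between the oldest and newest samples in the window."""
        if len(self._samples) < 2:
            return (0.0, 0.0, 0.0)
        first = self._samples[0][1]
        last = self._samples[-1][1]
        return tuple(b - a for a, b in zip(first, last))


monitor = StationMonitor()
monitor.add(0, (0.0, 0.0, 0.0))
monitor.add(300, (1.0, 2.0, 3.0))
monitor.add(700, (2.0, 4.0, 6.0))  # the sample at t=0 is now outside the window
change = monitor.position_change()
```

An RDAHMM filter could reuse the same buffer and hand the windowed samples to the analysis code instead of differencing them.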
42
2 - Multiple Publishers Test
We add more GPS networks by running more publishers.
The results show that 1000 publishers can be supported with no performance loss. This is an operating system limit.
[Diagram: RYO Publisher 1, RYO Publisher 2, ..., RYO Publisher n publish to the NB Server on Topic 1A, Topic 1B, Topic 2, ..., Topic n; the RYO To ASCII Converter and a Simple Filter are attached to the NB Server.]
43
4 - Multiple Brokers Test
NaradaBrokering allows creation of Broker networks. We create a two-broker network.
Messages published to the first broker can be received from the second broker.
We take timings on each broker.
We connect 750 clients to each broker and run for 24 hours. We chose 750 clients to stay well below the saturation limit.
The results show that the performance is very good and similar to the single broker test.
[Diagram: an RYO Publisher and the RYO To ASCII Converter use Topic 1A and Topic 1B across NB Server 1 and NB Server 2; Simple Filter 1 through Simple Filter 750 and Simple Filter 751 through Simple Filter 1500 subscribe to Topic 1B.]