VO-Neural / Data Mining Exploration

Slides:



Advertisements
Similar presentations
Building Portals to access Grid Middleware National Technical University of Athens Konstantinos Dolkas, On behalf of Andreas Menychtas.
Advertisements

LEAD Portal: a TeraGrid Gateway and Application Service Architecture Marcus Christie and Suresh Marru Indiana University LEAD Project (
TeraGrid Deployment Test of Grid Software JP Navarro TeraGrid Software Integration University of Chicago OGF 21 October 19, 2007.
A PPARC funded project AstroGrid Framework Consortium meeting, Dec 14-15, 2004 Edinburgh Tony Linde Programme Manager.
A Blackboard Building Block™ Crash Course for Web Developers
The Developer Perspective Michelle Osmond. Design – Requirements Gathering Sales & Research projects –Prototypes/Demos User group meetings Usability workshops.
A PPARC funded project AstroGrid’s Common Execution Architecture Guy Rixon, reporting on behalf of Paul Harrison and the other AstroGrid developers SC4DEVO-1,
Grid Programming Environment (GPE) Grid Summer School, July 28, 2004 Ralf Ratering Intel - Parallel and Distributed Solutions Division (PDSD)
CS CS 5150 Software Engineering Lecture 13 System Architecture and Design 1.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Jun Peng Stanford University – Department of Civil and Environmental Engineering Nov 17, 2000 DISSERTATION PROPOSAL A Software Framework for Collaborative.
Apache Jakarta Tomcat Suh, Junho. Road Map Tomcat Overview Tomcat Overview History History What is Tomcat? What is Tomcat? Servlet Container.
Understanding and Managing WebSphere V5
Struts 2.0 an Overview ( )
UNIT-V The MVC architecture and Struts Framework.
Programming mobile devices Part II Programming Symbian devices with Symbian C++
Construction of efficient PDP scheme for Distributed Cloud Storage. By Manognya Reddy Kondam.
M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,
- 1 - Grid Programming Environment (GPE) Ralf Ratering Intel Parallel and Distributed Solutions Division (PDSD)
Leicester, February 24, 2005 VisIVO, a VO-Enabled tool for Scientific Visualization and Data Analysis. VO-TECH Project. Stage01 Ugo Becciani INAF – Astrophysical.
Web Applications Harry R. Erwin, PhD University of Sunderland CIT304/CSE301.
|Tecnologie Web L-A Anno Accademico Laboratorio di Tecnologie Web Introduzione ad Eclipse e Tomcat
ANSTO E-Science workshop Romain Quilici University of Sydney CIMA CIMA Instrument Remote Control Instrument Remote Control Integration with GridSphere.
Fundamentals of Database Chapter 7 Database Technologies.
GumTree Feature Overview Tony Lam Data Acquisition Team Bragg Institute eScience Workshop 2006.
Contents 1.Introduction, architecture 2.Live demonstration 3.Extensibility.
Installation and Development Tools National Center for Supercomputing Applications University of Illinois at Urbana-Champaign The SEASR project and its.
E-Science Data Information and Knowledge Transformation Edikt : e-Science Data, Information and Knowledge Transformation E-Science Centres of Excellence.
MobileMAN Internal meetingHelsinki, June 8 th 2004 NETikos activity in MobileMAN project Veronica Vanni NETikos S.p.A.
Shannon Hastings Multiscale Computing Laboratory Department of Biomedical Informatics.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
ALICE, ATLAS, CMS & LHCb joint workshop on
Plug-In Architecture Pattern. Problem The functionality of a system needs to be extended after the software is shipped The set of possible post-shipment.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
DAME Astrophysical DAta Mining & Exploration on GRID M. Brescia – S. G. Djorgovski – G. Longo & DAME Working Group Istituto Nazionale di Astrofisica –
Paperless Timesheet Management Project Anant Pednekar.
NeuroLOG ANR-06-TLOG-024 Software technologies for integration of process and data in medical imaging A transitional.
CS562 Advanced Java and Internet Application Introduction to the Computer Warehouse Web Application. Java Server Pages (JSP) Technology. By Team Alpha.
CMS Luigi Zangrando, Cern, 16/4/ Run Control Prototype Status M. Gulmini, M. Gaetano, N. Toniolo, S. Ventura, L. Zangrando INFN – Laboratori Nazionali.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Convert generic gUSE Portal into a science gateway Akos Balasko.
Modern Programming Language. Web Container & Web Applications Web applications are server side applications The most essential requirement.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
UNDERSTANDING YOUR OPTIONS FOR CLIENT-SIDE DEVELOPMENT IN OFFICE 365 Mark Rackley
Live Mobile and Cloud Hacking by ACEs OGh | Fusion Middleware Experience (FME) 16-feb :45 – 16:45.
Singleton Academy, Pune. Course syllabus Singleton Academy Pune – Course Syllabus1.
GROUP PresentsPresents. WEB CRAWLER A visualization of links in the World Wide Web Software Engineering C Semester Two Massey University - Palmerston.
Plug-In Architecture Pattern. Problem The functionality of a system needs to be extended after the software is shipped The set of possible post-shipment.
CMS Luigi Zangrando, Cern, 16/4/ Run Control Prototype Status M. Gulmini, M. Gaetano, N. Toniolo, S. Ventura, L. Zangrando INFN – Laboratori Nazionali.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Grid2Win : gLite for Microsoft Windows Elisa Ingrà - INFN.
REST API to develop application for mobile devices Mario Torrisi Dipartimento di Fisica e Astronomia – Università degli Studi.
OGSA-DAI.
The Holmes Platform and Applications
WebSphere Diego Leone.
Shaowen Wang1, 2, Yan Liu1, 2, Nancy Wilkins-Diehr3, Stuart Martin4,5
Grid Portal Services IeSE (the Integrated e-Science Environment)
Chapter 18 MobileApp Design
CS 501: Software Engineering Fall 1999
GSAF Grid Storage Access Framework
Introduction to J2EE Architecture
CMPE419 Mobile Application Development
Distributed System Using Java 2 Enterprise Edition (J2EE)
Module 01 ETICS Overview ETICS Online Tutorials
JavaServer Faces: The Fundamentals
School of Computer Science
Google Sky.
CMPE419 Mobile Application Development
Plug-In Architecture Pattern
Presentation transcript:

VO-Neural / Data Mining Exploration The VO-Neural/DAME infrastructure: an integrated data mining system support for the e-science community G. Longo – M. Brescia – O. Laurino & Project Team INAF – Osservatorio Astronomico di Capodimonte Dipartimento di Fisica – Università degli Studi di Napoli Federico II California Institute of Technology OACN Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

astrophysical requirements VO-Neural / Data Mining Exploration astrophysical requirements Astronomical data rate Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

astrophysical requirements VO-Neural / Data Mining Exploration astrophysical requirements Astronomical computational rate GRID Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

astrophysical requirements VO-Neural / Data Mining Exploration astrophysical requirements Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration VO methodology Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration Exploration on datasets Knowledge Discoveries in Databases (KDD) is in practice still unknown to most astronomers Exploration on datasets Dimensional reduction Classification Regression Clustering Forecasting Filtering Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Data Mining Exploration with VO-Neural/DAME VO-Neural / Data Mining Exploration Data Mining Exploration with VO-Neural/DAME Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Data Mining Exploration with VO-Neural/DAME VO-Neural / Data Mining Exploration Data Mining Exploration with VO-Neural/DAME Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration What we want to do…? In 2006, a group of astronomers, computer scientists, engineers and physicians started to explore possible joined effort to create a data mining toolset, based on GRID infrastructure and VO standards, for worldwide users who want to share data, methods and discoveries. Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration How to do this…? Object Oriented Programming Internal VO standards and protocols Java language (generic for DMM) User/Session Registry DB (MySQL) Web-based User I/O Web Application and Web Service Technology Plugin Modularity (easy to be integrated/modified) Hardware independent through GRID driver Data conversion and manipulation support Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration FRONT END Component Architecture: MVC (Model-View-Controller); Technology: Struts 2.0 (building infrastructure tool); Java Servlet & JSP (dynamic context-dependent web page generation); Features: User GUI deployment and I/O management; interaction with internal components through standard protocol (XML); Local User/Session data virtualization through Virtual File Store; Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration DR Component Architecture: It depends on the environment choice; In S.Co.P.E. DR is a component running on the GRID UI; Technology (in S.Co.P.E.): GRID Software (middleware gLite); Features: Storage Device(s) + Execution Environment = Deployment Environment; Different Deployment Environments can be more suited for a specific task (e.g. an MLP TEST is unlikely to be a computing intensive task, so GRID latency times are not needed); Dynamic Driver Loading => Driver Plugins; Drivers are available to the Framework WS and to the Plugins; Also used to convert files formats (standard or DMM dependent); Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration DMM Component Architecture: data mining class hierachy for functionality implementation; Technology: available model packages and libraries; custom ad hoc model design and development; custom wrappers for internal standardization; Features: modularity; fast third part application integration; functionality specialization; multi-language programming support; Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration REDB Component Architecture: JDBC; Technology (in S.Co.P.E.): MySQL and JDBC API; Features: management of user (registration, authentication, working sessions, experiments and files) information and their relationships; store and manage information about three different file's categories: “supported”, “exotic” and “custom” (datasets, model configuration and intermediate data); Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

VO-Neural / Data Mining Exploration FRAMEWORK Component Architecture: Restful Web Service (client-server apps with resource addressable with HTTP methods); DM models control interface through Plugin SDK; Technology: Web container SUN Apache Tomcat; Java Servlet for web service; Features: Internal resource representation through ”contextual” VOTables; Experiment configuration and execution; user authentication and working session management; experiment data & working flow trigger and supervision; Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Project Management Approach VO-Neural / Data Mining Exploration Project Management Approach Suite target proposal XP – eXtreme Programming Design & Documentation process: Statement of work & Project Plan Project Design Description Software Requirement Specifications Software Design Description Implementation & Test Procedures Technical Reports Test Reports User & Maintenance Manuals Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

User/Developer Perspective VO-Neural / Data Mining Exploration User/Developer Perspective A simple user can upload and build his datasets, configure the data mining models available, execute different experiments in service mode, load graphical views of partial/final results. You are not considering yourself as a simple user? Ok, so you think to be a Developer. Or at least a scientist who wants to upload and use his application (and possibly to share it with others). Be honest, you don’t trust someone else’s application. So You want to extend our framework? YES, WE CAN! DM Models Development Download our DM Models library; Add new low level/DM shared libraries and related new wrapper; Extend the DM class hierarchy; Model/Driver Plugin Development Download our SDK; Implement and test the DMPlugin abstract class; Provide a method to produce the plugin description and Submit for Registration; The same if you want to develop a new driver for a specific environment or storage system. Just implement the Driver Plugin Interface and register it; Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Application Prototype VO-Neural / Data Mining Exploration Application Prototype Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Application Prototype VO-Neural / Data Mining Exploration Application Prototype Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Application Prototype VO-Neural / Data Mining Exploration Application Prototype Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Application Prototype VO-Neural / Data Mining Exploration Application Prototype Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Application Prototype VO-Neural / Data Mining Exploration Application Prototype Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Application Prototype VO-Neural / Data Mining Exploration Application Prototype Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009

Our first scientific use cases VO-Neural / Data Mining Exploration Our first scientific use cases Mining the SDSS Archive I. Photometric redshifts in the nearby Universe, R. D’Abrusco et al. (The Astrophysical Journal, 663: 752-764, 2007 July 10. astro‐ph/0805.0156v1; to appear soon in MNRAS (R. D’Abrusco et al.) Cavuoti 2008, Thesis (VONeural website, voneural.na.infn.it) In this Conference poster session: A web application for photometric redshifts evaluation Omar Laurino et al. Workshop finale dei Progetti GRID del PON "Ricerca" 2000-2006 – Avviso 1575 – Catania, February 10-12 ,2009