EGRID Project: Experience Report Implementation of a GRID Infrastructure for the Analysis of Economic and Financial data.

Slides:



Advertisements
Similar presentations
29 June 2006 GridSite Andrew McNabwww.gridsite.org VOMS and VOs Andrew McNab University of Manchester.
Advertisements

Overview of local security issues in Campus Grid environments Bruce Beckles University of Cambridge Computing Service.
Andrew McNab - Manchester HEP - 2 May 2002 Testbed and Authorisation EU DataGrid Testbed 1 Job Lifecycle Software releases Authorisation at your site Grid/Web.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
The Community Authorisation Service – CAS Dr Steven Newhouse Technical Director London e-Science Centre Department of Computing, Imperial College London.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Status of Globus activities within INFN (update) Massimo Sgaravatto INFN Padova for the INFN Globus group
INFN Testbed status report L. Gaido WP6 meeting CERN - October 30th, 2002.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
OSG End User Tools Overview OSG Grid school – March 19, 2009 Marco Mambelli - University of Chicago A brief summary about the system.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
Ákos FROHNER – DataGrid Security Requirements n° 1 Security Group D7.5 Document and Open Issues
A.Guarise – F.Rosso 1 Enabling Grids for E-sciencE INFSO-RI Comprehensive Accounting Views on large computing farms. Andrea Guarise & Felice Rosso.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Moving Large Amounts of Data Rob Schuler University of Southern California.
CMS Stress Test Report Marco Verlato (INFN-Padova) INFN-GRID Testbed Meeting 17 Gennaio 2003.
Architecture and ATLAS Western Tier 2 Wei Yang ATLAS Western Tier 2 User Forum meeting SLAC April
CEOS WGISS-21 CNES GRID related R&D activities Anne JEAN-ANTOINE PICCOLO CEOS WGISS-21 – Budapest – 2006, 8-12 May.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
The new EGRID infrastructure An update on the status of the EGRID project S.C. on behalf of the EGRID team.
Trusted Virtual Machine Images a step towards Cloud Computing for HEP? Tony Cass on behalf of the HEPiX Virtualisation Working Group October 19 th 2010.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
2-Sep-02Steve Traylen, RAL WP6 Test Bed Report1 RAL and UK WP6 Test Bed Report Steve Traylen, WP6
Rutherford Appleton Lab, UK VOBox Considerations from GridPP. GridPP DTeam Meeting. Wed Sep 13 th 2005.
VO Box Issues Summary of concerns expressed following publication of Jeff’s slides Ian Bird GDB, Bologna, 12 Oct 2005 (not necessarily the opinion of)
EGRID The EGRID project S.Cozzini Athens, 21 rs April 2005.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
StoRM: status report A disk based SRM server.
INFN GRID Production Infrastructure Status and operation organization Cristina Vistoli Cnaf GDB Bologna, 11/10/2005.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Mario Reale – GARR NetJobs: Network Monitoring Using Grid Jobs.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Enabling Grids for E-sciencE INFN Workshop – May 7-11 Rimini 1 Grid Accounting Status at INFN Riccardo Brunetti INFN-TORINO.
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Virtual Directory Services and Directory Synchronization May 13 th, 2008 Bill Claycomb Computer Systems Analyst Infrastructure Computing Systems Department.
CNAF - 24 September 2004 EGEE SA-1 SPACI Activity Italo Epicoco.
J Jensen / WP5 /RAL UCL 4/5 March 2004 GridPP / DataGrid wrap-up Mass Storage Management J Jensen
1 The EGRID infrastructure Stefano Cozzini on behalf of the EGRID team EGRID project ( and Democritos National Simulation Centre.
Jean-Philippe Baud, IT-GD, CERN November 2007
Workload Management Workpackage
INFNGRID Technical Board, Feb
Grid Computing: Running your Jobs around the World
Job monitoring and accounting data visualization
SuperB – INFN-Bari Giacinto DONVITO.
Regional Operations Centres Core infrastructure Centres
ALICE and LCG Stefano Bagnasco I.N.F.N. Torino
The EDG Testbed Deployment Details
E.Corso, S.Cozzini, A.Leto, R. Murri, A. Terpin, C. Zoicas
S.Cozzini Den Haag, 25th November 2004
Classic Storage Element
StoRM: a SRM solution for disk based storage systems
The EGRID project S.Cozzini Athens, 21rs April 2005.
Evaluation of “data” grid tools
Securing the Network Perimeter with ISA 2004
Data services on the NGS
Accounting at the T1/T2 Sites of the Italian Grid
Introduction to Data Management in EGI
VOCE Peter Kaczuk, Dan Kouril, Miroslav Ruda, Jan Svec,
SUBMITTED BY: NAIMISHYA ATRI(7TH SEM) IT BRANCH
Artem Trunov and EKP team EPK – Uni Karlsruhe
Interoperability & Standards
Report on GLUE activities 5th EU-DataGRID Conference
From Prototype to Production Grid
The GENIUS portal and the GILDA t-Infrastructure
INFNGRID Workshop – Bari, Italy, October 2004
gLite The EGEE Middleware Distribution
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

EGRID Project: Experience Report Implementation of a GRID Infrastructure for the Analysis of Economic and Financial data

EGRID Project: Experience Report Econophysics GRID Italian Ministry of Education (MIUR) funded project. Purpose: pilot project for future Italian National GRID facility for Economics and Finance. Serves the computing needs of two select research projects: INFM’s High frequency dynamics in financial markets. AREA Trieste’s Softcomputing techniques applied to modern finance. (Both applying models from Physics to Economics/Finance)

EGRID Project: Experience Report Summary: User requirements The EGRID facility Deficiencies of EDG middleware EGRID solutions/workarounds Next steps of EGRID

I. User requirements

User requirements AREA Big DB of corporate budget analysis: to be exported to GRID + WEB. Access must be: secure + authenticated + authorised. No real need for computing power.

User requirements INFM: Management services for 2TB Stock Exchange data (NYSE, Milan, etc.). Data privacy and security: legally binding contracts. Processing facility for raw data.

II. The EGRID facility

The EGRID facility Physical Infrastructure: Non-partner centre INFN Padova supplies all bulk computing power + storage. Resources: 2.6TB storage + 4 exclusive CPUs CPUs best effort. INFN Padova already part of national High Energy Physics GRID – INFN-GRID. Our Users provide limited local GRID-enabled buffer storage to offset bandwidth problems.

The EGRID facility RB (Padova) CE SE 2.6 TB WNs 100 CPUs CE+SE+WN Padova Trieste Palermo Firenze.. site

The EGRID facility Software Infrastructure: Peripheral Sites with same middleware of INFN-GRID: GLOBUS 2.2/2.4 based EDG/LCG2. EGRID software layer on top of EDG/LCG2 to simplify data management: egrid-upload /nyse tar.gz lfn:/fonti/cd/nyse tar.gz edg-replica-manager --vo=egrid copyAndRegisterFile \ file:///home/usr/nyse tar.gz \ -d sfn://egrid-10.egrid.it/flatfiles/SE00/egrid/fonti/cd/nyse tar.gz \ -l lfn:/fonti/cd/nyse tar.gz

The EGRID facility Software Infrastructure: Raw data processing EGRID SW: Stock Exchange format -> more usable research format. Ad-Hoc solution for AREA DB access: web- enabling techniques (CGI, JSP, etc.) + GSI security (Apache MOD_GRIDSITE) + GRID Information System integration.

III. Deficiencies of EDG Middleware

Deficiencies of EDG Middleware Data privacy and security GSIFTP protocol moves data around the GRID but GridFTP daemon only enforces access restrictions by way of standard UNIX permission triple. Pool account mechanism on SE does not allow access rights partitioning within same VO. Neither authentication nor authorization enforced on RLS: replica catalogue easily corrupt!

Deficiencies of EDG Middleware Middleware deployment EDG based on Red Hat Linux 7.3 No complete installation instructions. LCFGng installation tool poorly documented + needs dedicated machine + does not allow useful software combinations (i.e. no CE+SE+WN on same machine). UI needs dedicated machine: cannot be installed on user’s own workstation.

IV. EGRID solutions/workarounds

EGRID solutions/workarounds Data privacy and security: Data resides in SE - that’s where security must be guaranteed; no ACLs available – RedHat 7.3 limit. Pool account mechanism disabled in SE. Each GRID user mapped to his/her own corresponding local account. UNIX groups formed by gathering users based on contract rights to data access. Files on SE protected by group ownership rights. A nested directory structure allows: read access to group + write access to subset of group. Central LDAP server publishes user/group account maps + propagates them to SE.

EGRID solutions/workarounds Middleware Deployment: Painstaking job of: documentation tracking down + deriving from LCFGng installation explicit procedures for single GRID elements + interpretation of obscure error messages + trial and error. Knoppix based LiveCD technology for UI and SuperNode: can be run on the fly from the CD, or can be installed on a machine. Script installs UI on any WorkStation – no need to re- install machine + no need for RedHat 7.3.

V. EGRID next steps

EGRID next steps Present security mechanism is only a temporary solution (scalability issues)! EGRID working with INFN to develop StoRM SRM server: features ACL enforced security to GRID files. Portal for User Applications to replace CLI. Porting of Parallel Applications.