E-Science Experiences: Software Engineering Practice and the EU DataGrid. Lee Momtahan and Andrew Martin, Oxford University Software Engineering Centre.

Presentation transcript:

Contents: EU-DataGrid; Challenges; Comparisons; Conclusions

EU-DataGrid: a 9.8 million euro project over 3 years; 21 partners in 15 countries; applications in particle physics (and bioinformatics and earth sciences); petabytes of data, with datasets to be catalogued and replicated where necessary; seamless delivery of computing resources; 200 staff (60 FTE), meeting infrequently.

Project Goals: build application frameworks potentially involving huge amounts of data, compute power and distribution; provide secure, managed, uniform access to such resources; facilitate collaboration and remote access to data and scientific instruments; manage such facilities as a persistent service.

Work Package Structure

Our role: became involved after the project started; funded to bring computer science and software engineering experience to the project; intending to help by modelling aspects of the design so that the system may be better understood, designed, built and documented; in passing, made the observations documented here; interested in the generality of these issues for e-Science.

Challenges: Requirements volatility: novel paradigm; new diversity; volatile off-the-shelf components. Geographical separation: communication can easily become a limiting factor (Brooks); physicists are used to collaborating on experiments, but on software? System decomposition: political concerns; geographic determination.

Challenges: Project processes and authority: there is a quality plan, but how do you get people to follow it? Is a commercially based process appropriate? What about traditional academic means of QA?

Challenges: Planning and tracking: the exit criterion for an iteration seems to be the completion of a document detailing the problems found in testing.

Comparisons: academic software production vs. commercial software production; academic software production vs. other academic activity (CMM level for software? for paper/proposal writing?); open source software vs. open source development; open source models vs. publicly funded research; publication in journals vs. publication to a repository.

Conclusions: The practice of software engineering in an e-Science context is substantially different to industrial practice. Industrial models do not seem appropriate. Open source models seem to fit better. Publication and review are the key to quality and process improvement.