BLOGGING MEETS COMPUTATIONAL CHEMISTRY Dr Kieron Taylor University of Southampton*

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MyProxy and EGEE Ludek Matyska and Daniel.
Advertisements

Data Storage & Security Dr Alastair F. Brown Head of Computing MRC Human Genetics Unit MRC Institute of Genetics and Molecular Medicine The University.
A Toolbox for Blackboard Tim Roberts
Test Case Management and Results Tracking System October 2008 D E L I V E R I N G Q U A L I T Y (Short Version)
Experience of the SRB in support of collaborative grid computing Martin Dove University of Cambridge.
A Presentation Management System for Collaborative Meetings Krzysztof Wrona (ZEUS) DESY Hamburg 24 March, 2003 ZEUS Electronic Meeting Management System.
OPEN RESEARCH DATA, EPFL, 28 October 2014, M. Töwe, M. Bärlocher docuteam packer: viewer and editor for file structures and metadata.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Administration & Workflow
Chapter 9 Chapter 9: Managing Groups, Folders, Files, and Object Security.
Brief Overview of Major Enhancements to PAWN. Producer – Archive Workflow Network (PAWN) Distributed and secure ingestion of digital objects into the.
What is a blog? “Web log” In simple terms, a blog is a web page where what you write goes in chronological order on the front page Author can write, viewers.
ISACS Assessment Tool Advanced Guide About this guide This guide is designed to detail this software’s functions and features. Before getting started.
Biometric Daily Time Record
INTRODUCTION TO CLOUD COMPUTING Cs 595 Lecture 5 2/11/2015.
AMI S.A. Datasets… Solveig Albrand. AMI S.A. A set is… A number of things grouped together according to a system of classification, or conceived as forming.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
SOFTWARE.
BECOME A BLOGGER: Create a Classroom that Extends Beyond the Boundaries of the School Building.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
WIKI IN EDUCATION Giti Javidi. W HAT IS WIKI ? A Wiki can be thought of as a combination of a Web site and a Word document. At its simplest, it can be.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
CCP4 Study Weekend 3rd January 2003 CCP4i - “Tricks and Tools” Peter Briggs CCP4 Daresbury.
Team working in distributed environments M253 Project Logs Faculty of Computer Studies Arab Open University Kuwait Branch 9/19/20151Kwuait Branch.
Eric GrahamNathan Yau Staff Ecologist, CENSGraduate Student, Department of Statistics Use CasesSensorBase Coupled Human-Observational Systems Technology.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
Royal Society of Chemistry activities to develop a data repository for chemistry-specific data Aileen Day, Alexey Pshenichnov, Ken Karapetyan, Colin Batchelor,
The Information Environment for Neuroscientists David R Newman
About Me I have been working with sharepoint since 2008 My blog:
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
1 Instant Data Warehouse Utilities Extended (Again!!) 14/7/ Today I am pleased to announce the publishing of some fantastic new functionality for.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Wenjing Wu Computer Center, Institute of High Energy Physics Chinese Academy of Sciences, Beijing BOINC workshop 2013.
Collecting Data Types, coding, accuracy, file formats and the effect of data loss.
Copyright © 2015 – Curt Hill Version Control Systems Why use? What systems? What functions?
Reorientation for Moodle 2 Staff Guide. File Repositories With Moodle 2’s file repository system: Duplicate files are only stored once, saving disk space.
Jens Thomas Lensfield Quixote. Quixote Project An international, open-source, open-data collaboration to design, test and.
(1) A “Software ICU” for assessing and maintaining software project health Philip Johnson Collaborative Software Development Laboratory Information and.
Walk through the reporting process for Barcelona Convention using Reportnet Miruna Badescu, Giuseppe Aristei.
NGS Portal.
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
Automatic Intelligent Scheduler By  Patil Chetan Pravin  Patel Javed Abbas  Raorane Pratik Anil.
1 Understanding Cataloging with DLESE Metadata Karon Kelly Katy Ginger Holly Devaul
Electronic labnotes Mari Wigham COMMIT/. Information WUR  Organising, sharing, finding and reusing data  Expertise in: ● Modelling data.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Web Portal for Chemists M. Sterzel,
Sep 13, 2006 Scientific Computing 1 Managing Scientific Computing Projects Erik Deumens QTP and HPC Center.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
Application Software System Software.
Why A Software Review? Now have experience of real data and first major analysis results –What have we learned? –How should that change what we do next.
GdI/ICS 1 WS 2009/2010 Telecooperation/RBG Prof. Dr. Max Mühlhäuser Dr. Guido Rößling Dr. Dirk Schnelle-Walka, Stefan Radomski.
Dataset citation Clickable link to Dataset in the archive Sarah Callaghan (NCAS-BADC) and the NERC Data Citation and Publication team
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space The Capabilities of the GridSpace2 Experiment.
Cameron Neylon School of Chemistry, University of Southampton & Science and Technology Facilities Council, Rutherford Appleton Laboratory, Oxfordshire.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
A Data Handling System for Modern and Future Fermilab Experiments Robert Illingworth Fermilab Scientific Computing Division.
Scientific data storage: How are computers involved in the following?
Activities Assignments Blogs* Chat* Choice Database Forum* Hot Potatoes* Journals Quizzes (Tests) Wiki* Audio Recorder Podcasts* Surveys VodCasts* Webquest*
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
Earth Observation inputs to ATF Annalisa Terracina EU-DataGrid Project Work Package 9 – EO Applications April 2003 CERN.
Simulation Production System
Skill Based Assessment
11/16/2018.
Rui Wu, Jose Painumkal, Sergiu M. Dascalu, Frederick C. Harris, Jr
Code Analysis, Repository and Modelling for e-Neuroscience
Friday 12/7/18- Period 6 Bell Ringer: Models in science have the purpose of making a particular part or feature of the world easier to understand.
Code Analysis, Repository and Modelling for e-Neuroscience
Presentation transcript:

BLOGGING MEETS COMPUTATIONAL CHEMISTRY Dr Kieron Taylor University of Southampton*

Grid computing + lab notebooks

Managing concurrent jobs and handling the results.  Paper notebooks are a disaster for multiple computational jobs. Users must log file paths and job names by hand.  Simulation archive must be “synchronized” with the lab notebook.  Science is only as good as the record-keeping, particularly after significant time has elapsed.  It is easier to re-run than it is to figure out what happened to the answers!

Build a database? No!  Job management systems already exist e.g. eMinerals RMCS, but they only operate on one system. No help for trial runs on private hardware.  Chemistry simulations can generate gigabytes of data each. A complete archive is unmanageable, but we must keep the data while we process it.  Processing trajectories is often custom and not always suitable for Grids.  Management system still does not provide contextual scientific discourse.

Computational chemistry is one ongoing experiment  Simulations are not guaranteed to finish.  Parameters must be tweaked.  Surprisingly little real time is spent in “production”.  Failures often need careful examination before they can be fixed.  Data is static, but analysis and opinion can change over time. It is super-important to know what conditions a simulation was performed under.

Enter the Blog  Southampton University chemistry Bloggers attempt to extend blogging into a useful experimental tool.  Autoblogging laser rigs  Open science experimental blogs from people  Now computational chemistry too  Blogging must be worth the effort!

Blogging computational experiments  Writing a Blog entry requires thought and some presentational effort. This is irritating, but very useful in retrospect. Daily digest.  Computational jobs have input decks and result files that must be kept with the observations. Inter-Blog links do this well, but uploading files is a significant problem. Trajectories?  The Blog is useful for presenting progress to others. The work is already done.  Writing a Blog is easy. Writing a useful Blog is not.

Autoblogging eases the task  Manual Blog  User submits job  User collects results  User writes Blog entry  User uploads result files to Blog  User (maybe) assigns metadata  Autoblog  User submits job  Job submission system Blogs automatically at start.  Job submission system Blogs at end of job.

Blog-supported Grid computing Private Repository Blog API

Merits and limitations of Blogs  Blogs are stupid.  Blog posts are automatically chronological.  Writing a blog post forces the user to order their thoughts and present them on a regular basis.  Boss can easily see what people are getting up to.  Restricted access allows collaboration without global disclosure.  User defined tagging allows management of discrete experiments in addition to finding data by timestamp.

Better Blogs Blog API allows read and write, so we can write helper- tools to do additional actions for us.

The Future  Meta-Blog interface to collect together posts from different Blogs into one coherent report about an experiment.  Clever storage management on- and off-Grid. When is data truly dispensable?  Lablog 3.0, a better Blogging platform.  Easier Grid use for molecular simulations.  Researchers who can tell you what they did last year!

Acknowledgments  NGS staff: Jonathan Churchill, Gordon Brown, Keir Hawker  DL_POLY author: Dr William Smith (Daresbury)  DL_POLY user: Robert Hawtin (unknown)  Blog coder: Andrew Milsted (Southampton)