Cardiff School of Computer Science & Informatics Biodiversity Informatics at COMSC Andrew Jones & Richard White School of Computer Science & Informatics.

Slides:



Advertisements
Similar presentations
Large-Scale, Adaptive Fabric Configuration for Grid Computing Peter Toft HP Labs, Bristol June 2003 (v1.03) Localised for UK English.
Advertisements

Chapter 1: The Database Environment
Chapter 27 Software Change.
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
OMV Ontology Metadata Vocabulary April 10, 2008 Peter Haase.
…to Ontology Repositories Mathieu dAquin Knowledge Media Institute, The Open University From…
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Cultural Heritage in REGional NETworks REGNET Technological Implementation Plan – D12.
Source of slides: Introduction to Automata Theory, Languages and Computation.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
Presented to: By: Date: Federal Aviation Administration Registry/Repository in a SOA Environment SOA Brown Bag #5 SWIM Team March 9, 2011.
1 Building scientific Virtual Research Environments in D4Science Paul Polydoras University of Athens, Greece.
EA Demonstration Study : Dissemination Forum – 8 June EA Views and Sub-views Patrick Bardet EA Unit.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Multiplying binomials You will have 20 seconds to answer each of the following multiplication problems. If you get hung up, go to the next problem when.
0 - 0.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
SUBTRACTING INTEGERS 1. CHANGE THE SUBTRACTION SIGN TO ADDITION
MULT. INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
Addition Facts
Jone Garmendia, Head of Cataloguing 25 November 2011 The National Archives Taxonomy.
The National Grid Service Mike Mineter.
|epcc| NeSC Workshop Open Issues in Grid Scheduling Ali Anjomshoaa EPCC, University of Edinburgh Tuesday, 21 October 2003 Overview of a Grid Scheduling.
Copyright 2006 Digital Enterprise Research Institute. All rights reserved. MarcOnt Initiative Tools for collaborative ontology development.
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
1 DTI/EPSRC 7 th June 2005 Reacting to HCI Devices: Initial Work Using Resource Ontologies with RAVE Dr. Ian Grimstead Richard Potter BSc(Hons)
Metadata for images Michael Day UKOLN: UK Office for Library and Information Networking University of Bath
Pure Silver Reusing and Repurposing Bibliographic Data in a Current Research Information System and Institutional Repository 15 September.
ZMQS ZMQS
Preserving and Sharing Digital Data Greg Colati, Director, Archives and Special Collections May 11, 2012.
Copyright Pearson Prentice Hall
BT Wholesale October Creating your own telephone network WHOLESALE CALLS LINE ASSOCIATED.
1 The OneGeology project IC GS Ian Jackson, February 2007.
1 Quality Indicators for Device Demonstrations April 21, 2009 Lisa Kosh Diana Carl.
ABC Technology Project
Collections and services in the information environment JISC Collection/Service Description Workshop, London, 11 July 2002 Pete Johnston UKOLN, University.
1 IC GS J. Broome, Mar Introduction to the Informatics and Data Aspects John Broome (Canada)
Twenty Questions Subject: Twenty Questions
Project Overview Slide 2 of 15 Overview Project in a Nutshell ◦Motivation ◦Aims and Objectives ◦Expected Outcomes PlanetData Programs Join PlanetData.
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
At Reading Frank Bisby, Alistair Culham, Paul Valdes, Neil Caithness, Tim Sutton, Peter Brewer At Cardiff Alec Gray, Andrew Jones, Nick Fiddian, Nick Pittas,
© 2012 National Heart Foundation of Australia. Slide 2.
An introduction to collections and collection-level description Collection-Level Description & NOF-digitise projects NOF-digitise programme seminar, London,
4D4Life Opening Meeting, September, Reading, UK The Global Multi-Hub Network in Concept Frank Bisby An EC Seventh Framework Scientific Data Infrastructures.
EDIT General Meeting Carvoeiro, January 2008.
Addition 1’s to 20.
25 seconds left…...
CSTA K-12 Computer Science Standards (rev 2011)
Week 1.
We will resume in: 25 Minutes.
1 Unit 1 Kinematics Chapter 1 Day
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 31 Slide 1 Service-centric Software Engineering 1.
E uropean N etwork for B iodiversity I nformation Cees H.J. Hof Universiteit van Amsterdam EC supported 5th framework programme.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Common Data Models and Protocols Richard White, Cardiff University Talk given at “Making Species Databases Interoperable”,
10 March 2004Richard J. White – COMSC / BB Unit Reliable knowledge discovery in a biodiversity Grid Part 2: Litchi and ambiguous names by Richard J. White.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
BIS TDWG Conference 28 October 2013, Florence Documenting data quality in a global network: the challenge for GBIF Éamonn Ó Tuama, Andrea Hahn, Markus.
115 October 2005Richard White - Sp2000/ENBI - Stockholm Litchi: interlinking species information systems Richard White, Andrew Jones, Ed Donovan Computer.
Species Banks a GBIF mechanism to provide electronic access to quality species information Peter H. Schalk, Marc Brugman ETI, University of Amsterdam Tinde.
Indexing the Species Names of the World - for the World Frank Bisby (Species 2000), Michael Ruggiero (ITIS) Per de Place Bjørn (GBIF - ECAT)
BiodiversityWorld GRID Workshop NeSC, Edinburgh – 30 June and 1 July 2005 Taxonomic verification: Species 2000 and the Catalogue of Life Frank Bisby.
1October 2006Richard White, Andrew Jones & Frank Bisby - TDWG - St Louis Federating taxonomic databases: progress with the Catalogue of Life Dynamic Checklist.
The role of persistent identifiers in tracking taxon changes Andrew C. Jones, Richard J. White, Ewen R. Orme, School of Computer Science, Cardiff University,
Progress Alastair Culham. i4Life – the BIG aim To move Catalogue of Life from a research project to a sustainable service 1.To enhance the content 2.To.
GBIF - ECAT  Electronic Catalogue of Names of Known Organisms  Program Officer;  Per de Place Bjørn 
Big Data Needs Little CRUD:
Presentation transcript:

Cardiff School of Computer Science & Informatics Biodiversity Informatics at COMSC Andrew Jones & Richard White School of Computer Science & Informatics

Cardiff School of Computer Science & Informatics 2 Richard White’s interests Design and construction of database systems to deliver biodiversity data Methods for making these systems –interoperable with other systems –adaptable for multiple uses –capable of following concept changes deducing and maintaining information on changes (Extracting numerical information from images, e.g. in “Morphidas” project, not described here)

Cardiff School of Computer Science & Informatics 3 Premise Bioinformaticians want to use information about the species whose genetic material is being studied to understand their development Biodiversity scientists (including taxonomists, ecologists, etc.) want to use molecular data to enhance their classifications, phylogenies and models

Cardiff School of Computer Science & Informatics 4 Biodiversity informatics Therefore Bioinformatic and biodiversity data need to be linked together in many analyses Links often involve the species name as the key linking element

Cardiff School of Computer Science & Informatics 5 Species naming in a nutshell ( Corylus avellana L. ) Common (vernacular) names Latin descriptive phrases Linnaeus: binomial nomenclature Adanson: rules for precedence etc. Accepted names and synonyms Checklists (e.g. the Catalogue of Life …) Data (in different formats, e.g. Buffie …) is usually linked to species names Taxon concepts (including species and higher taxa such as genera, families, etc.) Tracking changes in taxon concepts …

Cardiff School of Computer Science & Informatics 6 Species 2000 & ITIS International programme to assemble data from “Global Species Databases” (GSDs) and deliver the Catalogue of Life (CoL) Authoritative up-to-date checklist of all the world’s species (1.3 out of 1.8m) Reference list of taxon concepts (with unique identifiers) to aid indexing and cross- referencing of species data sources Available on DVD, through the Web ( and by using electronic (“web”) services

Cardiff School of Computer Science & Informatics 7 The Catalogue of Life

Cardiff School of Computer Science & Informatics 8 4D4Life project “Distributed Dynamic Diversity Databases for Life”, EU project 2009 – 2012 Carry the Catalogue of Life forward with improved sustainable infrastructure In COMSC we are designing a new architecture and will deliver a working prototype Service-oriented, re-usable components

Cardiff School of Computer Science & Informatics 9 Re-usable components 1.GSD editors create a data resource “GSD1” 2.CoL partners create the Catalogue of Life from such resources 3.A user creates a new product using the Catalogue of Life 123

Cardiff School of Computer Science & Informatics 10 Interoperability Catalogue of Life –GSDs are heterogeneous in Content Access methods More generally –Multiple data representations & exchange formats –Changing concepts of taxa (and geography)

Cardiff School of Computer Science & Informatics 11 ENBI project and BUFFIE “European Network for Biodiversity Information”, EU project Mostly reporting on standards, practices and recommendations In COMSC, R. Sundaravadivelu developed a prototype interoperability demonstrator (BUFFIE, “Biodiversity Users Framework For Information Exchange”) Accepts data sources using different protocols and XML formats Provides a merged response in an XML format and protocol of the user’s choice

Cardiff School of Computer Science & Informatics 12 THIS SLIDE INTENTIONALLY LEFT NOT QUITE BLANK

Cardiff School of Computer Science & Informatics 13 A world of resources Imagine a digital world full of biodiversity data and analytical resources like these, just as there is in bioinformatics How will users be able to find out what resources there are and how to use them in combination to answer scientific questions?

Cardiff School of Computer Science & Informatics 14 The cross-mapping problem Taxonomy 1 Vicia faba Caesalpinia crista L. Taxonomy 2 Faba faba Caesalpinia crista L. Caesalpinia bonduc (L.) Roxb. Caesalpinia crista L., p.p.

Cardiff School of Computer Science & Informatics 15 i4Life 4D4Life

Cardiff School of Computer Science & Informatics 16 Constraints and checklists (From Litchi 1) “A full name which is not a pro-parte name may not appear as both an accepted name and a synonym in the same checklist”

Cardiff School of Computer Science & Informatics 17 Persistent identifiers and change In i4Life we need to Use persistent identifiers for taxon concepts –(started in TDWG-TIP project) Link taxonomies and track change –create and maintain “cross-maps”

Cardiff School of Computer Science & Informatics 18 Joining things up: workflow systems

Cardiff School of Computer Science & Informatics 19

Cardiff School of Computer Science & Informatics 20 Workflow problems addressed Incorporation of biodiversity services in workflows (BiodiversityWorld) Authentication in a workflow environment (ASMIMA) Rich annotation of services; discovery (Ewen Orme’s PhD) Knowledge-based assistance for workflow creators (Russell McIver’s PhD) Improving the User Experience (ACJ’s main contribution to BioVeL proposal)

Cardiff School of Computer Science & Informatics 21 Andrew Jones’ interests Naming & concepts –Accurately identifying concepts –Tracking change Making scientific workflow systems usable by non-computer scientists –Hiding “programming” complexity –Helping to find resources & build workflows Environments to support collaborative scientific research –E.g. “doing” taxonomy

Cardiff School of Computer Science & Informatics 22 Future projects We research solutions for data-handling problems faced by biologists and bioinformaticians If you think you might have an interesting and challenging problem, please get in touch