Migrating Entomologys Collection Management System to EMu Adrian Hine.

Slides:



Advertisements
Similar presentations
Testing Relational Database
Advertisements

Database vocabulary. Data Information entered in a database.
StEPS at EIAWhere We Are Now Paula Weir and Sue Harris Energy Information Administration, U.S. Department of Energy ICES3 Topic Contributed Session: Generalized.
Node Lessons Learned James Hudson Wisconsin Department of Natural Resources.
Depends entirely on support from the user base Many technical issues still need to be resolved Long term development horizon Proposal for a Simplified.
Key-word Driven Automation Framework Shiva Kumar Soumya Dalvi May 25, 2007.
The Caught and Coloured website: its EMu origins Alex Chubaty – Collection Information Systems Craig Churchill – IT Software Development Museum Victoria.
SOFTWARE MAINTENANCE 24 March 2013 William W. McMillan.
NYBG + KE EMu The New York Botanical Garden + KE EMu Melissa Tulig Botanical Information Management.
NHM Science Strategy 2013 … Challenges - Digitise 20,000,000 specimens within 5 years Ian Owens, Dir. Sci.
Loans Management at the Natural History Museum Dave Smith UK EMu Users Group Meeting th April 2007.
Collections Management Museums EMu – Data Cleaning with EMu Data Cleaning with EMu Warren Hindley.
Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Wet Specimen Collections and Alcohol Management A presentation by Giselle Stanton Collection Information - Standards and Support Collection Information.
Modules, Hierarchy Charts, and Documentation
Chapter 1 Introduction to Databases
Collections Management Museums Reporting in KE EMu.
Collections Management Museums EMu 3.1 / 3.2 – New Features EMu 3.1 / 3.2 New Features Bernard Marshall Chief Technology Officer KE Software.
Research Methods & Data AD140Brendan Rapple 2 March, 2005.
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Chapter 3 – Agile Software Development 1Chapter 3 Agile software development.
Chapter 8: Systems analysis and design
Avalanche Internet Data Management System. Presentation plan 1. The problem to be solved 2. Description of the software needed 3. The solution 4. Avalanche.
AS Level ICT Selection and use of appropriate software: Interfaces.
Introduction to Microsoft Access 2003 Mr. A. Craig Dixon CIS 100: Introduction to Computers Spring 2006.
Part 1. Persistent Data Web applications remember your setting by means of a database linked to the site.
Data and its manifestations. Storage and Retrieval techniques.
Expansive learning actions and expressions of transformative agency - A Change Laboratory intervention in a academic Library.
Get your hands dirty cleaning data European EMu Users Meeting, 3rd June. - Elizabeth Bruton, Museum of the History of Science, Oxford
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
Database What is a database? A database is a collection of information that is typically organized so that it can easily be storing, managing and retrieving.
Databases. What is a database?  A database is used to store data. The word DATA is actually Latin for FACTS. A database is, therefore, a place, or thing.
Improving Eligibility Documents November, Improving Data Collection The State Office of AIDS (OA) is now working with providers to improve the quality.
Copyright © Prentice Hall Database Management Systems Chapter 13 Getting Data Together.
ITGS Databases.
Maintaining a Database Access Project 3. 2 What is Database Maintenance ?  Maintaining a database means modifying the data to keep it up-to-date. This.
E.g.: MS-DOS interface. DIR C: /W /A:D will list all the directories in the root directory of drive C in wide list format. Disadvantage is that commands.
Database Management Systems (DBMS)
Introduction to KE EMu
EMu in the NHM: A personal perspective Darrell J. Siebert Dept. of Zoology.
Methods and Techniques for Integration of Small Datasets September 13-14, 2005 St. Louis, Missouri Sponsored by the U.S. Department of Housing and Urban.
Data Migration Training Page 1 KE EMu Data Migration
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
Flat Files Relational Databases
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
Extending the biogeographical model Africamuseum 6 (7?) June 2013.
The William and Linda Steere Herbarium The New York Botanical Garden
Darrell Siebert The MOA Programme: Did we really do that?
Information Integration 15 th Meeting Course Name: Business Intelligence Year: 2009.
Adrian Hine, Data Manager (Entomology) Natural History Museum, London Making a molehill out of a mountain – is it achievable to clean 12 million EMu records?
Of An Expert System.  Introduction  What is AI?  Intelligent in Human & Machine? What is Expert System? How are Expert System used? Elements of ES.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
Or how to make 11=1 Dr Penny Berents Australian Museum EMu Project.
PREPARED BY: PN. SITI HADIJAH BINTI NORSANI. LEARNING OUTCOMES: Upon completion of this course, students should be able to: 1. Understand the structure.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 1 Database Systems.
Darrell Siebert Managing EMu in the NHM: Will Botanists Ever Agree with Zoologists? Will Palaeontologists Ever Agree with Anybody?
Overview In this tutorial you will: learn what an e-portfolio is learn about the different things e-portfolios may be used for identify some options for.
Program Design. Simple Program Design, Fourth Edition Chapter 1 2 Objectives In this chapter you will be able to: Describe the steps in the program development.
Strategies for blended learning in an undergraduate curriculum Benjamin Kehrwald, Massey University College of Education.
ICT IGCSE Theory – Revision Presentation The Systems Life Cycle Chapter 7: The Systems Life Cycle Analysis 7.2 Design 7.3 Development.
Databases.
DSS & Warehousing Systems
CIS 155 Table Relationship
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
Introducing the online professional community for Clinical Research
Aleph Implementation Where we are in the process Data Review
Presentation transcript:

Migrating Entomologys Collection Management System to EMu Adrian Hine

Programme Timeline Oct - DecJan - MarApr - JunJul - SepOct - DecJan - MarApr - JunJul - SepJan - MarApr - JunJul - SepOct - Dec Mineralogy Entomology Palaeontology Zoology Botany Separate sub-projects owing to different requirements and very different legacy systems between departments. For Entomology initial planning started in March We have gone live in February 2007.

Legacy System - Paradox Core system was the Collection Management System written in- house on a Paradox for DOS Database in In addition many more specimen and/or taxonomic datasets in Access. AdvantagesDisadvantages Simplicity to enter data on one/two screens. Largely non-relational design (unnormalised data). Stable and reliable system running for more than a decade. Data input into free text fields (no constraints through drop downs/lookups or validation). Requires minimal training to use it.Unsupported programming language (PAL). Good departmental user knowledge built up over this time. Limitations as a Collection Management tool, especially tracking material as it moves around the museum - Location is simple text field. Limited tracking of changes to taxonomic names. Relatively few fields to record data.

Legacy System - Paradox

Legacy System - Datasets Four main kinds of datasets; 1) Accession Register (3,300 records) 2) Specimen level (231,000 records) 3) Collection Index (798,000 records) 4) Taxonomic (55,600 records) All these have different data models. In total 47 datasets were migrated into EMu. The data structures were varied and all had now to fit into the EMu data model.

Data Mapping It was a huge task to reconcile the numerous and varied datasets into a single data repository for the first time ever in the department (and eventually the museum). There were a range of data models present between different datasets. These had to be mapped to the EMu model. Probably the most critical stage of the process was data mapping of our existing data model to the EMu model. Numerous EMu records can be generated from a single Paradox CI record and some quite complex logic was necessary to achieve this. In particular for each CI record four taxonomy records could be generated and these all had be correctly created and attached to one another.

EMu Records for Collection Index Taxonomy Agabus montanus Collection Index Agabus montanus Catalogue (Index Lot) Agabus montanus Catalogue (Index Lot) Agabus montanus Synonym Locations Dry Collection Taxonomy Ilybius melanorcornis Taxonomy Ilybius montanus Taxonomy Agabus melanocornis Parties Linnaeus Bibliography Systema naturae Locations Spirit Collection Original Combination

EMu Catalogue Our main record types in the Catalogue are; 1) Specimen – Specimen Level Datasets 2) Preparation – Specimen level Datasets 3) Acquisition – Accession Register 4) Index Lot - Collection Index Focus on the Index Lot record type and its relationship to another module named the Collection Index.

EMu Index Lot & Collection Index A Catalogue Index Lot is the presence of a species (but can be other taxonomic rank) at a particular location in the museum. The species may be present at multiple locations around the museum in different collections, hence have multiple Index Lot records in the Catalogue. Disparate nature of the material. To be able to mange the material more effectively we wanted to view all Index Lot records together. A new module was created for the NHM client to achieve this – the Collection Index module. There is a single Collection Index record for each species. This record brings together all Index Lots for the same species to be viewable and editable in a single view. Also have functionality built into the client to display all Synonyms of that name.

EMu Index Lot & Collection Index Taxonomy Agabus montanus Collection Index Agabus montanus Filed as Name Catalogue (Index Lot) Agabus montanus Taxonomy Agabus melanocornis Catalogue (Index Lot) Agabus montanus Catalogue (Index Lot) Agabus melanocornis Synonym Locations Spirit Collection Locations Accessory Collection Locations Main Dry Collection Displays Synonym Lots as well

EMu - Collection Index

EMu - Index Lot

What Now? Now migration complete, considerable work to do. Data clean-up – now data in same depository a whole raft of new issues have now arisen e.g. duplication of records sitting in separate datasets – legacy of non-relational model previously. Training – we have a large body of staff. Core Collection management staff of ca. 30. Up to 30 others to train. The department is now in the process of learning the new system – all core staff have undertaken a two day training course followed up by one-one-one sessions. It will take time for everyone to reach a good level of expertise with the new system. Complex system compared to the old Paradox system. The key to success of EMu is ensuring there is an excellent skill base across the department.

Conclusion Has been a challenging process migrating our data into single data depository – it is not an easy process!! However there are considerable advantages to having our data in one place. The process has inevitably thrown up many data clean-up issues that we will be dealing with a long time to come. User training is critical to the success of the project. EMu is substantially more complex than the old system and we need to invest a lot of time and energy bringing everybody in the department to a good standard of competency. We are hopeful that EMu will provide a significantly more effective system to manage our collections at the museum.