Presentation is loading. Please wait.

Presentation is loading. Please wait.

Case studies of practical data management Ben Kreunen Technical Support Officer University Digitisation Service.

Similar presentations


Presentation on theme: "Case studies of practical data management Ben Kreunen Technical Support Officer University Digitisation Service."— Presentation transcript:

1 Case studies of practical data management Ben Kreunen Technical Support Officer University Digitisation Service

2 Putting Metadata to Work Ben Kreunen Technical Support Officer University Digitisation Service

3 Putting metadata to work What? Print collection: –Image management tool created as a spin off from the data used to scan Thesis on demand –Incorporating administrative data into the scanning process to improve business processes Asset stocktake (pilot) –From THEMIS to iPhone and back Creating a contact list from an org chart (concept) –Linking business processes and data

4 Putting metadata to work Why? Time is money –Save 10 seconds on a task performed 2,500 times and you save 1 working day Doing repetitive tasks sucks Doing repetitive tasks again sucks Reducing mental fatigue

5 Putting metadata to work How? Reduce the time it takes to do stuff –Automatically enter related data –Collect data that’s been entered somewhere else –Select from lists rather than type –Re-use data that’s been entered before –Script repetitive processes Simplify interface design –Only show the data you need at the time –Visual feedback

6 Putting metadata to work Who? People who manage the process People who DO the process People with technical skills Working together!

7 Putting metadata to work Simple tools, clever connections http://www.philohome.com/panobot2/panobot2.htm

8 Putting metadata to work Metadata: Data about data

9 Putting metadata to work How do we manage Metadata? Data about data

10 Putting metadata to work How do we manage data? Data about data –Excel? –Access? –Database?

11 Putting metadata to work How do we manage relational data? Data about data –Excel? –Access? –Relational database?

12 Putting metadata to work How do we manage relational data efficiently? Data about data –Excel? –Access? –Relational database?

13 Putting metadata to work What is data management? Making lists of stuff Finding things in lists of stuff Sharing lists of stuff Editing lists of stuff Combining lists of stuff etc….

14 Print Collection Re-using data Open source tools Usability

15 Putting metadata to work Data Requirement There must be only one ID number to link each image to a catalogue record

16 Putting metadata to work Before ~3,500 images on 4 external HDDs without an index (2Tb) File names based on a partial accession number Online images served via KE EMU Duplicate accession number exist Number of duplicate IDs not known ~4,500 prints to be scanned

17 Putting metadata to work Preparation for scanning Prepare data –Export data from EMU –Create separate database to analyse/prepare data –Locate duplicate records (12) –List existing images and calculate ID numbers –Locate invalid file names (7) –Copy master files to network storage (1Tb)

18 Putting metadata to work Scanning requirements Previous images scanned with colour chart Getty “standard” ie. multiple versions at different sizes All versions of images have colour chart (archive master has colour chart, other versions should be cropped) Archive master files = 50% of total file size

19 Putting metadata to work Planning What is the best way to capture a master image and cropped version? Should a cropped version be created of the existing images?

20 Putting metadata to work Planning What is the best way to capture a master image and cropped version? Should a cropped version be created of the existing images? Do we need to create a cropped version? –Saves time digitising –Reduces storage costs ~40%

21 Putting metadata to work Planning What is required to crop images on demand? Is it possible? Can a standard computer do it? What data do we need? How do we collect it? How are the images used? How are requests processed?

22 Putting metadata to work Planning What is required to crop images on demand? Is it possible? Mini Project ImageMagick + coordinates + batch file = automated cropping on demand Hack techniques to collect data Raised awareness of other possible uses

23 Putting metadata to work Acquiring coordinates for cropping

24 Putting metadata to work Scanning issues ID numbers of prints delivered is random –Locating 1 ID number in a list of 8,000…

25 Putting metadata to work What broke Not all ID numbers are unique –modification of naming schema required “Modified” scanning procedure to deal with annoyances was prone to the occasional error –error, cause and solution identified by scanner operator Image from previous project did not match ID number

26 Putting metadata to work Helping others A small step to change our project work into a tool to improve management of image collection –Crop, resize and format images on demand –Fast response to deal with requests for images –Images more secure –Images accessed using familiar identifiers

27 Putting metadata to work Helping others Runtime version of database to be given to collection manager Total software cost: $0

28 Putting metadata to work Helping others

29 Putting metadata to work Helping others

30 Putting metadata to work Helping others Can we browse the images scanned to date?

31 Putting metadata to work Helping others That’s great... Can you do the same thing for everything else you’ve scanned? (currently 250,000 files)

32 Thesis on demand service Automating administrative processes Sharing administrative data Minimising data entry

33 Putting metadata to work About the service Copy of a thesis is requested by a researcher/ academic library for research purposes Thesis is scanned (for a fee) and delivered to client –Print –CD –Cloudstor Recently relocated from the Baillieu to UDS

34 Putting metadata to work Challenges Incorporating administrative data and processes Multiple time frames depending on delivery Variable timing for delivery of theses –accessed locally or from offsite archives Process is now split across 2 departments

35 Putting metadata to work The Request

36 Putting metadata to work Data entry Thesis details –Scan barcode –Automatic collection of required and optional metadata Delivery method – check box –Email address if Cloudstor Date request received Urgency – check box

37 Putting metadata to work

38 Re-using data Date item is to be scanned by calculated from: –Date received –Delivery method –Urgency Work list sorted by “completion status” and “date due” Output filenames automatically generated from metadata (author, year)

39 Putting metadata to work Re-using data File delivery is automated as much as possible: –Copy and rename file to pickup folder –Generate email message to notify Special Collections and Repository team –Load Cloudstor interface if selected as the delivery method Entries for each form field generated and copied to the clipboard Upload form completed with 8 mouse clicks

40 Putting metadata to work Re-using data

41 Putting metadata to work Re-using data

42 Putting metadata to work What broke Client queries could not be answered immediately because of the split –no direct access to our data –daily export of a PDF report enables most queries to be dealt with Not all theses have barcodes Not all theses are catalogued

43 Putting metadata to work What broke

44 Putting metadata to work Outcomes Improved client communication Improved communication between departments Reduced data entry Improved quality of metadata Simplified reporting based on administrative data

45 THEMIS Asset stocktake Local management with centralised data Simplifying data entry Synchronising authoritative data

46 Putting metadata to work Issues Error 401.303 Text box length exceeded. Refer to KB1237 for assistance with this error

47 Putting metadata to work Issues Data in THEMIS is out of date No direct access to update THEMIS –Generates significant workload for 2 organisations Asset data from other sources (CMDB) is out of date Previous updates incomplete

48 Putting metadata to work The Key(?) Excel “wizard” that can be imported into THEMIS

49 Putting metadata to work Useability Where is the data I need to see?

50 Putting metadata to work The Key Not user friendly BUT Consistent data structure for receiving and updating data Create a local copy for collecting current data Populate with “static” data from THEMIS Compare “live” data with THEMIS Export current data to THEMIS

51 Putting metadata to work The Pilot Filemaker 12 database to handle data Accessed via Filmaker Go on iPhone Integrate with CNS barcode app to scan barcodes Streamline onsite data collection

52 Putting metadata to work Simplify data display

53 Putting metadata to work Potential Spin Offs Re-use data for local asset management processes Warn me X weeks before a computer is due for replacement How many computers are due for replacement in X months? Auto-complete asset management forms e.g. disposal

54 Creating a contact directory from an org chart “Hacking” centralised data Linking data management to process management Data visualisation

55 Putting metadata to work The concept An org chart is a list of positions linked to people A contact list is a list of people linked to contact data The people who maintain org charts are often the same people responsible for local contact lists What if I want a list of people sorted by where they work?

56 Putting metadata to work The concept DO NOT update contact details locally –Individuals must update their details in THEMIS Create links for Positions in org chart and link reporting lines Link positions to usernames and lookup other details Export data for viewing –GRAPHML for Org Chart –XML, HTML or PDF for contact list

57 Putting metadata to work Challenge It is technically possible for THEMIS to export an XML data source for re-use (Find an Expert) For various reasons it is not practical at this point in time How do I collect centrally managed contact information efficiently? –Active Directory?

58 Putting metadata to work Raw data: It’s not pretty, but it’s useable

59 What I’ve learnt

60 Putting metadata to work Many people know the problems but without a technical solution nothing happens Working smarter requires everyone to work together –Managers, works, technical people Know when to give up Working smarter is contagious IT support ≠ Technical support

61 © Copyright The University of Melbourne 2009 Discussion/ Questions


Download ppt "Case studies of practical data management Ben Kreunen Technical Support Officer University Digitisation Service."

Similar presentations


Ads by Google