Peabody Museum of Natural History KE EMu at the Peabody Museum of Natural History
The four YPM buildings Peabody (YPM) Environmental Science Center (ESC) 175 Whitney (Anthropology) Geology / Geophysics (KGL)
Confluence of YPM, KGL, ESC
Peabody Collections Counts & Functional Cataloguing Unit Anthropology 325,000 Lot Botany 350,000 Individual Entomology 1,000,000 Individual Invertebrate Paleontology 300,000 Lot Invertebrate Zoology 300,000 Lot Mineralogy 35,000 Individual Paleobotany 150,000 Individual Scientific Instruments 2,000 Individual Vertebrate Paleontology 125,000 Individual Vertebrate Zoology 185,000 Lot / Individual 2.7 million functional units => ~11 million items
Peabody Collections Functional Units Databased Anthropology 325,000 90 % Botany 350,000 1 % Entomology 1,000,000 1 % Invertebrate Paleontology 300,000 55 % Invertebrate Zoology 300,000 20 % Mineralogy 35,000 85 % Paleobotany 150,000 60 % Scientific Instruments 2,000 100 % Vertebrate Paleontology 125,000 60 % Vertebrate Zoology 185,000 95 % 930,000 of 2.7 million => 35 % overall
VZ Kristof Zyskowski (Vert. Zool. - ESC) Greg Watkins-Colwell
(Scientific Instruments – KGL ) HSI Shae Trewin (Scientific Instruments – KGL )
VP Mary Ann Turner (Vert. Paleo. – KGL / YPM)
(Anthro. - YPM / 175 Whitney) Maureen DaRos (Anthro. - YPM / 175 Whitney)
EMu Planning 2002 Collections Computerization Committee deliberations 2003 Q2 on-site demonstrations from CM vendors 2003 Q3+ talk with existing clients of CM vendors
EMu Planning Lots of other folks… 2002 Collections Computerization Committee deliberations 2003 Q2 on-site demonstrations from CM vendors 2003 Q3+ talk with existing clients of CM vendors Lots of other folks…
Classic Argus
Old to the New Argus Module EMu Module Object Conservation Object Catalogue Object Conservation Object Internal Movements Lexicon Taxonomy Activity Accession Lots Activity Loans Site Collecting Events Site Site Constituent Parties Publication Bibliography
Prior Argus Experience Single CMS shared by all Departments for about a decade - YPM staff cohabiting internally and realizing commonalities - streamlined system administration - ease strain of future migration path
EMu Implementation 01/2004 Data mapping starts, contract refinement 02/2004 Major data parsing & cleaning starts in Argus 07/2004 Contracts signed, Vancouver starts 08/2004 Servers configured, YPM-test environment running 09/2004 Begin iterative screen design & tests with tiny datasets 10/2004 Draft YPM-live environment added to server 10/2004 One week of KE training for collections staff 11/2004 Staff working with draft YPM-live environment 02/2005 Larry to Vancouver to work with Negar 03/2005 YPM-live screen design & data mappings stabilizing 04/2005 Last uploads to Argus, prep for final data migration 05/2005 Final data migration, all curatorial disciplines 1.5 years total
EMu Implementation 01/2004 Data mapping starts, contract refinement 02/2004 Major data parsing & cleaning starts in Argus 07/2004 Contracts signed, Vancouver starts 08/2004 Servers configured, YPM-test environment running 09/2004 Begin iterative screen design & tests with tiny datasets 10/2004 Draft YPM-live environment added to server 10/2004 One week of KE training for collections staff 11/2004 Staff working with draft YPM-live environment 02/2005 Larry to Vancouver to work with Negar 03/2005 YPM-live screen design & data mappings stabilizing 04/2005 Last uploads to Argus, prep for final data migration 05/2005 Final data migration, all curatorial disciplines
Local Emu web site
19 trainees EMu training session
Even with best intentions the ‘EMu Big Picture’ can remain a bit fuzzy…
Ian Turnbull Negar Samenirad Clarity… Ian Turnbull Negar Samenirad
EMu Implementation 01/2004 Data mapping starts, contract refinement 02/2004 Major data parsing & cleaning starts in Argus 07/2004 Contracts signed, Vancouver starts 08/2004 Servers configured, YPM-test environment running 09/2004 Begin iterative screen design & tests with tiny datasets 10/2004 Draft YPM-live environment added to server 10/2004 One week of KE training for collections staff 11/2004 Staff working with draft YPM-live environment 02/2005 Larry to Vancouver to work with Negar 03/2005 YPM-live screen design & data mappings stabilizing 04/2005 Last uploads to Argus, prep for final data migration 05/2005 Final data migration, all curatorial disciplines 06/2005 Safe to jump into the water…
EMu Users Group a.k.a. Weekly Therapy - started meetings at go-live - topic du jour as a kick off - then open season on issues - driven by users' experiences Specimens
Some Miscellaneous Statistics 1,200,000 Argus records => 2,100,000 EMu records 3 days for Argus server to spit out the final migration files 12 days for EMu server to ingest the final migration files held a final migration “race” between the KE & YPM servers lost few weeks to Vancouver server hiccups, office move, etc.
Invertebrate Zoology Lourdes Rojas Eric Lazo-Wasem Dan Drew operating in Emu - IZ Chomping at the bit… Invertebrate Zoology Lourdes Rojas Eric Lazo-Wasem Dan Drew
partitioning backlogged multi-species samples… operating in Emu - IZ partitioning backlogged multi-species samples…
Site record for lot operating in Emu - IZ
Site record for lot RTF label operating in Emu - IZ
operating in Emu - IZ Site record for lot RTF label Datamax I-4406 (thermal)
operating in Emu - IZ Site record for lot RTF label Datamax I-4406 (thermal) …label(s) into jar
Excerpts from YPM Modules Catalogue - adjustments for speed Taxonomy - Lexicon tab
Catalogue ANT
Catalogue HSI
Catalogue IZ
DET – IZ4 Cloned the grid elements for: "Current" Identification Yowza ! An identification updater is coming to EMu ! DET – IZ4 Cloned the grid elements for: "Current" Identification "Filed Asth" Identification into additional, separate fields
Tax – higher
Tax - lex
Tax – lex expand1
Tax – lex expand2
Tax – lex expand2 - levels of children to display is regulated by a registry entry - assigning the parent sets the full path back up the tree - optional viewing mechanism grafted onto the base module PRINCIPLE #1 = LEAVE EXTANT MODULE INTACT !
taxontree1
taxontree2
taxontree2 Other external resources? Drag and drop reparenting? Alternate classifications? Autopopulating Luts fields?
Backend Script Tinkering = Creative Destruction JPEG 2000 and EMu - protem freeware fix Admin Module - the proverbial black box
http://www.jpeg.org/jpeg2000 non-proprietary standard mathematically lossless lossy (much smaller files) "perceptually" lossless desktop sofwtare emerging
http://www.fnordware.com/j2k
JP2 spicebush
JP2 spicebush with J2K
JP2 spicebush with J2K and tail target
JP2 spicebush tails with file sizes 1.54 mB 15 kB
JP2 spicebush tails with moral lossless JP2 => circa 30-40% original TIFF size lossy JP2 => go to circa 1%-5%, depending on purpose
Reed Beaman1, Nico Cellinese1, Bryan Heidorn2, Ashley Green1, Youjun Guo1, Michael Donoghue1, Barbara Thiers3, Heather Rolan3 & Melissa Tulig3 1Peabody Museum of Natural History, Yale University 2Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign 1The New York Botanical Garden Funded by the US National Science Foundation
Digital Darkroom TTI 30 x 40 inch vacuum easel copy stand UV free, minimal IR lighting Hardware colorimetric calibration Apple G5 workstation 3.2 Tbytes RAID storage and tape library backup Pipeline to image processing web services Lynn Jones (Botany)
261 kb 1,302 kb 5,166 kb 62,640 kb
JP2 - without oops… no thumbnail
JP2 - coding find imagedir –name *.jp2 –mtime -2 –print loop on the matches and test to see which recently loaded JP2 files are missing a thumbnail JPG, or which JP2 files have been modified more recently than their existing thumbnail JPG ; then build filenames for any qualifying target JPGs ; execute script several times per hour from cron jasper –f match –F tempfile convert tempfile –resize 90x90 target
JP2 - without
JP2 - with
JP2 – size matters Ask your EMu server what it might prefer a hefty TIFF… or a ballet of tiny JP2s ?
Scripting a data load
Save to simple text format Scripting a data load Save to simple text format
Save to simple text format Scripting a data load Save to simple text format Script to convert to Texpress input
Save to simple text format Texpress 'fidsbrief' Scripting a data load Save to simple text format Script to convert for Texpress
Save to simple text format Texpress 'fidsbrief' Scripting a data load Save to simple text format Texpress grammar file Script to convert for Texpress
Save to simple text format Texpress 'fidsbrief' Scripting a data load Save to simple text format Texpress grammar file Script to convert for Texpress
Send output to location of your choice (e.g., Admin Results window)
texload -ch -ddatafile -ggrammarfile ecatalogue Admin Send output to location of your choice (e.g., Admin Results window) texload -ch -ddatafile -ggrammarfile ecatalogue texload -as -ddatafile -ggrammarfile ecatalogue
Local catalog service
Ornis
Paleoportal
Cope and labels Cope MacClintock (Inv. Paleo.)
another Happy Camper Cope chopping
finis