Australian Synchrotron Data curatorship for protein crystallography Julian Adams & Richard Farnsworth

Current Technologies
Disks: SCSI, IDE, RAID
Tapes: LTO, DLT, robotic tape changers
Optical media: DVD, DVD2

Data quantities (worst case)
1 image every 3-5 seconds
20 images per minute
1,200 images per hour
28,800 images per day
Each image 15-30 MB
20 x 60 x 24 x 20 MB = 576 GB ≈ 0.6 TB per day
3 x 12 weeks x 7 days x 0.6 TB ≈ 150 TB per year; 250 TB per year allowed as the worst case
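The estimate above can be reproduced with a few lines of arithmetic. This is a minimal sketch, assuming decimal units (1 TB = 1,000,000 MB) and treating the image size as a parameter; the function names are illustrative only. At 20 MB per image the yearly total comes out near 145 TB, and at 30 MB per image near 218 TB, so the 250 TB figure appears to include some headroom (that reading is an assumption).

```python
# Worked version of the data-quantity estimate.
# Assumption (not from the slides): decimal units, 1 TB = 1,000,000 MB.

IMAGES_PER_MINUTE = 20        # one image every 3 s, the fastest rate quoted
IMAGE_SIZE_MB = 20            # mid-range of the 15-30 MB quoted
DAYS_PER_YEAR = 3 * 12 * 7    # 3 x 12 weeks x 7 days = 252 days


def images_per_day(images_per_minute=IMAGES_PER_MINUTE):
    """Images collected in 24 hours of continuous operation."""
    return images_per_minute * 60 * 24


def daily_volume_tb(image_size_mb=IMAGE_SIZE_MB):
    """Data volume per day in TB for a given image size in MB."""
    return images_per_day() * image_size_mb / 1_000_000


def yearly_volume_tb(image_size_mb=IMAGE_SIZE_MB):
    """Data volume per year in TB over 252 collection days."""
    return daily_volume_tb(image_size_mb) * DAYS_PER_YEAR


if __name__ == "__main__":
    for size_mb in (15, 20, 30):
        print(f"{size_mb} MB/image: "
              f"{daily_volume_tb(size_mb):.3f} TB/day, "
              f"{yearly_volume_tb(size_mb):.1f} TB/year")
```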

Data management
Make two copies in as short a time as practical
Create a take-home copy for the user at the end of the run
Retain a single copy onsite for 3 months
Deposit the data in a long-term storage facility
Archived data should be accessible only through requests sent back to the beamline
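The retention rules above can be expressed as a small policy check. This is a sketch only: it assumes "3 months" means 90 days, and the function and copy names are hypothetical labels rather than part of any existing beamline software.

```python
from datetime import date, timedelta

# Assumption: the 3-month onsite retention period is taken as 90 days.
ONSITE_RETENTION = timedelta(days=90)


def onsite_copy_expired(run_end: date, today: date) -> bool:
    """True once the onsite copy has been held longer than the retention period."""
    return today - run_end > ONSITE_RETENTION


def expected_copies(run_end: date, today: date) -> list[str]:
    """Copies that should exist on a given day after the run has ended."""
    copies = ["take-home", "long-term archive"]
    if not onsite_copy_expired(run_end, today):
        copies.append("onsite")
    return copies


if __name__ == "__main__":
    run_end = date(2006, 1, 1)
    print(expected_copies(run_end, date(2006, 3, 1)))
    print(expected_copies(run_end, date(2006, 5, 1)))
```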

Practicality
Data collection buffer with 1.5 TB disk
Server with TB RAID array
Users take data away on portable hard drives (FireWire, USB2) or optical media
Long-term storage: 5 years ≈ 1.3 PB per PX beamline
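A quick check of the sizing figures, as a sketch: it assumes the 250 TB per year and 0.6 TB per day estimates from the data-quantities slide and decimal units (1 PB = 1000 TB). The function names are illustrative.

```python
# Sizing check for the buffer disk and the long-term store.
# Assumptions: 250 TB/year and 0.6 TB/day taken from the data-quantity
# estimate; decimal units (1 PB = 1000 TB).

YEARLY_TB = 250
RETENTION_YEARS = 5
BUFFER_TB = 1.5
DAILY_TB = 0.6


def long_term_storage_pb(yearly_tb=YEARLY_TB, years=RETENTION_YEARS):
    """Total archive size in PB after the given number of years."""
    return yearly_tb * years / 1000


def buffer_days(buffer_tb=BUFFER_TB, daily_tb=DAILY_TB):
    """Days of continuous collection the buffer disk can absorb."""
    return buffer_tb / daily_tb


if __name__ == "__main__":
    print(f"Archive after {RETENTION_YEARS} years: {long_term_storage_pb():.2f} PB")
    print(f"Buffer holds about {buffer_days():.1f} days of data")
```

Under these assumptions the archive reaches 1.25 PB, which rounds to the 1.3 PB quoted, and the 1.5 TB buffer covers roughly two and a half days of continuous collection.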

Data collection strategy

Data rates
LTO technology can stream at up to 80 MB/s under optimum conditions
LTO2 typically achieves 8-15 MB/s
Even the worst case is (just) sufficient for our purposes
Off-site limitations: Gb/s links to VERN/GRANGENET/AARNET
Cost-prohibitive for huge data flows at the moment
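To see why the typical tape rate is only just sufficient, the sustained rate needed by the detector can be compared with the 8-15 MB/s range. This is a sketch assuming 20 images per minute and image sizes of 20 or 30 MB from the data-quantities slide; the function names are illustrative.

```python
# Compare the sustained detector output with typical tape throughput.
# Assumptions: 20 images/minute; image sizes of 20 MB (typical) or
# 30 MB (largest quoted); tape rates of 8-15 MB/s as stated above.

TAPE_LOW_MB_S = 8.0
TAPE_HIGH_MB_S = 15.0


def required_rate_mb_s(images_per_minute=20, image_size_mb=20):
    """Sustained write rate in MB/s needed to keep up with collection."""
    return images_per_minute * image_size_mb / 60


def tape_keeps_up(tape_rate_mb_s, image_size_mb=20):
    """True if the tape rate meets or exceeds the required rate."""
    return tape_rate_mb_s >= required_rate_mb_s(image_size_mb=image_size_mb)


if __name__ == "__main__":
    for size_mb in (20, 30):
        need = required_rate_mb_s(image_size_mb=size_mb)
        print(f"{size_mb} MB images need {need:.2f} MB/s; "
              f"slow tape ok: {tape_keeps_up(TAPE_LOW_MB_S, size_mb)}, "
              f"fast tape ok: {tape_keeps_up(TAPE_HIGH_MB_S, size_mb)}")
```

With 20 MB images the required rate is about 6.7 MB/s, below the slowest typical tape rate; with 30 MB images it rises to 10 MB/s, which only the upper part of the range can sustain.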