Download presentation
Presentation is loading. Please wait.
1
Metadata For CARMEN Phillip Lord and Frank Gibson
2
Problems “In the standard model, one collects data, publishes a paper or papers and then gradually loses the original dataset.” THE NEW KNOWLEDGE ECONOMY AND SCIENCE AND TECHNOLOGY POLICY Geoffrey Bowker, University of California, San Diego Geoffrey Bowker, University of California, San Diego
3
The need for clear metadata Most neurosciences data is relative simple in structure But often contextually complex Sometimes associated with behavioural features
4
Neuroscience spike data The raw data is just a waveform But what is the experiment for? What stimulus is the organism/tissue receiving? Even, which channel is which? The data sets being produced are (reasonably) large (10’s of Gb, or 1Tb in three months)
5
Information Extraction How do we get extract the information? http://en.wikipedia.org/wiki/Image:Brain_090407.jpg http://en.wikipedia.org/wiki/Image:ATTtelephone-large.jpg istockphoto.com
6
Multi-Author data AuthorPMIDTypeSize 1Davierwala et al16155567Synthetic_Lethality627 2Krogan et al14759368Affinity_Capture-MS164 3Hazbun et al14690591Affinity_Capture-MS3210 4Gavin et al11805826Affinity_Capture-MS3596 5Ho et al11805837Affinity_Capture-MS733 6Ito et al11283351Two-hybrid275 From Katherine James, NCL
8
How do we represent… Laboratory Experiments In silico Analysis Derived data
11
Joseph Whitworth http://en.wikipedia.org/wiki/Image:Joseph_whitworth.jpg http://en.wikipedia.org/wiki/Image:Screw_thread_Z%C3%A1vit_M16.jpg
12
Metadata Description of results Sample How it was generated Equipment Processing steps Expensive to capture Important to validate result Lab-book
13
The need for standards! “established by consensus and approved by a recognized body, that provides, […] rules, […] for […] the optimum degree of order in a given context” BSI - http://www.bsi-global.com/en/Standards-and-Publications/About-standards/Glossary/
14
View from microarrays Content Standard – Minimal Information MAGE -- Structure MO -- Terminology From the MGED society
15
Life science communities SocietyDomainWebsite The Genomics Standards Consortium (GCS) Genomicshttp://darwin.nox.ac.uk/gsc/ Microarray and Gene Expression Data Society (MGED) Genomicswww.mged.org Proteomics Standards Initiative (PSI) Proteomicshttp://psidev.info Metabolomics Standards Initiative (MSI) Metabolomicswww.metabolomicssociety.org Flow Cytometry experiment Community Flow Cytometry www.flowcyt.org
17
MINI – electrophysiology General Features Study Subject Recording Location Task Stimulus Recording Time Series Data
18
Recording Location Recording Location Structure Brain Area Slice Thickness Slice Orientation Cell Type –Cell Type co-ordintates –Location conformation
20
View from microarrays Content Standard – Minimal Information MAGE -- Structure MO -- Terminology From the MGED society
21
Functional Genomics Experiment (FuGE) Model of common components in science investigations, such as materials, data, protocols, equipment and software. Provides a framework for capturing complete laboratory workflows, enabling the integration of pre-existing data formats.
22
Robot Reference set of 5,000 mutant strains ‘Folate’ +-+- ‘MMS’ --++ Data curation. Functional analysis. Interactions with in silico programme. * * * Robot Screen mutants for sensitivity to damage/nutrition Part of CISBAN in a nutshell
23
CISBAN dataflow Neil Wipat, Newcastle University
24
Data Entry with SYMBA http://symba.sourceforge.net/ Allyson Lister, Newcastle University
25
Data Entry with SyMBA
26
Summary We are generating metadata “standards” for neurosciences We are following a well-trodden path from bioinformatics We adopted FuGE and have built MINI
27
Future Work More neurosciences experimental datatypes. Minimal Information about a Service –Describe analysis software as well as lab experiments. Outreach!
28
Acknowledgements MINI: Frank Gibson, Paul G Overton, Tom V Smulders, Simon R Schultz, Stephen J Eglen, Colin D Ingram, Stefano Panzeri, Phil Bream, Evelyne Sernagor, Mark Cunningham, Christopher Adams, Christoph Echtermeyer, Jennifer Simonotto, Marcus Kaiser, Daniel C Swan, Martyn Fletcher, Phillip Lord CISBAN: Anil Wipat (PI), Allyson Lister (Research Associate),
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.