Presentation is loading. Please wait.

Presentation is loading. Please wait.

Metadata For CARMEN Phillip Lord and Frank Gibson.

Similar presentations


Presentation on theme: "Metadata For CARMEN Phillip Lord and Frank Gibson."— Presentation transcript:

1 Metadata For CARMEN Phillip Lord and Frank Gibson

2 Problems “In the standard model, one collects data, publishes a paper or papers and then gradually loses the original dataset.” THE NEW KNOWLEDGE ECONOMY AND SCIENCE AND TECHNOLOGY POLICY Geoffrey Bowker, University of California, San Diego Geoffrey Bowker, University of California, San Diego

3 The need for clear metadata Most neurosciences data is relative simple in structure But often contextually complex Sometimes associated with behavioural features

4 Neuroscience spike data The raw data is just a waveform But what is the experiment for? What stimulus is the organism/tissue receiving? Even, which channel is which? The data sets being produced are (reasonably) large (10’s of Gb, or 1Tb in three months)

5 Information Extraction How do we get extract the information? http://en.wikipedia.org/wiki/Image:Brain_090407.jpg http://en.wikipedia.org/wiki/Image:ATTtelephone-large.jpg istockphoto.com

6 Multi-Author data AuthorPMIDTypeSize 1Davierwala et al16155567Synthetic_Lethality627 2Krogan et al14759368Affinity_Capture-MS164 3Hazbun et al14690591Affinity_Capture-MS3210 4Gavin et al11805826Affinity_Capture-MS3596 5Ho et al11805837Affinity_Capture-MS733 6Ito et al11283351Two-hybrid275 From Katherine James, NCL

7

8 How do we represent… Laboratory Experiments In silico Analysis Derived data

9

10

11 Joseph Whitworth http://en.wikipedia.org/wiki/Image:Joseph_whitworth.jpg http://en.wikipedia.org/wiki/Image:Screw_thread_Z%C3%A1vit_M16.jpg

12 Metadata Description of results Sample How it was generated Equipment Processing steps Expensive to capture Important to validate result Lab-book

13 The need for standards! “established by consensus and approved by a recognized body, that provides, […] rules, […] for […] the optimum degree of order in a given context” BSI - http://www.bsi-global.com/en/Standards-and-Publications/About-standards/Glossary/

14 View from microarrays Content Standard – Minimal Information MAGE -- Structure MO -- Terminology From the MGED society

15 Life science communities SocietyDomainWebsite The Genomics Standards Consortium (GCS) Genomicshttp://darwin.nox.ac.uk/gsc/ Microarray and Gene Expression Data Society (MGED) Genomicswww.mged.org Proteomics Standards Initiative (PSI) Proteomicshttp://psidev.info Metabolomics Standards Initiative (MSI) Metabolomicswww.metabolomicssociety.org Flow Cytometry experiment Community Flow Cytometry www.flowcyt.org

16

17 MINI – electrophysiology General Features Study Subject Recording Location Task Stimulus Recording Time Series Data

18 Recording Location Recording Location Structure Brain Area Slice Thickness Slice Orientation Cell Type –Cell Type co-ordintates –Location conformation

19

20 View from microarrays Content Standard – Minimal Information MAGE -- Structure MO -- Terminology From the MGED society

21 Functional Genomics Experiment (FuGE) Model of common components in science investigations, such as materials, data, protocols, equipment and software. Provides a framework for capturing complete laboratory workflows, enabling the integration of pre-existing data formats.

22 Robot Reference set of 5,000 mutant strains ‘Folate’ +-+- ‘MMS’ --++ Data curation. Functional analysis. Interactions with in silico programme. * * * Robot Screen mutants for sensitivity to damage/nutrition Part of CISBAN in a nutshell

23 CISBAN dataflow Neil Wipat, Newcastle University

24 Data Entry with SYMBA http://symba.sourceforge.net/ Allyson Lister, Newcastle University

25 Data Entry with SyMBA

26 Summary We are generating metadata “standards” for neurosciences We are following a well-trodden path from bioinformatics We adopted FuGE and have built MINI

27 Future Work More neurosciences experimental datatypes. Minimal Information about a Service –Describe analysis software as well as lab experiments. Outreach!

28 Acknowledgements MINI: Frank Gibson, Paul G Overton, Tom V Smulders, Simon R Schultz, Stephen J Eglen, Colin D Ingram, Stefano Panzeri, Phil Bream, Evelyne Sernagor, Mark Cunningham, Christopher Adams, Christoph Echtermeyer, Jennifer Simonotto, Marcus Kaiser, Daniel C Swan, Martyn Fletcher, Phillip Lord CISBAN: Anil Wipat (PI), Allyson Lister (Research Associate),

29


Download ppt "Metadata For CARMEN Phillip Lord and Frank Gibson."

Similar presentations


Ads by Google