Cynthia Parr Phenotype RCN NESCent 25 February 2013.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

1 Summary Slides from Enhancing Organism Based Disease Knowledge Using Biological Taxonomy, and Environmental Ontologies Ken Baclawski Northeastern University.
International Barcode Of Life Initiative
How to publish genomic Data papers based on BOL data - Biodiversity Data Journal Lyubomir Penev Bulgarian Academy of Sciences & Pensoft Publishers ViBRANT.
Don’t make me think Biodiversity data publishing made easy Vince Smith, Alice Heaton, Laurence Livermore, Simon Rycroft, Ben Scott & Lyubomir Penev* The.
WP3 Biomapping results to date WP3: NRM, CDF, CEFAS, DINARA, WCS Additional input: WP1, AquaMaps workgroup.
The Data Lifecycle and the Curation of Laboratory Experimental Data Tony Hey Corporate VP for Technical Computing Microsoft Corporation.
University of Illinois Visualizing Text Loretta Auvil UIUC February 25, 2011.
Publish or perish? Linking Scratchpads and the new Biodiversity Data Journal for streamlining publication of botanical data D.N Koureas 1, L. Penev 2 &
Making small data big! The Biodiversity Data Journal (BDJ) Lyubomir Penev, Teodor Georgiev, Pavel Stoev, David Roberts, Vincent Smith ViBRANT.
Gene Ontology John Pinney
Corals and sea anemones on line: a functioning biodiversity database D. G. Fautin R. W. Buddemeier University of Kansas: Department of Ecology and Evolutionary.
Global Alignment and Collaboration Jo
Presented by: Charles Pallandt Title: Managing Director EMEA Academic & Governmental Markets Date: April 28 th, Turkey “Driving Research Excellence.
Data Conservancy: A Life Sciences Perspective Sayeed Choudhury Johns Hopkins University
1 Enriching UK PubMed Central SPIDER launch meeting, Wolfson College, Oxford Paul Davey, UK PubMed Central Engagement Manager.
Cynthia Parr Species Pages Group GBIF Briefing 11 Aug 2010.
Improved Cancer Risk Assessment Using Text Mining Ilona Silins 1, Anna Korhonen 2, Johan Högberg 1, Lin Sun 2 and Ulla Stenius 1 1 Institute of Environmental.
An On-line Atlas of Marine Diversity and a growing inventory of others.
Link yourself or perish? PhytoKeys, the next generation journal in systematic botany Lyubomir Penev 1, W. John Kress 2, Sandra Knapp 3, De-Zhu Li 4, Susanne.
Roles and Goals Greg Riccardi. iDigBio People University of Florida o Larry Page, Jose Fortes, Pamela Soltis, Bruce McFadden, Renato Figueiredo, Reed.
Connecting Repositories Zdenek Zdrahal Knowledge Media Institute The Open University, UK UNESCO, Paris, 26 February 2013.
Fourth Annual Summit | Feb | Tucson, AZ Scratchpads for community involvement for natural history collections Dr Dimitris Koureas Biodiversity.
Drivers for a PRAGMA Biodiversity Science Expedition Reed Beaman Florida Museum of Natural History University of Florida.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
Advances in Technology and CRIS Nikos Houssos National Documentation Centre / National Hellenic Research Foundation, Greece euroCRIS Task Group Leader.
Winners and Losers in the Future Ocean Insights from Millions of Samples Rainer Froese IFM-GEOMAR, Kiel, Germany EDIT Symposium 18th January
Making small data big: The Biodiversity Data Journal (BDJ) Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel E. Stoev 2,3, David M. Roberts 4 & Vincent.
Practical interoperability across semantic stores of data for blah blah
Beyond a Data Portal: A Collaborative Environment for the Deep Carbon Science Communities Han Wang, Yu Chen, Patrick West, John Erickson, Xiaogang Ma,
Analysis Environments For Scientific Communities From Bases to Spaces Bruce R. Schatz Institute for Genomic Biology University of Illinois at Urbana-Champaign.
Scratchpads Publication Module - A paradigm shift in publishing RBG Kew, Seminar,
Introducing Encyclopedia of Life version 2 International, Personal, Re-usable data Cynthia Parr National Museum of Natural History Smithsonian Institution.
At the frontline of publishing in systematic zoology: A presentation of ZooKeys Lyubomir Penev 1, Terry Erwin 2, Jeremy Miller 3 1 Pensoft Publishers,
The Pensoft Journal System and XML-based workflow Lyubomir Penev Life and Literature Conference, Chicago 2011 ViBRANT Virtual Biodversity.
Key Components and Urgent Needs of the Global Species Information System Rainer Froese IFM-GEOMAR.
Patrick Leary 23 October, 2008 TDWG Fremantle Experiences With Species Profile Model.
The application of phenotype and environment ontologies to Natural History Collections Rutger Vos.
Progress since the February 2005 London DNA Barcode of Life Conference Scott Miller, Chair Consortium for the Barcode of Life Smithsonian Institution.
IPlant cyberifrastructure to support ecological modeling Presented at the Species Distribution Modeling Group at the American Museum of Natural History.
Challenges and Opportunities for Academic Libraries Collaborative Imperatives to Support Collections, Digital Initiatives, and New Services for a Changing.
Biodiversity Data Journal: mobilization, reuse and integration of small data Lyubomir D. Penev 1,3, Teodor A. Georgiev 3, Pavel E. Stoev 2,3, Jordan Bisserkov.
29-30 October, 2006, Estonia 1 IST4Balt Information analysis using social bookmarking and other tools IST4Balt Information analysis using social bookmarking.
Scratchpads The virtual research environment for biodiversity data Simon Rycroft, Dave Roberts, Vince Smith, Alice Heaton, Katherine Bouton, Laurence Livermore,
Encyclopedia of Life Established May 2007 First version of portal went online Feb year goals –Assemble infinitely expandable web pages for all.
Encyclopedia of Life Motivating Public Enthusiasts and Expert Scientists to Document the World’s Species Cynthia Parr, Dana Rotman, Jenny Preece, Derek.
Sara E. Richardson Calit2 Summer Undergraduate Research Scholarship Program Advisor: Jurgen Schulze Ivl.calit2.net/wiki CAMERA is.
Epidemiology 217 Molecular and Genetic Epidemiology Bioinformatics & Proteomics John Witte.
EB3233 Bioinformatics Introduction to Bioinformatics.
Don’t make me think Biodiversity Data Publishing Made Easy Laurence Livermore, Vince Smith, Alice Heaton, Simon Rycroft, Ed Baker, Ben Scott & Lyubomir.
Jim Edwards Paddy Patterson Cyndy Parr CoML Synthesis meeting Long Beach, CA 1 February 2009.
Scratchpads and the new Biodiversity Data Journal Biodiversity Data Publishing made… easier Dimitris Koureas Natural History Museum London.
Warwick Cathro Assistant Director-General Resource Sharing and Innovation National Library of Australia Trove – a service built on collaboration OCLC Asia.
Riccardi: DIALOGUE Workshop August 1, 2005 Supported by NSF BDI 1 Representing and Using Phylogenetic Characters in Morphbank Greg Riccardi, David Gaitros,
A Research Collaboratory for Open Source Software Research Yongqin Gao, Matt van Antwerp, Scott Christley, Greg Madey Computer Science & Engineering University.
MBI Summer School 2010: Collaboration in Outsourcing Topics List.
Inspiring and Engaging the Public Towards a Shared Understanding and Sense of Ownership of Freshwater Ecosystems A. Mauroner a, I.J. Harrison ab, & M.
Biodiversity Heritage Library: A Successful Collaboration, A Fully Open Access Collection Marty Schlabach Mann Library, Cornell University Upstate New.
Coordination and Policy Development in Preparation for a European Open Biodiversity Knowledge Management System Supported by the European Commission through.
The OpenAIRE Catalogue of Services
International Congress of Entomology, Orlando
An Open Knowledge & Research Information Infrastructure
Flanders Marine Institute (VLIZ)
RCN Development of an Online Database to Enhance the Conservation of SGCN Invertebrates in the Northeastern Region James W. Fetzner Jr. & John.
Elsevier Activity Range
Data publishing from the viewpoint of a biodiversity publisher
Data Mining: Concepts and Techniques Course Outline
Cynthia S. Parr, Robert Guralnick, Nico Cellinese, Roderic D.M. Page 
Bird of Feather Session
AI Discovery Template IBM Cloud Architecture Center
Presentation transcript:

Cynthia Parr Phenotype RCN NESCent 25 February 2013

EOL aggregates and curates across topics, across the tree of life Scientific Databases, including BHL, GBIF, ALA, INBio, COL, Scratchpads, LifeDesks Scientific Journals Curate Comment Rate, Collect Comment Rate, Collect eol.org Aggregate Quality control API Third party apps

EOL summarizes knowledge Erosaria caputserpentis Serpent's Head Cowrie Depth range based on 51 specimens in 2 taxa. Water temperature and chemistry ranges based on 40 samples. Environmental ranges Depth range (m): Temperature range (°C): Nitrate (umol/L): Salinity (PPS): Oxygen (ml/l): Phosphate (umol/l): Silicate (umol/l): Depth range based on 51 specimens in 2 taxa. Water temperature and chemistry ranges based on 40 samples. Environmental ranges Depth range (m): Temperature range (°C): Nitrate (umol/L): Salinity (PPS): Oxygen (ml/l): Phosphate (umol/l): Silicate (umol/l): From Moorea Biocode From GBIF From OBIS

Statistics 2 years ago 2.8 million pages – one (or more) per taxon 2 million data objects 500 thousand pages with objects 100+ partner databases 700 curators/1000s contributors/~46,000 members Today 3.3 million pages – one (or more) per taxon 5 million CC-licensed data objects Over 1 million pages with objects 200+ partner databases 1200 curators/1000s contributors/~64,000 members

We have an infrastructure... Aggregation mechanisms Names resolution Curation mechanisms Public and machine interfaces User-created collections What are the next use cases to tackle? How could ontologies & annotations help?

See structured info on EOL pages Discover and identify “find taxa with these characteristics”

Browse the whole page semantically, link to related resources (LOD: linked open data) Google Summer of Code with Phenoscape (Alex Ginsca) Using DBPedia Spotlight to extract associations among taxa and add to Linked Open Data cloud ( Devries and Thessen) Linking names, literature, phylogeny (Page) Resolving archeological data on animal domestication in the near east (Alexandria Archive Institute)

Promote NLP text mining and crowdsourcing – Altitude Specificity of Flower Coloration (Wright) – Species Interaction Datasets—Integration, Visualization, and Analysis (Poelen and Mungall) – Crowd-sourced data to examine morphological impacts of extinction risk in ray-finned fishes (Chang) – Macroecological patterns in butterfly-hostplant associations (Ferrer-Parris) – Discovering habitat terms in EOL contents (Pafilis)

Easy access to analyzable data “Are blue organisms more common in high altitudes?” “How can I predict vulnerability to climate change based on life history characteristics?” “What organisms should I collect to fill in gaps in genome quality data?” Look for data, download for all taxa Create a collection of taxa, download all data Use Reol: an R interface to EOL (Banbury, Omeara) Find more specialized data repositories

Dynamic online knowledge Support summaries with networks of evidence – E.g. Bergmann’s rule: animals living in higher latitudes have larger body size As evidence grows or changes, change the knowledge summary Flag evidence that is in conflict with the summary

Erosaria caputserpentis Serpent's Head Cowrie Salinity envelope (n=40) From OBIS Summarize data across providers Flag outlier data

The big picture In progress: Marine computable data Draft phylogenetic tree from Open Tree of Life project TraitBank: access to computable descriptive information across the tree of life

Our funders John D. and Catherine T. MacArthur Foundation Alfred P. Sloane Foundation Smithsonian Institution Marine Biological Laboratory Harvard University David Rubenstein and other funders and donors All our content providers and global partners Volunteer curators and individual contributors via Flickr, Wikimedia, and members of EOL Thanks to