BUILDING HIGHWAYS IN THE INFORMATICS LANDSCAPE Ed Baker 10.6084/m9.figshare.749699.

Presentation transcript:

We need to move from this…

…to this

HOW?

All of these systems: have different data models; are written in different programming languages; are mutually incompatible.

Requirements: How do we make these systems talk?

Each pair can define a way of communicating with each other.

Requirements: We need a lingua franca of biodiversity informatics.

Lingua franca: any language that is widely used as a means of communication among speakers of other languages.

Requirements: Common intermediary
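The case for a common intermediary over pairwise agreements can be made with simple counting. The sketch below is an illustration only: the function names and the list of systems are hypothetical, and it assumes each pair of systems would otherwise need its own agreed way of communicating, while a shared format needs one mapping per system.

```python
from itertools import combinations


def pairwise_links(n_systems: int) -> int:
    """Distinct pairs of systems, each needing its own agreed exchange."""
    return n_systems * (n_systems - 1) // 2


def hub_links(n_systems: int) -> int:
    """Mappings needed when every system only maps to one shared format."""
    return n_systems


# Hypothetical set of six mutually incompatible systems.
systems = ["A", "B", "C", "D", "E", "F"]

# Cross-check the closed formula against an explicit enumeration of pairs.
assert pairwise_links(len(systems)) == len(list(combinations(systems, 2)))

for n in (3, 6, 12):
    print(f"{n} systems: {pairwise_links(n)} pairwise links, {hub_links(n)} via a common format")
```

The pairwise count grows quadratically with the number of systems, while the common-intermediary count grows linearly.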

Requirements:
Easily understood (by people, by machines)
Easy to share (using existing technology and infrastructure)
Can read in Excel
Everything precisely defined
Standard formats: csv, zip
Standard delivery: via the web

Core File (classification)

Core File (classification), linked to: Images, Literature, Specimens, Taxonomic treatments

STAR SCHEMA: the Core File sits at the centre, with Images, Literature, Specimens and Taxonomic treatments around it

Meta.xml: What is each file? What does each column in a file contain?

Core File, Images, Literature, Specimens, Taxonomic treatments and Meta.xml are packaged together as dwca.zip
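How a machine might read such a package can be sketched in a few lines. This is a minimal sketch, assuming dwca.zip is a Darwin Core Archive whose meta.xml follows the Darwin Core text guidelines. The file names, the two-column tables and the sample rows are invented for illustration, and only a small subset of the descriptor's attributes is handled.

```python
import csv
import io
import xml.etree.ElementTree as ET
import zipfile

NS = {"dwc": "http://rs.tdwg.org/dwc/text/"}

# Simplified, illustrative descriptor: one core file and one extension file.
META_XML = """<?xml version="1.0" encoding="UTF-8"?>
<archive xmlns="http://rs.tdwg.org/dwc/text/">
  <core rowType="http://rs.tdwg.org/dwc/terms/Taxon" fieldsTerminatedBy="," ignoreHeaderLines="1">
    <files><location>classification.csv</location></files>
    <id index="0"/>
    <field index="1" term="http://rs.tdwg.org/dwc/terms/scientificName"/>
  </core>
  <extension rowType="http://rs.gbif.org/terms/1.0/Image" fieldsTerminatedBy="," ignoreHeaderLines="1">
    <files><location>images.csv</location></files>
    <coreid index="0"/>
    <field index="1" term="http://purl.org/dc/terms/identifier"/>
  </extension>
</archive>
"""


def build_example_archive() -> bytes:
    """Create a tiny in-memory archive so the example needs no external files."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("meta.xml", META_XML)
        zf.writestr("classification.csv",
                    "taxonID,scientificName\n1,Gryllus campestris\n2,Acheta domesticus\n")
        zf.writestr("images.csv",
                    "taxonID,identifier\n1,http://example.org/img/1.jpg\n")
    return buf.getvalue()


def read_table(zf, element):
    """Read one file using the column definitions given in meta.xml."""
    location = element.find("dwc:files/dwc:location", NS).text
    skip = int(element.get("ignoreHeaderLines", "0"))
    delimiter = element.get("fieldsTerminatedBy", ",")
    id_el = element.find("dwc:id", NS)
    if id_el is None:  # extension files point back to the core via <coreid>
        id_el = element.find("dwc:coreid", NS)
    columns = {int(id_el.get("index")): "coreid"}
    for field in element.findall("dwc:field", NS):
        # Use the last part of the term URI as a short column name.
        columns[int(field.get("index"))] = field.get("term").rsplit("/", 1)[-1]
    text = zf.read(location).decode("utf-8")
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))[skip:]
    return [{name: row[i] for i, name in columns.items()} for row in rows]


def read_archive(data: bytes):
    """Return the core rows and a dict of extension rows keyed by row type."""
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        root = ET.fromstring(zf.read("meta.xml"))
        core = read_table(zf, root.find("dwc:core", NS))
        extensions = {}
        for ext in root.findall("dwc:extension", NS):
            extensions[ext.get("rowType").rsplit("/", 1)[-1]] = read_table(zf, ext)
    return core, extensions


core, extensions = read_archive(build_example_archive())
for taxon in core:
    images = [r["identifier"] for r in extensions["Image"] if r["coreid"] == taxon["coreid"]]
    print(taxon["scientificName"], images)
```

The reader never hard-codes a file name or column order: both come from meta.xml, which is what lets different systems consume the same zip.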

WHAT’S THE BENEFIT?

dwca.zip

?

These images came from the Scratchpad

So did this description

This map didn’t

Aggregators allow us to: provide a single user interface to many different systems; search easily across multiple datasets simultaneously (and combine results).
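The aggregator idea can be sketched as a single search function over records from several providers. Everything here is hypothetical: the `Record` type, the provider names and the sample data are invented, and a real aggregator would index harvested archives rather than scan in-memory lists.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    source: str            # which system supplied the record
    scientific_name: str
    kind: str              # "image", "description", "map", ...
    value: str


# Hypothetical providers and records, invented for illustration.
DATASETS = {
    "scratchpad": [
        Record("scratchpad", "Gryllus campestris", "image", "http://example.org/img/1.jpg"),
        Record("scratchpad", "Gryllus campestris", "description", "Example description text."),
        Record("scratchpad", "Acheta domesticus", "image", "http://example.org/img/2.jpg"),
    ],
    "occurrence_provider": [
        Record("occurrence_provider", "Gryllus campestris", "map", "http://example.org/map/1.png"),
    ],
}


def search(datasets, name):
    """Search every dataset for a name and combine the hits into one list."""
    query = name.strip().lower()
    hits = []
    for records in datasets.values():
        hits.extend(r for r in records if r.scientific_name.lower() == query)
    return hits


for hit in search(DATASETS, "Gryllus campestris"):
    print(f"{hit.kind:12} from {hit.source}: {hit.value}")
```

Each hit keeps its `source`, so a combined page can still say which items came from the Scratchpad and which did not.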

You can also share data with …

ANY OF THE INCREASING NUMBER OF PEOPLE WHO SUPPORT THE STANDARD