Computer, what is the trajectory of the planet Seti Alpha 5?

Presentation transcript:

Computer, what is the trajectory of the planet Seti Alpha 5?

How many algal species can be found on this planet?

What species is this?

BIG = data-centric (like particle physics and astronomy)
Characterized by data sharing via a virtual pool
New = new skill sets, tools, cyberinfrastructure to exploit the data pool
Data-driven discovery as a new means of understanding
GenBank as a model within the Life Sciences

Large number of providers with small amounts of data. Small number of providers with lots of data.

Aa paleacea, Limulus polyphemus, Kiwa hirsuta, Osedax frankpressi, Kingia australis, Pieris japonica, Pieris rapae, Trypanosoma brucei, Homo sapiens

Didimosphenia geminata, Didymosphenia geminata, Rock snot, Didymo, Echinella geminata, Gomphonema geminatum, Gomphonema vulgare

Didymosphenia geminata, Didimosphenia geminata, Didymo, Rock Snot, Echinella geminata, Gomphonema geminatum, Gomphonema vulgare

Disambiguate by authority, species, contextual data
Contextual data: Diatom, Chloroplast, Frustule, Benthic, Marine
Contextual data: Food, Moth, Wings, Exoskeleton, Caterpillar
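A minimal sketch of disambiguation by contextual data, assuming the two term lists on this slide describe two different organisms that share one name. The sense labels, the `disambiguate` function, and the word-overlap scoring are hypothetical illustrations, not the method used by the Global Names project.

```python
import re

# Context terms taken from the slide; the sense labels are hypothetical.
CONTEXTS = {
    "diatom sense": {"diatom", "chloroplast", "frustule", "benthic", "marine"},
    "moth sense": {"food", "moth", "wings", "exoskeleton", "caterpillar"},
}

def disambiguate(text, contexts=CONTEXTS):
    """Return the sense whose context terms overlap most with the text.

    Returns None when there is no evidence or the top scores are tied.
    """
    words = set(re.findall(r"[a-z]+", text.lower()))
    scores = {sense: len(words & terms) for sense, terms in contexts.items()}
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    best_sense, best_score = ranked[0]
    if best_score == 0:
        return None
    if len(ranked) > 1 and ranked[1][1] == best_score:
        return None
    return best_sense

if __name__ == "__main__":
    print(disambiguate("The caterpillar feeds before the moth develops wings."))
```

Counting shared words is the simplest possible scoring. Authority strings and the species epithet, which the slide also names as disambiguation cues, could be added as further evidence.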

[Diagram labels: GNA; Provider Services; Consumer Services; DATA AND SERVICE PROVIDERS; DATA AND SERVICE CONSUMERS; EXPERTS]

Managing names to manage biodiversity data
- All names (scientific, vernacular, surrogate)
- For all organisms
- Many names for one species reconciled
- One name for many species disambiguated
Global Names Architecture: a virtual layer, using names services to link together distributed data
Globalnames.org
Micro*scope (microscope.mbl.edu) and Encyclopedia of Life (eol.org)
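Reconciling many names for one species can be sketched as a lookup from every known variant to a single accepted name. The variants below are the ones shown on the earlier slides. The lookup table, the choice of accepted name, and the `reconcile` function are assumptions for illustration, not the Global Names services themselves.

```python
from typing import Optional

# Hypothetical accepted name and variant list, built from the names on the slides.
ACCEPTED = "Didymosphenia geminata"
VARIANTS = [
    "Didymosphenia geminata",
    "Didimosphenia geminata",  # spelling variant listed on the slides
    "Didymo",
    "Rock snot",
    "Echinella geminata",
    "Gomphonema geminatum",
    "Gomphonema vulgare",
]

def normalize(name: str) -> str:
    """Collapse case and whitespace so trivially different strings match."""
    return " ".join(name.split()).lower()

# Every normalized variant points at the same accepted name.
NAME_INDEX = {normalize(variant): ACCEPTED for variant in VARIANTS}

def reconcile(name: str) -> Optional[str]:
    """Return the accepted name for a known variant, or None if unknown."""
    return NAME_INDEX.get(normalize(name))

if __name__ == "__main__":
    for raw in ["Rock  Snot", "gomphonema geminatum", "Kiwa hirsuta"]:
        print(raw, "->", reconcile(raw))
```

Records from different providers that use different variants can then be linked on the reconciled name rather than on the raw string.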

Narrative tradition in biology
Too much for a human
Can we get a machine to do the work?
NLP!!!

Use NLP/machine learning to extract names and characters (Hong Cui)
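As a rough illustration of name extraction, the sketch below finds candidate binomials with a regular expression and keeps those whose genus appears in a known list. This is a naive dictionary-plus-pattern heuristic standing in for the NLP/machine-learning approach named on the slide. The genus list and the `find_names` function are hypothetical.

```python
import re

# Hypothetical genus dictionary, seeded with genera named on earlier slides.
KNOWN_GENERA = {"Didymosphenia", "Pieris", "Limulus", "Homo", "Spirogyra"}

# Candidate binomial: capitalized word followed by a lowercase word of 3+ letters.
BINOMIAL = re.compile(r"\b([A-Z][a-z]+) ([a-z]{3,})\b")

def find_names(text):
    """Return candidate binomials whose genus is in the known list."""
    names = []
    for match in BINOMIAL.finditer(text):
        genus, epithet = match.groups()
        if genus in KNOWN_GENERA:
            names.append(f"{genus} {epithet}")
    return names

if __name__ == "__main__":
    sample = ("Blooms of Didymosphenia geminata were found downstream "
              "of Pieris japonica shrubs.")
    print(find_names(sample))
```

The pattern alone would also accept ordinary phrases such as a capitalized sentence opener followed by a lowercase word, which is why the genus filter is applied. Extracting characters from narrative descriptions is a harder problem and is not attempted here.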

Spirogyra:chloroplasts:present

Spirogyra:chloroplasts:present:attribution
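The colon-separated statements on these two slides can be read as taxon, character, state, and an optional attribution. A minimal parser sketch follows. The field names, the `Statement` type, and `parse_statement` are assumptions introduced here, and the attribution value is a placeholder.

```python
from typing import NamedTuple, Optional

class Statement(NamedTuple):
    taxon: str
    character: str
    state: str
    attribution: Optional[str] = None

def parse_statement(line: str) -> Statement:
    """Parse 'taxon:character:state' with an optional ':attribution' suffix."""
    # maxsplit=3 lets the attribution itself contain colons (for example a URL).
    parts = [part.strip() for part in line.split(":", 3)]
    if len(parts) < 3 or not all(parts):
        raise ValueError(f"expected taxon:character:state[:attribution], got {line!r}")
    return Statement(*parts)

if __name__ == "__main__":
    print(parse_statement("Spirogyra:chloroplasts:present"))
    print(parse_statement("Spirogyra:chloroplasts:present:attribution"))
```

Keeping the attribution alongside the statement means each extracted fact can be traced back to its source.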

coffee is a drink

Triple Store
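A triple store holds statements as (subject, predicate, object) and answers pattern queries over them. The in-memory sketch below is a toy illustration using the two statements from the preceding slides. The `TripleStore` class and its methods are hypothetical and do not correspond to any particular product's API.

```python
class TripleStore:
    """Toy in-memory store of (subject, predicate, object) triples."""

    def __init__(self):
        self._triples = set()

    def add(self, subject, predicate, obj):
        self._triples.add((subject, predicate, obj))

    def match(self, subject=None, predicate=None, obj=None):
        """Return sorted triples matching the pattern; None acts as a wildcard."""
        return sorted(
            triple for triple in self._triples
            if (subject is None or triple[0] == subject)
            and (predicate is None or triple[1] == predicate)
            and (obj is None or triple[2] == obj)
        )

store = TripleStore()
store.add("coffee", "is a", "drink")
store.add("Spirogyra", "chloroplasts", "present")

if __name__ == "__main__":
    print(store.match(predicate="is a"))
    print(store.match(subject="Spirogyra"))
```

A production store would index the triples and use shared identifiers for subjects and predicates so that data from different providers can be joined.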

Informatics/computing training
Modified workflows
Importance of data management and preservation

Big New Biology is coming; taxonomy can benefit from being a part of it
Existing data can be made machine-readable using information extraction algorithms
Existing workflows can be modified to capture data close to the source
Data can be shared using the semantic web

Dima Mozzherin, David Shorthouse, Sayeed Choudhury, Pete DeVries