Using Desktop Data in Kepler Dan Higgins – NCEAS Prepared for: Ecoinformatics Training for Ecologists LTER (Albuquerque) January 8-12, 2007

Slides:



Advertisements
Similar presentations
Overview of the Science Environment for Ecological Knowledge (SEEK) Ricardo Scachetti Pereira.
Advertisements

Advanced Microsoft Word Lesson 10 – Customizing Tables and Creating Charts Microsoft Office XP: Advanced Course.
DATA ANALYTICS. NORMS Cell Phones on Vibrate Respect all opinions.
SONet (Scientific Observations Network) and OBOE (Extensible Observation Ontology): Mark Schildhauer, Director of Computing National Center for Ecological.
Chad Berkley National Center for Ecological Analysis and Synthesis (NCEAS), University of California, Santa Barbara February.
Experiences in Integration of the 'R' System into Kepler Dan Higgins – National Center for Ecological Analysis and Synthesis (NCEAS), UC Santa Barbara.
Workflow Exchange and Archival: The KSW File and the Kepler Object Manager Shawn Bowers (For Chad Berkley & Matt Jones) University of California, Davis.
Introduction to Kepler Deana Pennington, PhD University of New Mexico LTER Network Office, Sevilleta LTER PI CI-Team: Advancing CI-Based Science through.
GIS Actors in Kepler - Java-based, GDAL-JNI, and C++(Grass) Routines Dan Higgins - UC Santa Barbara (NCEAS) Chad Berkley – UC Santa Barbara (NCEAS) Jianting.
North American initiatives in Ecoinformatics: Vegbank and SEEK Robert K. Peet and The Ecological Society of America Vegetation Panel The SEEK development.
Leveraging semantic metadata for ecological data discovery and integration for analysis and modeling Matthew B. Jones Mark P. Schildhauer with contributions.
The Kepler Project Overview, Status, and Future Directions Matthew B. Jones on behalf of the Kepler Project team National Center for Ecological Analysis.
WINKS 7 Tutorial 6 – Opening an Excel data file Permission granted for use for instruction and for personal use. © Alan C. Elliott, 2015.
Pasewark & Pasewark 1 Access Lesson 6 Integrating Access Microsoft Office 2007: Introductory.
1 Access Lesson 6 Integrating Access Microsoft Office 2010 Introductory Pasewark & Pasewark.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Biology.sdsc.edu CIPRes in Kepler: An integrative workflow package for streamlining phylogenetic data analyses Zhijie Guan 1, Alex Borchers 1, Timothy.
Based on material developed by Samantha Romanello and
January, 23, 2006 Ilkay Altintas
Introduction for BEAM Ecological Niche Modeling Working Meeting Deana Pennington University of New Mexico December 14, 2004.
Long Term Ecological Research Network Information System LTER Grid Pilot Study LTER Information Manager’s Meeting Montreal, Canada 4-7 August 2005 Mark.
Training course on biodiversity data publishing and fitness-for-use in the GBIF Network, 2011 edition How Darwin Core Archives have changed the landscape.
Kepler Exercise Deana Pennington University of New Mexico January 9, 2007.
Pipelines and Scientific Workflows with Ptolemy II Deana Pennington University of New Mexico LTER Network Office Shawn Bowers UCSD San Diego Supercomputer.
Knb.ecoinformatics.org LTER EML Best Practices Data Discovery in the Biological Sciences 7-9 February 2005 Mark Servilla LTER Network Office University.
INSERT BOOK COVER 1Copyright © 2011 Pearson Education, Inc. Publishing as Prentice Hall. Exploring Microsoft Office Excel 2010 by Robert Grauer, Keith.
Directions in observational data organization: from schemas to ontologies Matthew B. Jones 1 Chad Berkley 1 Shawn Bowers 2 Joshua Madin 3 Mark Schildhauer.
Ecological Metadata Language (EML) and Morpho
Science Environment for Ecological Knowledge Bertram Ludäscher San Diego Supercomputer Center University of California, San Diego
Science Environment for Ecological Knowledge: EcoGrid Matthew B. Jones National Center for.
Introduction to Microsoft Word Date: March 6, 2012 Time: 9:00 AM to 11:00 AM Location: Maher Hall 114 Computer Lab Instructor: Joel Elad.
SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.
Data, Metadata, and Ontology in Ecology Matthew B. Jones National Center for Ecological Analysis and Synthesis (NCEAS) University of California Santa Barbara.
Chad Berkley NCEAS National Center for Ecological Analysis and Synthesis (NCEAS), University of California Santa Barbara Long Term Ecological Research.
LTER Information Management Training Materials LTER Information Managers Committee Documenting Spatial Data Theresa Valentine Andrews LTER.
SAN DIEGO SUPERCOMPUTER CENTER Inca Data Display (data consumers) Shava Smallen Inca Workshop September 5, 2008.
Grid Technologies Arcot Rajasekar (SEEK) Paul Watson (North East eScience Centre)
Ecoinformatics Workshop Summary SEEK, LTER Network Main Office University of New Mexico Aluquerque, NM.
The SEEK EcoGrid: A Data Grid System for Ecology Arcot Rajasekar Matthew Jones Bertram Ludäscher
1 ADVANCED MICROSOFT POWERPOINT Lesson 9 – Importing and Exporting Information Microsoft Office 2003: Advanced.
Introduction to Morpho BEAM Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
Using R in Kepler Dan Higgins – NCEAS Prepared for: Ecoinformatics Training for Ecologists LTER (Albuquerque) January 8-12, 2007
Kepler Deana Pennington LTER Network Office. Download Kepler Kepler website: website:
Kepler includes contributors from GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs (SEEK), EAR (GEON), DOE DE-FC02-01ER25486.
Information Management using Ecological Metadata Language Corinna Gries - CAP Margaret O’Brien - SBC.
EScience Workshop on Scientific Workflows Matthew B. Jones National Center for Ecological Analysis and Synthesis University of California Santa Barbara.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
MySQL Importing and creating a database. CSV (Comma Separated Values) file CSV = Comma Separated Values – they are simple text files containing data which.
Kepler Exercise Deana Pennington University of New Mexico December 10, 2004.
SEEK Science Environment for Ecological Knowledge l EcoGrid l Ecological, biodiversity and environmental data l Computational access l Standardized, open.
Matthew B. Jones Jim Regetz National Center for Ecological Analysis and Synthesis (NCEAS) University of California Santa Barbara NCEAS Synthesis Institute.
Visualization in Kepler Dan Higgins – NCEAS Prepared for: Ecoinformatics Training for Ecologists LTER (Albuquerque) January 8-12, 2007
Morpho – metadata management software SEEK Training January 2004.
NETS Stakeholder Workshop: Researcher Panel Shirley Han 28 June
Workflow-Driven Science using Kepler Ilkay Altintas, PhD San Diego Supercomputer Center, UCSD words.sdsc.edu.
Kepler BEAM Workshop Samantha Romanello LTER Network Office.
Copyright 2007, Paradigm Publishing Inc. EXCEL 2007 Chapter 8 BACKNEXTEND 8-1 LINKS TO OBJECTIVES Import data from Access, a Web site, or a CSV text file.
EcoGrid in SEEK A Data Grid System for Ecology Bertram Ludaescher University of California, Davis Arcot Rajasekar San Diego Supercomputer Center, University.
Scientific workflow in Kepler – hands on tutorial
Microsoft Office Illustrated
Azure Machine Learning & ML Studio
GRIDS Community Workshop
LTER Metadata Query Interface – Current Status and Future Challenges
Microsoft Excel 2007 The L Line The Express Line to Learning L Line
Microsoft Excel 2007 – Level 2
Navya Thum January 30, 2013 Day 5: MICROSOFT EXCEL Navya Thum January 30, 2013.
A Semantic Type System and Propagation
Ecological Informatics: Challenges and Benefits Presentation to ESA Visions Committee March.
Presentation transcript:

Using Desktop Data in Kepler Dan Higgins – NCEAS Prepared for: Ecoinformatics Training for Ecologists LTER (Albuquerque) January 8-12,

Viewing a Dataset – Text Editor 1999 Sevilleta LTER NPP Quadrat Sampling Data Text Editor view of data from a web page Includes both data and documentation (metadata) In a single text document 727 KB file

Viewing a Dataset - Excel 1999 Sevilleta LTER NPP Quadrat Sampling Data Excel View Data and column header only Can be saved in various formats SevilletaData.xls – 1489 KB SevilletaData.csv – 369 KB SevilletaData.txt – 369 KB SevilletaData.xlm – 5863 KB Only some formats are easily readable by other applications! *.csv - comma separated values ; *.txt - tab separated values (Cutting & Pasting from Excel results in tab separated columns)

Viewing a Dataset – Morpho 1999 Sevilleta LTER NPP Quadrat Sampling Data Morpho view Shows data and eml metadata

Viewing a Dataset – Kepler 1999 Sevilleta LTER NPP Quadrat Sampling Data Kepler view (using KNB Metacat Ecogrid query) Can view formatted EML metadata Default configuration shows a port for each column in the data table

Viewing a Dataset – Kepler 1999 Sevilleta LTER NPP Quadrat Sampling Data Kepler view (using KNB Metacat Ecogrid query) Data source actor can be configured to display the data by running a simple workflow.

Viewing a Dataset - Kepler Kepler view (using local EML2 Dataset actor) Depends on proper format of link from Metadata (eml) to the local data file (not yet working with local Morpho files)

Kepler – ReadTable Actor 1999 Sevilleta LTER NPP Quadrat Sampling Data Kepler view (using the R-based ReadTable actor) Read local file and provide metadata such as separator, file name, header presence, etc.

Kepler – ReadTable Actor 1999 Sevilleta LTER NPP Quadrat Sampling Data Kepler view (using the R-based ReadTable actor) Result of executing workflow

Kepler – ReadTable Actor 1999 Sevilleta LTER NPP Quadrat Sampling Data Kepler view (using the R-based ReadTable actor) Text display from the ReadTable actor after adding ‘dim(df)’ and ‘summary(df)’ commands Row and Column count Data Summary

Kepler – ReadTable Actor 1999 Sevilleta LTER NPP Quadrat Sampling Data Kepler view (using the R-based ReadTable actor) Result of creating a BoxPlot of data in the 9th column (the ‘height’ column)

Kepler – ReadTable Actor Kepler view (using the R-based ReadTable actor) Dataframe created by the ReadTable actor can be passed To another actor for further processing

Kepler – ReadTable Actor Kepler view (using the R-based ReadTable actor) Result of further dataframe processing: Species vs count BoxPlots

Acknowledgements This material is based upon work supported by: The National Science Foundation under Grant Numbers , , , , , and Collaborators: NCEAS (UC Santa Barbara), University of New Mexico (Long Term Ecological Research Network Office), San Diego Supercomputer Center, University of Kansas (Center for Biodiversity Research), University of Vermont, University of North Carolina, Napier University, Arizona State University, UC Davis The National Center for Ecological Analysis and Synthesis, a Center funded by NSF (Grant Number ), the University of California, and the UC Santa Barbara campus. The Andrew W. Mellon Foundation. Kepler contributors: SEEK, Ptolemy II, SDM/SciDAC, GEON