NutNet data update 15 Aug 2011 Data management update

Presentation transcript:

NutNet data update 15 Aug 2011 Data management update Taxonomic resolution update

Normalized data
From Wikipedia: “Codd's twelve rules” are a set of thirteen rules (numbered 0-12) proposed by Edgar F. Codd, a pioneer of the relational model for databases…
Edgar Frank "Ted" Codd, IBM scientist, inventor of the relational model
* Data are stored, inserted, changed, and deleted in ONLY ONE PLACE.
* New data add rows, not columns.
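The "rows, not columns" rule can be sketched in a few lines of Python. This is only an illustration: the plot numbers, years, and cover values are made up and are not NutNet data, and `wide_to_long` is a hypothetical helper name.

```python
# Hypothetical wide layout: each new year of data would need a new column.
wide = [
    {"plot": 1, "cover_2009": 12.0, "cover_2010": 15.5},
    {"plot": 2, "cover_2009": 8.0, "cover_2010": 9.5},
]


def wide_to_long(rows, prefix="cover_"):
    """Turn one column per year into one row per (plot, year) observation."""
    long_rows = []
    for row in rows:
        for key, value in row.items():
            if key.startswith(prefix):
                year = int(key[len(prefix):])
                long_rows.append({"plot": row["plot"], "year": year, "cover": value})
    return long_rows


long_rows = wide_to_long(wide)

# A new year of data is appended as a row; the set of columns never changes.
long_rows.append({"plot": 1, "year": 2011, "cover": 17.0})
```

In the long layout every observation has the same three fields, so adding a year (or a new measurement type, with one more key column) does not require changing the table structure.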

Site  Site_Name
1     XYZ
2     PDQ

1  2

Borer, Seabloom et al. 2009 Bulletin of the Ecological Society of America 90:205–214
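A site lookup table like the one on this slide might be set up as follows, using Python's built-in `sqlite3` module. This is a sketch under assumptions, not the actual NutNet schema: the `plot` table, the column names `site_id` and `plot_id`, and the corrected site name are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")
conn.executescript("""
    CREATE TABLE site (
        site_id   INTEGER PRIMARY KEY,   -- the slide's "Site" column
        site_name TEXT NOT NULL          -- the slide's "Site_Name" column
    );
    -- Hypothetical child table: plots store only the site key, never the name.
    CREATE TABLE plot (
        plot_id INTEGER PRIMARY KEY,
        site_id INTEGER NOT NULL REFERENCES site(site_id)
    );
""")
conn.executemany("INSERT INTO site VALUES (?, ?)", [(1, "XYZ"), (2, "PDQ")])
conn.executemany("INSERT INTO plot VALUES (?, ?)", [(10, 1), (11, 1), (20, 2)])

# Correct a site name in one place; every plot picks it up through the join.
conn.execute("UPDATE site SET site_name = 'XYZ renamed' WHERE site_id = 1")
plots = conn.execute(
    "SELECT plot.plot_id, site.site_name "
    "FROM plot JOIN site USING (site_id) ORDER BY plot.plot_id"
).fetchall()
```

Because the name lives only in `site`, a single `UPDATE` is enough; no plot rows need to be touched.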

NutNet relational database
* Places & people
* Observed plant taxa
* Units of observation
* Observed data
Supported by U of M MSI

Taxon observations
Connect to site
* Taxonomy presents a common challenge to large ecological datasets.
* Species are the terminal identity in a nested hierarchy, easily captured by relational db models.
* Lookup and intersection tables can connect a single taxon identity to multiple local names.
* A lot of work has been done trying to reconcile the taxonomy.
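The lookup-plus-intersection idea can be sketched with Python's built-in `sqlite3` module. All table names, column names, and the `taxon_for` helper are assumptions made for illustration, not the actual NutNet schema, and the name strings are invented examples of local spellings.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Lookup table: one row per resolved taxon identity.
    CREATE TABLE taxon (
        taxon_id   INTEGER PRIMARY KEY,
        taxon_name TEXT NOT NULL UNIQUE
    );
    -- Lookup table: one row per name as recorded locally.
    CREATE TABLE local_name (
        local_name_id INTEGER PRIMARY KEY,
        local_name    TEXT NOT NULL UNIQUE
    );
    -- Intersection table: one row per (taxon, local name) link.
    CREATE TABLE taxon_local_name (
        taxon_id      INTEGER NOT NULL REFERENCES taxon(taxon_id),
        local_name_id INTEGER NOT NULL REFERENCES local_name(local_name_id),
        PRIMARY KEY (taxon_id, local_name_id)
    );
""")
conn.executemany("INSERT INTO taxon VALUES (?, ?)",
                 [(1, "Poa pratensis"), (2, "Bromus inermis")])
conn.executemany("INSERT INTO local_name VALUES (?, ?)",
                 [(1, "Poa pratensis"), (2, "POA PRATENSIS L."), (3, "Bromus inermis")])
conn.executemany("INSERT INTO taxon_local_name VALUES (?, ?)",
                 [(1, 1), (1, 2), (2, 3)])


def taxon_for(name):
    """Resolve a local name to its taxon name, or None if it is unknown."""
    row = conn.execute(
        """
        SELECT t.taxon_name
        FROM local_name AS l
        JOIN taxon_local_name AS x ON x.local_name_id = l.local_name_id
        JOIN taxon AS t ON t.taxon_id = x.taxon_id
        WHERE l.local_name = ?
        """,
        (name,),
    ).fetchone()
    return row[0] if row else None


n_names = conn.execute("SELECT COUNT(*) FROM local_name").fetchone()[0]
n_taxa = conn.execute("SELECT COUNT(*) FROM taxon").fetchone()[0]
```

With this layout there can be more local names than taxon identifiers, since several spellings or synonyms resolve to the same `taxon_id`.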

~1860 plant names
~1739 unique taxon identifiers

The next target: Further resolving NutNet phylogeny

NutNet Results are based on your data – QC is critical
* Note any data issues as you work.
* Changes to data should be made in one place. Any changes, updates, or deletions necessary to the raw NutNet data should be given in writing (preferably by email) to Eric Lind, elind@umn.edu.
* Examine the species list in your folder; add or correct for completeness.