Abstract Plant phenotyping involves the assessment of plant traits such as growth, tolerance, resistance, and yield. The Texas Tech Phenotyping Project.

Slides:



Advertisements
Similar presentations
2 Introduction A central issue in supporting interoperability is achieving type compatibility. Type compatibility allows (a) entities developed by various.
Advertisements

XML DOCUMENTS AND DATABASES
Integrating Cybersecurity Log Data Analysis in Hadoop Bryan Stearns, Susan Urban, Sindhuri Juturu Texas Tech University 2014 NSF Research Experience for.
Design for Engineering Unit 2 Engineering Design and Problem Solving Annette Beattie April 10, 2006 Engineering Design ETP 2006 – Annette Beattie This.
Relational Database Alternatives NoSQL. Choosing A Data Model Relational database underpin legacy applications and meet business needs However, companies.
Introduction  Data movement is a major bottleneck in data-intensive high performance computing  We propose a Fusion Active Storage System (FASS) to address.
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer.
Architecture, Deployment Diagrams, Web Modeling Elizabeth Bigelow CS-15499C October 6, 2000.
Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design Third Edition.
A Virtual Environment for Investigating Counter Measures for MITM Attacks on Home Area Networks Lionel Morgan 1, Sindhuri Juturu 2, Justin Talavera 3,
Research Proposal Presentation, June 21, 2011: David South and Mary Shuman Integration of a Graphics-Based Programming Tool with Robotics to Stimulate.
CS 405G: Introduction to Database Systems 24 NoSQL Reuse some slides of Jennifer Widom Chen Qian University of Kentucky.
DISCLAIMER: This material is based on work supported by the National Science Foundation and the Department of Defense under grant No. CNS Any.
TECHNIQUES FOR OPTIMIZING THE QUERY PERFORMANCE OF DISTRIBUTED XML DATABASE - NAHID NEGAR.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 1.1.
IST Databases and DBMSs Todd S. Bacastow January 2005.
Copyright 2001 Prentice-Hall, Inc. Essentials of Systems Analysis and Design Joseph S. Valacich Joey F. George Jeffrey A. Hoffer Chapter 1 The Systems.
Mobile Mapping Systems (MMS) for infrastructural monitoring and mapping are becoming more prevalent as the availability and affordability of solutions.
EXPLOITING SECURITY VULNERABILITIES IN A SMART GRID HOME AREA NETWORK USING HARDWARE SIMULATION Tyler Flack, Samujjwal Bhandari, and Susan Urban TEXAS.
Approach Overview Using Dorothy, an enhanced version of the Alice 2.0 source code, and a Scribbler robot, it is our aim to increase interest in computer.
ACS1803 Lecture Outline 2 DATA MANAGEMENT CONCEPTS Text, Ch. 3 How do we store data (numeric and character records) in a computer so that we can optimize.
Abstract A software development life cycle can be divided into requirements elicitation, specification, design, implementation, testing, and maintenance.
Systems analysis and design, 6th edition Dennis, wixom, and roth
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design.
Simulation of Fault Detection for Robot Applications Chase Baker, Taeghyun Kang, Michael Shin Ph.D. Interaction with robot applications are becoming increasingly.
Engineering Design By Brian Nettleton This material is based upon work supported by the National Science Foundation under Grant No Any opinions,
Copyright 2002 Prentice-Hall, Inc. Chapter 1 The Systems Development Environment 1.1 Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer.
Getting Biologists off ACID Ryan Verdon 3/13/12. Outline Thesis Idea Specific database Effects of losing ACID What is a NoSQL database Types of NoSQL.
Database System Concepts and Architecture
1. Department of Arts and Sciences, Georgia State University 2. Department of Electrical and Computer Engineering, Texas Tech University 3. Department.
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Lecture2: Database Environment Prepared by L. Nouf Almujally & Aisha AlArfaj 1 Ref. Chapter2 College of Computer and Information Sciences - Information.
Copyright 2002 Prentice-Hall, Inc. 1.1 Modern Systems Analysis and Design Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 1 The Systems Development.
FEN Introduction to the database field:  Applications, concepts and terminology Seminar: Introduction to relational databases.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Computer Aided Design By Brian Nettleton This material is based upon work supported by the National Science Foundation under Grant No Any opinions,
Communication with Handler Approach Overview Alice 2.0 source code was modified to release event information to a robot handler component using sockets.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Energy for Education Moijue Kaikai, Professor Erin Baker, University of Massachusetts Amherst Abstract Many issues surround the global energy crisis. One.
Systems Realization Laboratory The Role and Limitations of Modeling and Simulation in Systems Design Jason Aughenbaugh & Chris Paredis The Systems Realization.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
NoSQL Systems Motivation. NoSQL: The Name  “SQL” = Traditional relational DBMS  Recognition over past decade or so: Not every data management/analysis.
Exploring Unanswered Questions in Earth Science Amy Pallant (PI), The Concord Consortium, Concord, MA This.
Context Aware RBAC Model For Wearable Devices And NoSQL Databases Amit Bansal Siddharth Pathak Vijendra Rana Vishal Shah Guided By: Dr. Csilla Farkas Associate.
DBS201: Data Modeling. Agenda Data Modeling Types of Models Entity Relationship Model.
Group members: Phạm Hoàng Long Nguyễn Huy Hùng Lê Minh Hiếu Phan Thị Thanh Thảo Nguyễn Đức Trí 1 BIG DATA & NoSQL Topic 1:
BIG DATA. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database.
Abstract MarkLogic Database – Only Enterprise NoSQL DB Aashi Rastogi, Sanket V. Patel Department of Computer Science University of Bridgeport, Bridgeport,
WEB BASED DSS Aaron Atuhe. KEY CONCEPTS When software vendors propose implementing a Web-Based Decision Support System, they are referring to a computerized.
Big Data-An Analysis. Big Data: A definition Big data is a collection of data sets so large and complex that it becomes difficult.
Databases and DBMSs Todd S. Bacastow January
CS 405G: Introduction to Database Systems
Cloud Computing and Architecuture
CS122B: Projects in Databases and Web Applications Winter 2017
MongoDB Er. Shiva K. Shrestha ME Computer, NCIT
Technological Design VS Engineering Design
Introduction to Geospatial Technologies in Ag
Discussion and Conclusion
ACS1803 Lecture Outline 2   DATA MANAGEMENT CONCEPTS Text, Ch. 3
Chapter 1 The Systems Development Environment
Trouble Shooting Brian Nettleton
11/18/2018 2:14 PM © Microsoft Corporation. All rights reserved. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN.
Tools for Processing Big Data Jinan Al Aridhee and Christian Bach
Title of Poster Site Visit 2017 Introduction Results
Title of session For Event Plus Presenters 12/5/2018.
Title of Poster Site Visit 2018 Introduction Results
This material is based upon work supported by the National Science Foundation under Grant #XXXXXX. Any opinions, findings, and conclusions or recommendations.
Working with GEOLocation Data
Presentation transcript:

Abstract Plant phenotyping involves the assessment of plant traits such as growth, tolerance, resistance, and yield. The Texas Tech Phenotyping Project is specifically studying the cross-breed of cotton plants that will better survive the harsh climate of West Texas. Using robotics, images of individual plants in a field are being collected and analyzed over time to support the study, generating massive amounts of plant data. This research project is investigating the big data storage and organizational issues for phenotyping data. A conceptual design of the phenotyping data requirements has been generated to illustrate the large scope of the data required. NoSQL database technology has also been investigated as an alternative to relational databases to provide more efficient storage and retrieval. In particular, the utilization of the NoSQL-based Couchbase system has been investigated for its high scalability and cost effective storage of massive data. Temporal data management with respect to NoSQL databases has also been explored due to the time- oriented nature of phenotyping data collection and analysis. This research provides a prototype implementation of image data storage using CouchBase, together with examples of temporal queries and a performance analysis. Objectives 1.Comparing different types of NoSQL Databases to determine which form is appropriate for the phenotyping project requirements. 1. Modeling the entity and attribute data requirements of the phenotyping project. 2.Capturing the temporal aspects and applying it as a data organization method. 3. Support for retrieval and querying of data over time. 4. Prototype using the wicking data application. CouchBase DataBase 1.Primary unit of Storage on the server is JSON documents 2.JSON documents offer a flexible structure that allows a document to be modeled as an object. 3.Couch Base Server 2.0 uses a JavaScript-based query system that uses field values within JSON documents. 1.Using Views to query specific data creates the ability to combine multiple attributes and retrieve documents based on a given specification. Big Data Storage and Access Issues for Phenotyping of Agricultural Data Stephen George,Susan Urban, Eric Hequet, and Hamed Sari-Sarraf Texas Tech 2013 NSF Research Experiences for Undergraduates Site Program Wicking Data 1.Due to the unavailability of plant data in this state of the project, the experiment was be conducted on wicking data. 2.What is Wicking? 1.The ability of a fabric to absorb moisture from a surface (skin). 2.Used in active wear and performance fabrics. The Phenotyping Project Plant phenotyping is the comprehensive assessment of plant complex traits such as growth, development, tolerance, resistance, architecture, physiology, ecology, yield, and the basic measurement of individual quantitative parameters that form the basis for the more complex traits. (LemnaTec) Robotics is being used to monitor and capture the plant’s growth over time and keep track of the plant’s environment. The navigation aspect of the project provides location information for each of the cotton plants in the fields. Each individual plant will have multiple images that capture growth attributes over time. Goals of the Texas Tech Phenotyping project Determining which cross-breed would survive in harsh climate of West Texas. Being able to store and analyze massive amounts of plant data overtime. The F1 cross contains plants over the site of one cotton field. Can potentially store close to 4 million images/attributes over a 10 week span for 1 generation 20 crosses * 200 lines * 2 Reps * 5 environments = 17.2 billion plant data spanning over a year. Massive amounts of data being produced. Future Work Implement the phenotyping database in CouchBase DB in order to store and handle attributes taken from the robot. Create different Views in order to fit the specifications for querying plant data based on physical attributes, time-spatial data, and environment. NoSQL NoSQL groups all the stores created as an attempt to solve problems which cannot fit into a table/column/rows structures. Many NoSQL systems produce better write performance than the traditional Relational Databases. NoSQL handles high volumes of data faster than that of a Relational Databases. Provides a greater level of flexibility when storing different data types such as images, documents and other objects. Key-Value Store: MongoDB, Document Store: MongoDB, Couch, Raven Column store: Hbase, Cassandra References Chen, S. (2010). Multimedia Databases and Data Management: A Survey. International Journal of Multimedia Data Engineering and Management (IJMDEM), 1(1), doi: /jmdem Monger, M. D., Mata-Toledo, R. A., & Gupta, P. (2012). Temporal Data Management in Nosql Databases. Journal of Information Systems & Operations Management, 6(2), Figure 1: Image of a Cotton farm with respect to the time aspects of cotton growth. Summary After looking at various NoSQL databases, it was determined that a document-store based DB, Couchbase would not only satisfy the project requirements, but also provide an in-system crash prevention, making the system durability close to Relational DBs. A data model for the Phenotyping project has been created and is ready for implementation. It supports not only the physical attributes of the plant but also environment variables that affect plant growth. This work also experimented with other forms of data (Wicking data) in order to see if we could implement a similar data model based on the phenotyping project. Figure 4: Sequence of frames of the drying cycle of active wear fabric. Figure 5: Query code for displaying Area based on Experiment 1. Figure 6: Query code for displaying Temperature based on Experiment 1. Figure 7: Graph for Figure 5 Query results. Figure 3: Data Model for the Wicking Experiment. Figure 2: Data Model for the Phenotyping Project. Figures 9: Displays the read speeds for the wicking experiment. DISCLAIMER: This material is based upon work supported by the National Science Foundation and the Department of Defense under Grant No. CNS An opinions, findings, and conclusions or recommendation expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation or the Department of Defense. Figures 8: Displays the write speeds for the wicking experiment. Area/cm Frames Area of Frames