V. Uddameri Texas Tech University

Slides:



Advertisements
Similar presentations
Lab for Remote Sensing Hydrology and Spatial Modeling Dept. of Bioenvironmental Systems Engineering, NTU Satellite Remote Sensing for Land-Use/Land-Cover.
Advertisements

R and HDInsight in Microsoft Azure
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
Big Data Management and Analytics Introduction Spring 2015 Dr. Latifur Khan 1.
Big Data and Predictive Analytics in Health Care Presented by: Mehadi Sayed President and CEO, Clinisys EMR Inc.
Big Data Workflows N AME : A SHOK P ADMARAJU C OURSE : T OPICS ON S OFTWARE E NGINEERING I NSTRUCTOR : D R. S ERGIU D ASCALU.
Sheldon Brown, UCSD, Site Director Milton Halem, UMBC Director Yelena Yesha, UMBC Site Director Tom Conte, Georgia Tech Site Director Fundamental Research.
Data Mining – Intro.
Basic Marketing Research Customer Insights and Managerial Action
Big Data Dr. Michael Stachiw January 10, 2015 What it is, what it means, and where do we go from here..
2.3 Methods for Big Data What is “Big Data”? Summarizing Big Data.
Big Data. What is Big Data? Big Data Analytics: 11 Case Histories and Success Stories
‘Big Data’ Panel Discussion Using data in applications Digital Ship Hamburg 2015.
© 2012 IBM Corporation IBM Security Systems 1 © 2013 IBM Corporation 1 Ecommerce Antoine Harfouche.
CODAR Ben Kravitz September 29, Outline What is CODAR? Doppler shift Bragg scatter How CODAR works What CODAR can tell us.
Dr. Michael D. Featherstone Summer 2013 Introduction to e-Commerce Web Analytics.
Astro / Geo / Eco - Sciences Illustrative examples of success stories: Sloan digital sky survey: data portal for astronomy data, 1M+ users and nearly 1B.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Steve Kopp Esri ArcGIS 10 and Beyond Arc Hydro River Workshop, Austin, Texas, December 1, 2010.
SUPPLY CHAIN OF BIG DATA. WHAT IS BIG DATA?  A lot of data  Too much data for traditional methods  The 3Vs  Volume  Velocity  Variety.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
Support to scientific research on seasonal-to-decadal climate and air quality modelling Pierre-Antoine Bretonnière Francesco Benincasa IC3-BSC - Spain.
1 Yun Fan, Huug van den Dool, Dag Lohmann, Ken Mitchell CPC/EMC/NCEP/NWS/NOAA Kunming, May, 2004.
If you have a transaction processing system, John Meisenbacher
BIG DATA. The information and the ability to store, analyze, and predict based on that information that is delivering a competitive advantage.
Big Data Quality Panel Norman Paton University of Manchester.
Big Data Javad Azimi May First of All… Sorry about the language  Feel free to ask any question Please share similar experiences.
Big Data and Official Statistics: Philippine Context Erniel B. Barrios.
Digitial Precision Agriculture and Location Intelligence Levente Klein IBM TJ Watson Research Center.
The Application of Big Data and Learning Analytics in Universities Management: Practices from the Open University of China 魏顺平 博士 Shunping Wei May. 22nd,
Data Analytics (CS40003) Introduction to Data Lecture #1
“Scorecards and PokemonGO”
Big Data in Technical and Vocational Education (TVE)
CNIT131 Internet Basics & Beginning HTML
Data Analytics 1 - THE HISTORY AND CONCEPTS OF DATA ANALYTICS
IOT – Firefighting Example
Data Mining – Intro.
Data Platform Modernization
Supporting the human decision maker IBM Research and big data at the edge Dr. Jürg von Känel Associate Director IBM Research Australia.
Understanding Big Data
Big Data.
BIG DATA IN ENGINEERING APPLICATIONS
Eric Shook Department of Geography Kent State University
Big-Data Fundamentals
Table 1. Advantages and Disadvantages of Traditional DM/ML Methods
New Unit! Climate Change.
Tamas Szalay, Volker Springel, Gerard Lemson
Daryl Herzmann and Raymond Arritt
Jennifer Boehnert Emily Riddle Tom Hopson
Introduction Data Mining for Business Analytics.
DEPT OF CIVIL ENGINEERING, TEXAS A&M UNIVERSITY MAY 03, 2004
(Geo) Informatics across Disciplines!
AUDIT AND VALIDATION TESTING FOR BIG DATA APPLICATIONS
Data Platform Modernization
Data Warehousing and Data Mining
Big Data.
Data and meta-data exploration and data quality reporting for GCOS
Intersection of GI and IT
Bleeding edge technology to transform Data into Knowledge
WATER IN THE ATMOSPHERE
Zoie Barrett and Brian Lam
Dep. of Information Technology By: Raz Dara Mohammad Amin
Big Data: Four Vs Salhuldin Alqarghuli.
Big DATA.
Data Analysis and R : Technology & Opportunity
Visualization of Global Argo Metadata:
Flood Modeling Tools for Response and Alerting
Data Curation in Climate and Weather
Presentation transcript:

3, 5 or 7 Vs of big data and what they mean to water resource management? V. Uddameri Texas Tech University M. Young, A. Sun, J. Banner, S. Pierce, D. Tremaine University of Texas at Austin

What is Big Data What is big data? Large volumes of data Moving target – In early 1990s 1TB was big data (average Hard Drive ~ 3 GB) Today – zettabytes (107 bytes) or more (average Hard Drive ~ 2 TB) Datasets that are not amenable to traditional archival methods Relational database model not suitable Not all data is structured or the structure is too complex Data analysis not possible with existing software engineering tools Need for massive parallelization Newer algorithms to mine insights from data (data analytics) It is not the amount of data that is important – It is what you want to do with it

Big Data – Water Resources What is Big (deal) about Water Resources Data? Can quickly become very big if looking at large spatial scales Global change studies Can quickly become very big if data are being collected at a high temporal resolution Flood forecasting, early warning systems Data come from disparate sources Soil, water, land use, public perceptions Similar data can be collected from multiple sources Ground measurements, Radar, Satellite Direct measurements of what we really are after is many times not possible Don’t know groundwater pumping, but know likely water budget changes from GRACE Satellite Water Resources (big) data comes from sensors, devices, networks, log files, transactional applications, web, and social media — much of it generated in real time and at a very large scale.

Big Data – Water Resources Data needs in water resources decision making varies widely across applications Flood studies Data in order of minutes, hours to days Droughts Data in order of weeks, months, years Water Planning Data in the order of years to decades

Big Data – 5V – 7V Model Volume  Amount of data generated High volume, if data generated is not amenable to analysis Velocity  Rate at which data is generated and data is necessary Real-time processing of data being generated by sensor networks. Rate of data inflow >>> Rate of information generated Variety  Diversity of datasets necessary for a water resources application Structure of data, geospatial, temporal resolutions Variability  Meaning of data is changing constantly Public adaptation, public perception, Meaning of a 100 year flood? Visualization How can data be presented to make decisions Multivariate datasets, animation, balancing qualitative and quantitative data Veracity  How certain or uncertain is the data? Long-range climate projections, sensor accuracy Value  How has big data helped? Would our decisions be the same if we did not have this data? Worth of additional data?