Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Mining for Science and Engineering Presented by: Kenji Yoshigoe.

Slides:



Advertisements
Similar presentations
Is the use of computers and software to manage information. In some companies, this is referred to as Management Information Services (or MIS) or simply.
Advertisements

Current impacts of cloud migration on broadband network operations and businesses David Sterling Partner, i 3 m 3 Solutions.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Information Types and Registries Giridhar Manepalli Corporation for National Research Initiatives Strategies for Discovering Online Data BRDI Symposium.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21) NSF-wide Cyberinfrastructure Vision People, Sustainability, Innovation,
1-1 Management Information Systems for the Information Age Copyright 2004 The McGraw-Hill Companies, Inc. All rights reserved Chapter 1 THE INFORMATION.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Workshop on HPC in India Grid Middleware for High Performance Computing Sathish Vadhiyar Grid Applications Research Lab (GARL) Supercomputer Education.
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
Sales forecasting with SAS Advanced Analytics for the Pharmaceutical sector. A business case.
Computational Thinking Related Efforts. CS Principles – Big Ideas  Computing is a creative human activity that engenders innovation and promotes exploration.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Engineering & Physical Sciences Research Council.
Mining Large Data at SDSC Natasha Balac, Ph.D.. A Deluge of Data Astronomy Life Sciences Modeling and Simulation Data Management and Mining Geosciences.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
© 2011 IBM Corporation Smarter Software for a Smarter Planet The Capabilities of IBM Software Borislav Borissov SWG Manager, IBM.
Critical Emerging Network-Centric Applications Tele-control/tele-presence Defense Tele-medicine Remote plane/vehicle/robot control Distance learning Real-time.
Information Systems in MBS UG Studies Dr. Ilias Petrounias Room 3.19, MBS West
Designing the Microbial Research Commons: An International Symposium Overview National Academy of Sciences Washington, DC October 8-9, 2009 Cathy H. Wu.
Fundamentals of Information Systems, Fifth Edition
1 © 2003 Cisco Systems, Inc. All rights reserved. CIAG-HLS Security For Infrastructure Protection: Public-Private Partnerships KEN WATSON 15 OCT.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
Model-Driven Analysis Frameworks for Embedded Systems George Edwards USC Center for Systems and Software Engineering
Fundamentals of Information Systems, Seventh Edition 1 Chapter 3 Data Centers, and Business Intelligence.
Fisheries Oceanography Collaboration Software Donald Denbo NOAA/PMEL-UW/JISAO Presented by Nancy Soreide NOAA/PMEL AMS 2002/IIPS 10.3.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by Seishi Ninomiya (NARC-NARO) ICTs transforming agricultural science,
Advanced Networking Infrastructure and Research The Division includes two lines of effort: Networking Research at a point of explosive growth, new definition.
03/19/02Scalab Seminar Series1 Mapping the Gnutella Network Macroscopic Properties of Large Scale P2P Systems Ramaswamy N.Vadivelu Scalab, ASU.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
Geosciences - Observations (Bob Wilhelmson) The geosciences in NSF’s world consists of atmospheric science, ocean science, and earth science Many of the.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, March 29, 2000.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
Nora Sabelli, NSF What could data mining and retrieval contribute to the study of education?
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Marv Adams Chief Information Officer November 29, 2001.
9/03 Data Mining – Introduction G Dong (WSU)1 CS499/ Data Mining Fall 2003 Professor Guozhu Dong Computer Science & Engineering WSU.
Computational Science & Engineering meeting national needs Steven F. Ashby SIAG-CSE Chair March 24, 2003.
Challenges of Coping with Funding and Data Management in a Changing World Rick Lyons Director Infectious Disease Research Center.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
Oct 2005 page 1 The CIO of the Future – Changing the Dialogue Rolf Kubli, EDS EMEA Architects Office, CTO EDS Switzerland EGEE04 Industry Forum.
Tackling I/O Issues 1 David Race 16 March 2010.
© 2013 IBM Corporation 1 Title of presentation goes Elisa Martín Garijo IBM Distinguish Engineer and CTO for IBM Spain. Global Technology.
LIMPOPO DEPARTMENT OF ECONOMIC DEVELOPMENT, ENVIRONMENT AND TOURISM The heartland of southern Africa – development is about people! 2015 ICT YOUTH CONFERENCE.
NASA Earth Exchange (NEX) Earth Science Division/NASA Advanced Supercomputing (NAS) Ames Research Center.
Smart Grid Big Data: Automating Analysis of Distribution Systems Steve Pascoe Manager Business Development E&O - NISC.
1. 2 Quick Background I have an ecological background but I strayed……and ended up in computer science The good news is I have been able to blend the two.
Serving IT up with ITIL By Thane Price. IT is the laboratory’s pit crew  Goal : Make technology transparent while accomplishing valuable internal customer.
ITCube Business Process Outsourcing...! To Generate Your Business Revenue.
Popular Database Management Systems
Penn State Center for e-Design Site Vision and Capabilities
Organizations Are Embracing New Opportunities
Joslynn Lee – Data Science Educator
Joseph JaJa, Mike Smorul, and Sangchul Song
Emerging Trends in Information Technology
Recognising the vital role of the HE technician
RESEARCH, EDUCATION, AND TRAINING FOR THE SMART GRID
DATA COLLECTION, MANAGEMENT AND ANALYSIS
Model-Driven Analysis Frameworks for Embedded Systems
CUSTOMER RELATIONSHIP MANAGEMENT CONCEPTS AND TECHNOLOGIES
Data Warehousing and Data Mining
Computer Services Business challenge
Bird of Feather Session
The Barcelona Supercomputing Center
Computer Services Business challenge
Presentation transcript:

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Mining for Science and Engineering Presented by: Kenji Yoshigoe

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Why is Data Analytics Critical Now? Basis for next ‘paradigm-shifting’ revolution –Need to put the right information in the right person’s hand at the right time –Impacts: Weather, bioinformatics, biological sciences, emergency management, business intelligence, agricultural sciences, etc. Boundary-independent analysis and flow of data requires a flexible, standards-based, ‘open’ technical (software-driven) infrastructure –Design software systems to enable collaborative work To facilitate transformative, interdisciplinary research Based upon existing strengths and scaling up from current EPSCoR WiNS center effort

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Why is happening now? Estimated that in less than three years, the bytes of data generated by systems and devices will equal the number of grains of sand on all of the world's beaches 44 percent of large organizations (i.e., 1,000 employees or more) collect at least 1 terabyte of log file per month –11% collect over 10 TB – per month! Security and vulnerability management software products: 20 percent growth in ~$2.27B –More than 6 percent to 7 percent growth expected for the software industry as a whole

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 V&V Loop Performance Improvement Loop Data Bottlenecks…

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 The Gathering Storm… Significant advances in processing, networking and storage have exploded the availability of usable data –Need: corresponding advances in analyzing and utilizing gathered data –Fundamental advances in how we collect, transmit, protect and ‘process’ valuable information Impacts: Weather, bioinformatics, biological sciences, emergency management, business intelligence, agricultural sciences,

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Scientific Objective Problem: Data analytics – a fundamental ‘science’ barrier that can enable ‘transformative’ research in multiple disciplines –Dealing with data explosion - a critical issue in every domain Aim: A transformative infrastructure for computational data analytics leading to the development of a computational framework that is currently lacking –Develop novel algorithmic analysis strategies for data-driven dynamic systems –Apply these methods to transform current research across multiple disciplines –Data structures capable of handling large, irregular data patterns with historic, time and/or event dependent, evolutionary and semantic characteristics

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Summary Strength-based collaborative partnerships –UALR, UAF, UAMS, ASU, UAPB –Looking for others Contact –Srini Ramaswamy - /

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Why is this important? Underlying Issue –Dealing with data explosion has become a critical issue in every domain

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 Fundamental Barrier Across Science and Technology Discplines Googling for dirt –Many profitable new companies focusing on this – H5, Unityware, etc. –Law million documents on each other in response to discovery requests, automated intelligent searches are the only realistic solution

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008 * Global Disk storage per person GDSP (MB/person)

Oklahoma Supercomputing Symposium 2008 Oct 7 th 2008