A Paradigm for Space Science Informatics Kirk D. Borne George Mason University and QSS Group Inc., NASA-Goddard or

Slides:



Advertisements
Similar presentations
Web Mining.
Advertisements

An Introduction to Data Mining
Chapter 5: Introduction to Information Retrieval
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Kansas State University Department of Computing and Information Sciences Laboratory for Knowledge Discovery in Databases (KDD) KDD Group Research Seminar.
Data Mining Sangeeta Devadiga CS 157B, Spring 2007.
Data Mining Techniques Cluster Analysis Induction Neural Networks OLAP Data Visualization.
Sébastien Derriere IVOA interoperability meeting Victoria 2010 may 21 Semantics summary.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
Databases – A Key to Unlocking the Future Database Efficacy – Uses in the Classroom.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
University of Minnesota
KDD for Science Data Analysis Issues and Examples.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Presented To: Madam Nadia Gul Presented By: Bi Bi Mariam.
Business Intelligence
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
GUHA method in Data Mining Esko Turunen Tampere University of Technology Tampere, Finland.
OLAM and Data Mining: Concepts and Techniques. Introduction Data explosion problem: –Automated data collection tools and mature database technology lead.
Data Warehouse Fundamentals Rabie A. Ramadan, PhD 2.
9/30/2004TCSS588A Isabelle Bichindaritz1 Introduction to Bioinformatics.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Intelligent Systems Lecture 23 Introduction to Intelligent Data Analysis (IDA). Example of system for Data Analyzing based on neural networks.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Understanding Data Analytics and Data Mining Introduction.
Tang: Introduction to Data Mining (with modification by Ch. Eick) I: Introduction to Data Mining A.Short Preview 1.Initial Definition of Data Mining 2.Motivation.
Copyright R. Weber Machine Learning, Data Mining ISYS370 Dr. R. Weber.
Data Clustering 1 – An introduction
Chapter 1 Introduction to Data Mining
Beyond Co-occurrence: Discovering and Visualizing Tag Relationships from Geo-spatial and Temporal Similarities Date : 2012/8/6 Resource : WSDM’12 Advisor.
Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, March 29, 2000.
Data resource management
Chapter 5: Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization DECISION SUPPORT SYSTEMS AND BUSINESS.
The end of geographic theory ? Prospects for model discovery in the geographic domain Mark Gahegan Centre for eResearch & Dept. Computer Science University.
Data Mining: Knowledge Discovery in Databases Peter van der Putten ALP Group, LIACS Pre-University College LAPP-Top Computer Science February 2005.
DDM Kirk. LSST-VAO discussion: Distributed Data Mining (DDM) Kirk Borne George Mason University March 24, 2011.
1 Introduction to Data Mining C hapter 1. 2 Chapter 1 Outline Chapter 1 Outline – Background –Information is Power –Knowledge is Power –Data Mining.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Knowledge Modeling and Discovery. About Thetus Thetus develops knowledge modeling and discovery infrastructure software for customers who: Have high-value.
Data Mining and Decision Support
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Smart Web Search Agents Data Search Engines >> Information Search Agents - Traditional searching on the Web is done using one of the following three: -
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Book web site:
Cluster Analysis This work is created by Dr. Anamika Bhargava, Ms. Pooja Kaul, Ms. Priti Bali and Ms. Rajnipriya Dhawan and licensed under a Creative Commons.
Oracle Advanced Analytics
Data Mining Functionalities
Data Mining.
Biological Databases By: Komal Arora.
DATA MINING © Prentice Hall.
Introduction C.Eng 714 Spring 2010.
Knowledge Management Systems
Multimedia Information Retrieval
Data Mining Modified from
כריית מידע -- מבוא ד"ר אבי רוזנפלד.
Data Warehousing and Data Mining
About Thetus Thetus develops knowledge discovery and modeling infrastructure software for customers who: Have high value data that does not neatly fit.
Data Mining.
Presentation transcript:

A Paradigm for Space Science Informatics Kirk D. Borne George Mason University and QSS Group Inc., NASA-Goddard or Timothy E. Eastman (presenter) QSS Group Inc., NASA-Goddard and

5/26/ What is Informatics? Informatics is the discipline of structuring, storing, accessing, and distributing information describing complex systems. Examples: 1.Bioinformatics 2.Geographic Information Systems (= Geoinformatics) 3.New! Space Science Informatics Common features of X-informatics: –Basic data unit is defined –Common community tools operate on data units –Data-centric and Information-centric approaches –Data-driven science –X-informatics is key enabler of scientific discovery in the era of large data science

5/26/ X-Informatics Compared Discipline X Bioinformatics Geoinformatics Space Sc. Informatics Common Tools BLAST, FASTA GIS CDAWeb, Bayes Inference, Cross Correlations, Principal Components Data Unit Gene Sequence Points, Vectors, Polygons Time Series, Event Lists, Catalogs, Object Parameters

5/26/ Data-Information-Knowledge-Wisdom T.S. Eliot (1934): “Where is the wisdom we have lost in knowledge? Where is the knowledge we have lost in information?”

5/26/ Key Role of Data Mining Data Mining = an information extraction activity whose goal is to discover hidden knowledge contained in large databases Data Mining is used to find patterns and relationships in the data Data Mining is also called KDD –KDD = Knowledge Discovery in Databases Data Mining is the killer app for scientific databases Examples: –Clustering Analysis = group together similar items and separate dissimilar items –Classification Prediction = predict the class label –Regression = predict a numeric attribute value –Association Analysis = detect attribute-value conditions that occur frequently together

5/26/ Space Science Knowledge Discovery

5/26/ Space Weather Example

5/26/ Space Science Informatics Key enabler for new science discovery in large databases Large data science is here to stay Common data browse and discovery tools, and common data structures, will enable exponential knowledge discovery within exponentially growing data collections X-informatics represents the 3 rd leg of scientific research: experiment, theory, and data-driven exploration Space Science Informatics should parallel Bioinformatics and Geoinformatics: become a stand-alone research sub-discipline

5/26/ Future Work: Informatics Applications Query-By-Example (QBE) science data systems: 1.“Find more data entries similar to this one” 2.“Find the data entry most dissimilar to this one” Automated Recommendation (Filtering) Systems: 1.“Other users who examined these data also retrieved the following...” 2.“Other data sets that are relevant to this data set include...” Information Retrieval Metrics for Scientific Databases: 1.Precision: “How much of the retrieved data is relevant to my query?” 2.Recall: “How much of the relevant data did my query retrieve?” Semantic Annotation (Tagging) Services: –Report discoveries back to the science database for community reuse Science / Technical / Math (STEM) Education: –Transparent reuse and analysis of scientific data in inquiry-based classroom learning ( DLESE.org ) Key concepts that need defining (by community consensus): Similarity, Relevance, Semantics (dictionaries, ontologies)