1 Mining Images of Material Nanostructure Data Aparna S. Varde, Jianyu Liang, Elke A. Rundensteiner and Richard D. Sisson Jr. ICDCIT December 2006 Bhubaneswar,

Slides:



Advertisements
Similar presentations
Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.
Advertisements

Ranking Multimedia Databases via Relevance Feedback with History and Foresight Support / 12 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT AND EXPLORATION.
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.
1 Learning Semantics-Preserving Distance Metrics for Clustering Graphical Data Aparna S. Varde, Elke A. Rundensteiner, Carolina Ruiz, Mohammed Maniruzzaman.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
A New Biclustering Algorithm for Analyzing Biological Data Prashant Paymal Advisor: Dr. Hesham Ali.
Jeff Shen, Morgan Kearse, Jeff Shi, Yang Ding, & Owen Astrachan Genome Revolution Focus 2007, Duke University, Durham, North Carolina Introduction.
1 This work partially funded by NSF Grants IIS , IRIS and IIS Matthew O. Ward, Elke A. Rundensteiner, Jing Yang, Punit Doshi, Geraldine.
Object retrieval with large vocabularies and fast spatial matching
Reflective Symmetry Detection in 3 Dimensions
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Bioinformatics and Phylogenetic Analysis
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Data Mining As A Continuous Auditing Tool for “Soft Information”: A Research Question A Research Proposal By J. Donald Warren, Jr. Rutgers University Fifth.
Structural Knowledge Discovery Used to Analyze Earthquake Activity Jesus A. Gonzalez Lawrence B. Holder Diane J. Cook.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
1 LearnMet: Learning a Domain-Specific Distance Metric for Graph Mining Aparna S. Varde Update on Ph.D. Research Committee Prof. Elke Rundensteiner (Advisor)
Subdue Graph Visualizer by Gayathri Sampath, M.S. (CSE) University of Texas at Arlington.
The QuenchMiner ™ Expert System for Quenching and Distortion Control Aparna S. Varde, Mohammed Maniruzzaman, Elke Rundensteiner and Richard D. Sisson Jr.
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
1 Augmenting MatML with Heat Treating Semantics Aparna Varde, Elke Rundensteiner, Murali Mani Mohammed Maniruzzaman and Richard D. Sisson Jr. Worcester.
Computational Estimation of Heat Transfer Curves for Microstructure Prediction and Decision Support Aparna S. Varde, Mohammed Maniruzzaman, Elke A. Rundensteiner.
Data Mining – Intro.
WPI Center for Research in Exploratory Data and Information Analysis From Data to Knowledge: Exploring Industrial, Scientific, and Commercial Databases.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Data Mining : Introduction Chapter 1. 2 Index 1. What is Data Mining? 2. Data Mining Functionalities 1. Characterization and Discrimination 2. MIning.
Data Mining Techniques
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence From Data Mining To Knowledge.
Data Mining Chun-Hung Chou
A Few Answers Review September 23, 2010
Chapter 10 Artificial Intelligence. © 2005 Pearson Addison-Wesley. All rights reserved 10-2 Chapter 10: Artificial Intelligence 10.1 Intelligence and.
1 A Bayesian Method for Guessing the Extreme Values in a Data Set Mingxi Wu, Chris Jermaine University of Florida September 2007.
The isosurface is a 3D reconstruction of the DiO dataset. The surface structure exhibits the shape of the dendritic spine and color exhibits the concentration.
Interactive Discovery and Semantic Labeling of Patterns in Spatial Data Thomas Funkhouser, Adam Finkelstein, David Blei, and Christiane Fellbaum Princeton.
GA-Based Feature Selection and Parameter Optimization for Support Vector Machine Cheng-Lung Huang, Chieh-Jen Wang Expert Systems with Applications, Volume.
Designing Semantics-Preserving Cluster Representatives for Scientific Input Conditions Aparna Varde, Elke Rundensteiner, Carolina Ruiz, David Brown, Mohammed.
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Data Mining Knowledge on rough set theory SUSHIL KUMAR SAHU.
Adaptive Data Visualization Packet Information Collection and Transformation for Network Intrusion Detection and Prevention Richard A. Aló,
Survey Methodology Lilian Ma November 6, Three aspects 1. How questions were designed 2. How data was collected 3. How samples were drawn Probability.
Relevance Feedback in Image Retrieval Systems: A Survey Part II Lin Luo, Tao Huang, Chengcui Zhang School of Computer Science Florida International University.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
3-1 Data Mining Kelby Lee. 3-2 Overview ¨ Transaction Database ¨ What is Data Mining ¨ Data Mining Primitives ¨ Data Mining Objectives ¨ Predictive Modeling.
Nanoscale Science and Engineering. Nanoscale Science and Engineering embodies fundamental research and technology development of materials, structures,
VisDB: Database Exploration Using Multidimensional Visualization Maithili Narasimha 4/24/2001.
VizDB A tool to support Exploration of large databases By using Human Visual System To analyze mid-size to large data.
Data Mining and Decision Trees 1.Data Mining and Biological Information 2.Data Mining and Machine Learning Techniques 3.Decision trees and C5 4.Applications.
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
Using decision trees to build an a framework for multivariate time- series classification 1 Present By Xiayi Kuang.
Web-based Data Mining for Quenching Data Analysis Aparna S. Varde, Makiko Takahashi, Mohammed Maniruzzaman, Richard D. Sisson Jr. Center for Heat Treating.
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
Surface Defect Inspection: an Artificial Immune Approach Dr. Hong Zheng and Dr. Saeid Nahavandi School of Engineering and Technology.
Why Intelligent Data Analysis? Joost N. Kok Leiden Institute of Advanced Computer Science Universiteit Leiden.
The KDD Process for Extracting Useful Knowledge from Volumes of Data Fayyad, Piatetsky-Shapiro, and Smyth Ian Kim SWHIG Seminar.
Data Mining – Intro.
Face Detection EE368 Final Project Group 14 Ping Hsin Lee
Introduction to Data Mining
Personalized Social Image Recommendation
Shape matching and object recognition using shape contexts
Research Areas Christoph F. Eick
What Visualization can do for Data Clustering?
Data Warehousing and Data Mining
CSc4730/6730 Scientific Visualization
Automating Domain-Type-Dependent Data Mining as a Computational Estimation Technique for Decision Support in Materials Science Ph.D. Dissertation Proposal.
Data Mining Classification: Alternative Techniques
Resource Allocation for Distributed Streaming Applications
Nano Technology Dr. Raouf Mahmood. Nano Technology Dr. Raouf Mahmood.
Presentation transcript:

1 Mining Images of Material Nanostructure Data Aparna S. Varde, Jianyu Liang, Elke A. Rundensteiner and Richard D. Sisson Jr. ICDCIT December 2006 Bhubaneswar, India

2 Introduction Data Mining: Process of discovering interesting patterns in data sets Mining Scientific Data Bioinformatics Materials Science Nanotechnology

3 Field that involves Design, characterization, production, application of Structures, devices and systems by controlling Shape, size, structure and chemistry of materials At the nanoscale level Data from nanotechnology Images of nanostructures Carbon Nanofibers Cobalt Nanowire Arrays Silicon Nanopore Array

4 Domain-Specific Analysis What is the difference in nanostructure at various locations of a given sample? How does the nanostructure evolve at different stages of a physical / chemical / biochemical process? How does processing under different conditions affect interactions at the same stage of a process?

5 Goals of Analysis in Applications Fabrication of biological nanostructures Materials for implants in human body Building computational tools Useful for tutoring, simulation, estimation Selection of materials for industrial processes Studying smaller samples helps large scale selection

6 Image Mining Techniques Clustering Similarity Search Target ImageTop 4 Matches

7 Challenges in Mining Nanostructure Image Data Learning Notion of Similarity Defining Interestingness Measures Visualizing Mining Results

8 Learning Notion of Similarity Some features of images may be more important than others Experts at best have subjective notions of similarity Need to learn a similarity measure that captures domain semantics

9 Domain Semantics Nanoparticle size Dimension of each particle in nanostructure Inter-particle distance Distance between particles in 2-D space Nanoparticle height Projection of particles above surface Zoom Level of magnification of images Location Part of sample where image taken

10 Proposed Learning Approach: FeaturesRank Given: Training samples with pairs of images and levels of similarity identified Learn: Distance function that incorporates image features and their relative importance Process: Iterative approach Use guessed initial distance function Compare obtained clusters with training samples Adjust function based on error between clusters and samples Return distance function with minimal error

11 Issues in FeaturesRank Defining suitable notion of error Proposing weight adjustment heuristics Assessing effectiveness of learned distance function Addressed in our paper [VRJSL:07]

12 Defining Interesting Measures What is interesting to the user Assessment of mining results Displaying the answers Objective measures for interestingness Take into account targeted applications Our work on cluster representatives [VRRMS:06] Minimum Description Length principle

13 Visualizing Mining Results Potential use of Visualization Techniques for Multidimensional Data Example: Star glyphs plot for heat transfer curves [VTRWMS:03] Vertex: Attribute Distance from center of star: Value

14 Related Work Similarity Search in Multimedia Databases [KB:04]: Overview metrics, do not learn a function Interestingness Measures for Association Rules, Decision Trees [HK:01]: Objective measures, not directly applicable to our work, draw an analogy XMDV Tool for Visualization of Multivariate Data [W:94]: Possible adaptation in this context

15 Conclusions Mining Nanostructure Images Domain Specific Analysis Targeted Applications Biological Nanostructures Computational Tools Industrial Processes Challenges Learning Notion of Similarity Defining Interestingness Measures Visualizing Mining Results