Department of Computer Science 2015 Research Areas and Projects 1.Data Mining and Machine Learning Group (UH-DMML) Its research is focusing on: 1.Spatial.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Motion Patterns Alla Petrakova & Steve Mussmann. Trajectory Clustering Trajectory clustering is a well-established field of research in Data Mining area.
Chung Sheng CHEN, Nauful SHAIKH, Panitee CHAROENRATTANARUK, Christoph F. EICK, Nouhad RIZK and Edgar GABRIEL Department of Computer Science, University.
Introduction REU is the abbreviation for Research Experience for Undergraduates. The project on Hazard Mitigation is funded by the National Science Foundation.
UH Data Mining & Machine Learning Group May 1, 2009 Christoph F. Eick Department of Computer Science University of Houston A Domain-Driven Framework.
1 Modeling Evolution in Spatial Datasets Paul Amalaman 2/17/2012 Dr Eick Christoph Nouhad Rizk Zechun Cao Sujing Wang Data Mining and Machine Learning.
Discovering Interesting Regions in Spatial Data Sets Christoph F. Eick Department of Computer Science, University of Houston 1.Motivation: Examples of.
FACULTY SUCCESS Dr. Bernard Arulanandam Professor of Microbiology & Immunology; Jane and Roland Blumberg Professor in Biology; Associate Dean of Research.
Department of Computer Science 1 Data Mining / KDD Let us find something interesting! Definition := “KDD is the non-trivial process of identifying valid,
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
Discovering Interesting Regions in Spatial Data Sets Christoph F. Eick for the Data Mining Class 1.Motivation: Examples of Region Discovery 2.Region Discovery.
Grid-based Analysis in GIS
Use of GIS Monitoring California Resources. Data fusion, remote sensing and predicitions.
Department of Computer Science Research Areas and Projects 1. Data Mining and Machine Learning Group ( research.
Discovering Interesting Regions in Spatial Data Sets using Supervised Clustering Christoph F. Eick, Banafsheh Vaezian, Dan Jiang, Jing Wang PKDD Conference,
Name: Sujing Wang Advisor: Dr. Christoph F. Eick
A N A RCHITECTURE AND A LGORITHMS FOR M ULTI -R UN C LUSTERING Rachsuda Jiamthapthaksin, Christoph F. Eick and Vadeerat Rinsurongkawong Computer Science.
Extracting Regional Knowledge from Spatial Datasets Christoph F. Eick Department of Computer Science, University of Houston 1.Motivation: Why is Regional.
A Scalable Self-organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation Dmitri G. Roussinov Department of.
Department of Computer Science Research Areas and Projects 1. Data Mining and Machine Learning Group ( research.
Ch. Eick: Region Discovery Project Part3 Region Discovery Project Part3: Overview The goal of Project3 is to design a region discovery algorithm and evaluate.
1. Data Mining (or KDD) Let us find something interesting! Definition := “Data Mining is the non-trivial process of identifying valid, novel, potentially.
Using Inactivity to Detect Unusual behavior Presenter : Siang Wang Advisor : Dr. Yen - Ting Chen Date : Motion and video Computing, WMVC.
Data Mining & Machine Learning Group ACM-GIS08 Christoph Eick (University of Houston, USA), Rachana Parmar (University of Houston, USA), Wei Ding.
MOSAIC: A Proximity Graph Approach for Agglomerative Clustering Jiyeon Choo, Rachsuda Jiamthapthaksin, Chun-shen Chen, Ulvi Celepcikay, Christian Guisti,
Department of Computer Science 1 KDD / Data Mining Let us find something interesting!  Motivation: We are drowning in data, but we are staving for knowledge.
Data Mining & Machine Learning Group ADMA09 Rachsuda Jianthapthaksin, Christoph F. Eick and Ricardo Vilalta University of Houston, Texas, USA A Framework.
Data Mining & Machine Learning Group UH-DMML: Ongoing Data Mining Research Data Mining and Machine Learning Group, Computer Science Department,
COMPUTATIONAL BIOLOGIST DR. MARTIN TOMPA Place of Employment: University of Washington Type of Work: Develops computer programs and algorithms to identify.
Ch. Eick: Introduction to Hierarchical Clustering and DBSCAN 1 Remaining Lectures in Advanced Clustering and Outlier Detection 2.Advanced Classification.
Change Analysis in Spatial Datasets by Interestingness Comparison Vadeerat Rinsurongkawong, and Christoph F. Eick Department of Computer Science, University.
Data Mining and Machine Learning Group (UH-DMML) Wei Ding Rachana Parmar Ulvi Celepcikay Ji Yeon Choo Chun-Sheng Chen Abraham Bagherjeiran Soumya Ghosh.
Department of Computer Science Research Focus of UH-DMML Christoph F. Eick Data Mining Geographical Information Systems (GIS) High Performance Computing.
Discovering Interesting Regions in Spatial Data Sets Christoph F. Eick for Data Mining Class 1.Motivation: Examples of Region Discovery 2.Region Discovery.
Department of Computer Science 1 Data Mining / KDD Let us find something interesting! Definition := “KDD is the non-trivial process of identifying valid,
AegisDB: Integrated realtime geo-stream processing and monitoring system Chengyang Zhang Computer Science Department University of North Texas.
Data Mining & Machine Learning Group UH-DMML: Ongoing Data Mining Research Data Mining and Machine Learning Group, Computer Science Department, University.
Department of Computer Science Research Areas and Projects 1. Data Mining and Machine Learning Group ( research.
Corresponding Clustering: An Approach to Cluster Multiple Related Spatial Datasets Vadeerat Rinsurongkawong and Christoph F. Eick Department of Computer.
Dr. Chen, Data Mining  A/W & Dr. Chen, Data Mining Chapter 3 Basic Data Mining Techniques Jason C. H. Chen, Ph.D. Professor of MIS School of Business.
1. ABSTRACT Information access through Internet provides intruders various ways of attacking a computer system. Establishment of a safe and strong network.
Discovering Interesting Regions in Spatial Data Sets Christoph F. Eick for Data Mining Class 1.Motivation: Examples of Region Discovery 2.Region Discovery.
FACULTY EXTERNSHIP OPPORTUNITIES IN DATA SCIENCE AND DATA ANALYTICS Facilitated by: FilAm Software Technology, Clark Freeport Zone Ecuiti, San Francisco,
Big data classification using neural network
What Else is Important in AI we Did not Cover?
Presented by: Chung-Hsien Yu
Air Dispersion Modeling: Planning for Airborne Terrorism Release
Meeting 02/27/2017 Short Overview UH-DAIS Lab Research
Statistical Methods for Model Evaluation – Moving Beyond the Comparison of Matched Observations and Output for Model Grid Cells Kristen M. Foley1, Jenise.
Research Focus Objectives: The Data Analysis and Intelligent Systems (DAIS) Lab  aims at the development of data analysis, data mining, GIS and artificial.
Research Areas and Projects
Meeting 02/27/2017 Short Overview UH-DAIS Lab Research
University of Houston, USA
COSC 6335 Data Mining Fall 2009: Assignment3a Post Analysis
Data Analysis and Intelligent Systems Lab
Research Focus Objectives: The Data Analysis and Intelligent Systems (DAIS) Lab  aims at the development of data analysis, data mining, GIS and artificial.
PollutantsEventsDiseases
Data Analysis and Intelligent Systems Lab
Yongli Zhang, Sujing Wang, Amar Mani Aryal, and Christoph F. Eick
Research Areas Christoph F. Eick
Yongli Zhang and Christoph F. Eick University of Houston, USA
UH-COSC Events Today, 4-6p: Student Welcome Party
Data Analysis and Intelligent Systems Lab
UH-DMML: Ongoing Data Mining Research
Section 4: see other Slide Show
Section 4: see other Slide Show
Data Analysis and Intelligent Systems Lab
Brainstorming How to Analyze the 3AuCountHand Datasets
Spatial Data Mining Definition: Spatial data mining is the process of discovering interesting patterns from large spatial datasets; it organizes by location.
Discovery of Interesting Spatial Regions
Presentation transcript:

Department of Computer Science 2015 Research Areas and Projects 1.Data Mining and Machine Learning Group (UH-DMML) Its research is focusing on: 1.Spatial Data Mining 2.Clustering and Anomaly Detection 3.Classification and Prediction 4.GIS 2. Current and Planned Projects 1.Clustering Algorithms with Plug-in Fitness Functions and Other Non- Traditional Clustering Approaches 2.Analyzing and Doing Useful Things with Bio-aerosol Data quite new 3.Using Mixture Models for Anomaly Detection and Change Analysis quite new 4.Interestingness Scoping Algorithms for the Analysis of Spatial and Spatio-temporal Datasets 5.Taxonomy Generation—Learning Class Hierarchies from Training Data 6.Understanding, Preventing, and Recovery from Flooding just starting 7.Educational Data Mining (lead by Nouhad Rizk ) UH-DMML

Department of Computer Science 1. Non-Traditional Clustering Algorithms UH-DMML Clustering Algorithms With plug-in Fitness Functions Mining Spatio-Temporal Datasets Parallel Computing Prototype-based Clustering Agglomerative Clustering Algorithms Clustering Polygons and Trajectories Illustration of MOSAIC’s approach Input Output MOSAIC STAXAC CLEVER AVALANCHE

Department of Computer Science 2. Understanding and Doing Useful Things with Bio-areosol Data  Definition: A bioaerosol (short for biological aerosol) is a suspension of airborne particles that contain living organisms or were released from living organisms. [1] These particles are very small and range in size from less than one micrometer ( ") to one hundred micrometers (0.004").aerosol [1]  Research Questions  Characterization of the Bio-aerosol Composition at a Particular Location  Anomaly Detection and Change Analysis for Bio-aerosols  Understanding Disease Spread  Sensor-based Bio-aerosol Early Warning Systems  … UH-DMML [ 1] Wathes, Christopher M.; Cox, C. Barry (1995). Bioaerosols handbook. Chelsea, Mich: Lewis Publishers. ISBN ISBN

Department of Computer Science 3. Using Mixture Models for Anomaly Detection and Change Analysis The Sensor Modeling Toolbox will be used for the following tasks:  Change analysis and anomaly detection (based on sensor readings)  For creating background models of particular sensors at particular locations  Development of sophisticated threat assessment functions that operate on the top of the toolbox Sensor Modeling Toolbox Analysis Function 1... Set of Sensor Reading Model Fitting Probabilistic Model Analysis Function 2 Analysis Function k UH-DMML

Department of Computer Science Gaussian Mixture Models Data Set EM BIC/Akaike/… Model Selection

Department of Computer Science 4. Interestingness Hotspot Discovery Framework for Grids  Objective: Find interesting hotspots in 4D grid-based datasets using plugin interestingness functions.  Methodology:  Find hotspots in grid-based spatio-temporal datasets using hotspot discovery algorithms and clustering techniques. Employ plugin interestingness and reward functions to guide the search for “good” hotspots.  Generate cluster summaries  Visualize 4-dimensional spatio-temporal clusters and cluster summaries  Dataset: We are working on a 4-dimensional grid-based air pollution dataset.  Each grid cell overs a 4x4 km area. There are 150,000 4D grid cells.  Grid cells have latitude, longitude, layer (altitude), and time dimensions.  Each grid cell is associated with hourly observations of 132 compounds in the air. UH-DMML Low variation hotspots

Department of Computer Science Interestingness Hotspot Discovery Framework for Grids UH-DMML Ozone PM2.5 Ozone concentration in the region PM2.5 concentration in the region

Department of Computer Science 5. Taxonomy Generation M Taxonomy Generation Algorithm Datasets UH-DMML

Department of Computer Science 6. Understanding, Preventing, and Recovery from Flooding UH-DMML UH CeSAR Symp. 7/24/2015 Center for Sustainability and Resiliency

Department of Computer Science Helping Scientists to Make Sense Out of their Data Figure 1: Co-location regions involving deep and shallow ice on Mars Figure 2: Interestingness hotspots where both income and CTR are high. Figure 3: Mining hurricane trajectories UH-DMML

Department of Computer Science Some UH-DMML Graduates 1 Christoph F. Eick Dr. Wei Ding, Associate Professor, Department of Computer Science, University of Massachusetts, Boston Sharon M. Tuttle, Professor, Department of Computer Science, Humboldt State University, Arcata, California Christopher T. Ryu, Professor, Department of Computer Science, California State University, Fullerton Sujing Wang, Assistant Professor, Department of Computer Science, Lamar University, Beaumont, Texas

Department of Computer Science Some UH-DMML Graduates 2 Christoph F. Eick Chun-sheng Chen, PhD Amazon Chong Wang, MS Haliburton Justin Thomas MS Section Supervisor at Johns Hopkins University Applied Physics Laboratory Mei-kang Wu MS Microsoft, Bellevue, Washington Jing Wang MS AOL, California Rachsuda Jiamthapthaksin PhD Faculty, Assumption University, Bangkok, Thailand

Department of Computer Science Students in the UH-DMML Research Group UH-DMML PhD Students: Yongli Zhang, Fatih Akdag, Nguyen Pham, Chong Wang and Paul Amalaman. Master Students: Puja Anchlia, Riny Hutapea and Rohit Jidagam. Undergraduate Students: none at the moment