The Keogh Lab 1 Presented by Abdullah Mueen. Overview of our work Our Goal: Extract information from raw, noisy, massive, unstructured data. We develop.

Slides:



Advertisements
Similar presentations
Indexing Time Series Based on original slides by Prof. Dimitrios Gunopulos and Prof. Christos Faloutsos with some slides from tutorials by Prof. Eamonn.
Advertisements

Time Series Shapelets: A New Primitive for Data Mining
SAX: a Novel Symbolic Representation of Time Series
Text mining Gergely Kótyuk Laboratory of Cryptography and System Security (CrySyS) Budapest University of Technology and Economics
Time Series Epenthesis: Clustering Time Series Streams Requires Ignoring Some Data Thanawin Rakthanmanon Eamonn Keogh Stefano Lonardi Scott Evans.
Data Mining Tools Overview Business Intelligence for Managers.
Prepared by Abdullah Mueen and Eamonn Keogh
Mining Mouse Vocalizations Jesin Zakaria Department of Computer Science and Engineering University of California Riverside.
F AST A PPROXIMATE C ORRELATION FOR M ASSIVE T IME - SERIES D ATA SIGMOD’10 Abdullah Mueen, Suman Nath, Jie Liu 1.
Mining Time Series.
A Compression Based Distance Measure for Texture Bilson J. L. Campana Eamonn J. Keogh University of California – Riverside 1.
Augmenting the Generalized Hough Transform to Enable the Mining of Petroglyphs Qiang Zhu, Xiaoyue Wang, Eamonn Keogh, 1 Sang-Hee Lee Dept. Of Computer.
User Manual of Mining Mouse Vocalizations Prepared by Jesin Zakaria and Eamonn Keogh.
© Prentice Hall1 DATA MINING TECHNIQUES Introductory and Advanced Topics Eamonn Keogh (some slides adapted from) Margaret Dunham Dr. M.H.Dunham, Data Mining,
Vector Space Information Retrieval Using Concept Projection Presented by Zhiguo Li
Finding Time Series Motifs on Disk-Resident Data
Visually Mining and Monitoring Massive Time Series Amy Karlson V. Shiv Naga Prasad 15 February 2004 CMSC 838S Images courtesy of Jessica Lin and Eamonn.
Copyright © 2004 Pearson Education, Inc.. Chapter 27 Data Mining Concepts.
EE465: Introduction to Digital Image Processing Copyright Xin Li 1 Introduction What is image segmentation?  Technically speaking, image segmentation.
Exact Discovery of Time Series Motifs This document was created to support our paper. It contains additional experiments and details which we could not.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
Pattern Matching with Acceleration Data Pramod Vemulapalli.
Mining Historical Archives for Near- Duplicate Figures Thanawin Rakthanmanon, Qiang Zhu, and Eamonn J. Keogh.
Data Mining GyuHyeon Choi. ‘80s  When the term began to be used  Within the research community.
Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.
Data Mining Eamonn Keogh. What is data mining? Generally, data mining (sometimes called data or knowledge discovery) is the process of analyzing data.
Discovering the Intrinsic Cardinality and Dimensionality of Time Series using MDL BING HU THANAWIN RAKTHANMANON YUAN HAO SCOTT EVANS1 STEFANO LONARDI EAMONN.
Publishing Articles in the STEM Field Eamonn Keogh UCR 2014.
Brandon Westover, Qiang Zhu, Jesin Zakaria, Eamonn Keogh
Data Mining with Oracle using Classification and Clustering Algorithms Presented by Nhamo Mdzingwa Supervisor: John Ebden.
Self Organization of a Massive Document Collection Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Teuvo Kohonen et al.
10/23/2015© Mohamed Medhat Gaber1 Adaptive Mobile ECG Analysis Dr Mohamed Medhat Gaber School of Computing University of Portsmouth
Data Mining By Dave Maung.
Mining Time Series.
1 CS 260 Winter 2014 Eamonn Keogh’s Presentation of Thanawin Rakthanmanon, Bilson Campana, Abdullah Mueen, Gustavo Batista, Brandon Westover, Qiang Zhu,
Abdullah Mueen Eamonn Keogh University of California, Riverside.
Semi-Supervised Time Series Classification & DTW-D REPORTED BY WANG YAWEN.
Srivastava J., Cooley R., Deshpande M, Tan P.N.
Kansas State University Department of Computing and Information Sciences CIS 830: Advanced Topics in Artificial Intelligence Wednesday, March 29, 2000.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
DTW-D: Time Series Semi-Supervised Learning from a Single Example Yanping Chen 1.
DATA MINING WITH CLUSTERING AND CLASSIFICATION Spring 2007, SJSU Benjamin Lam.
Foundations of Business Intelligence: Databases and Information Management.
Magic Camera Master’s Project Defense By Adam Meadows Project Committee: Dr. Eamonn Keogh Dr. Doug Tolbert.
Augmenting the Generalized Hough Transform to Enable the Mining of Petroglyphs Qiang Zhu, Xiaoyue Wang, Eamonn Keogh, 1 Sang-Hee Lee Dept. Of Computer.
Introduction to Data Mining by Yen-Hsien Lee Department of Information Management College of Management National Sun Yat-Sen University March 4, 2003.
Ohm’s Law Resistance in Series Circuits
NSF Career Award IIS University of California Riverside Eamonn Keogh Efficient Discovery of Previously Unknown Patterns and Relationships.
Abdullah Mueen 5 Slides Demo. Primitives for Time Series Data Mining ▪Time series motifs ▪Time series shapelets ▪Time series join 3/27/19962/24/19981/25/200012/25/200111/25/200310/25/20059/25/20078/25/20097/26/20116/26/2013.
Data Mining and Text Mining. The Standard Data Mining process.
DARE: Domain analysis and reuse environment Minwoo Hong William Frakes, Ruben Prieto-Diaz and Christopher Fox Annals of Software Engineering,
Oracle Advanced Analytics
Data Mining Association Analysis: Basic Concepts and Algorithms
What Is Cluster Analysis?
Matrix Profile I: All Pairs Similarity Joins for Time Series: A Unifying View that Includes Motifs, Discords and Shapelets Chin-Chia Michael Yeh, Yan.
Matrix Profile II: Exploiting a Novel Algorithm and GPUs to break the one Hundred Million Barrier for Time Series Motifs and Joins Yan Zhu, Zachary Zimmerman,
Fair Use Agreement This agreement covers the use of all slides on this CD-Rom, please read carefully. You may freely use these slides for teaching, if.
DCR Current Sensing DCR Current Sensing can be used to remove the additional conduction loss introduced by current sense resistor. The voltage across DCR.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Visually Mining and Monitoring Massive Time Series
At Last! Time Series Joins, Motifs, Discords and Shapelets at Interactive Speeds  Eamonn Keogh With Yan Zhu, Chin-Chia Michael Yeh, Abdullah Mueen with.
مقدمه اي بر داده کاوي و اکتشاف دانش
机器感知与智能教育部重点实验室学术报告 Key Laboratory of Machine Perception (Minister of Education) Peking University Scalable, Robust and Integrative Algorithms for Analyzing.
Research Areas Christoph F. Eick
Others Structure Prediction Clustering DATA MINING Association Rules
DATA MINING Introductory and Advanced Topics Part II - Clustering
Function Rules and Tables.
A Method for the Comparison of Criminal Cases using digital documents
Series and Parallel Resistors
Semi-Supervised Time Series Classification
Presentation transcript:

The Keogh Lab 1 Presented by Abdullah Mueen

Overview of our work Our Goal: Extract information from raw, noisy, massive, unstructured data. We develop algorithms for – Classification – Clustering – Rule finding – Motif discovery – Discord discovery – Shapelet discovery – Linkage discovery We work closely with the domain experts. – For collecting new data. – To verify our results. 2

Case 1: Motif Discovery 3 Beet Leafhopper ( Circulifer tenellus) plant membrane Stylet voltage source input resistor V to insect conductive glue voltage reading to soil near plant Exact Discovery of Time Series Motifs. Abdullah Mueen, Eamonn Keogh, Qiang Zhu, Sydney Cash, Brandon Westover. SDM MK motif discovery

false nettles stinging nettles Case 2: Shapelet Discovery 4 false nettles Shapelet stinging nettles Time Series Shapelets: A New Primitive for Data Mining. Lexiang Ye and Eamonn Keogh. SIGKDD 2009

Case 3: Linkage Discovery 5 CK CK CK-1 Distance Measure CK-1 Distance Single Linkage Dendrogram Print House 1Print House 2 A Compression Based Distance Measure for Texture. Bilson Campana and Eamonn Keogh. SDM 2010 text a hand-press book character matrix text ornamentstext

Lab Members Dr. Eamonn Keogh Dr. Gustavo Batista Abdullah Mueen Qiang Zhu Bilson Campana Thanawin Art R. Bing Hu Yuan Hao Jesin Zakaria 6

Motif in Online Data Maintain motif in streaming data without introducing latency. 7

Motion Motif 8 Find repeated motion in motion capture data which is a 32 dimensional time series.