Jessica Lin Eamonn Keogh Stefano Lonardi

Presentation transcript:

Visualizing and discovering non-trivial patterns in large time series databases. Jessica Lin, Eamonn Keogh, Stefano Lonardi. Presented by Thomas Lotze

Purpose: Motifs and Anomalies Timeseries visualization Motifs (frequently occurring patterns) Anomalies

Semantic Representation: SAX Equiprobable a c d c b d b a
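A minimal sketch of how a series could be turned into a word such as "a c d c b d b a", assuming the usual SAX (Symbolic Aggregate approXimation) recipe: z-normalize, average over equal-length segments, then cut the value range into regions that are equiprobable under a standard normal distribution. The function name `sax` and its parameters are illustrative and are not taken from the slides.

```python
from statistics import NormalDist, mean, pstdev


def sax(series, n_segments, alphabet="abcd"):
    """Discretize a numeric series into a SAX-style word (illustrative sketch)."""
    if len(series) % n_segments != 0:
        # Simplifying assumption: the series splits evenly into segments.
        raise ValueError("series length must be a multiple of n_segments")

    # Step 1: z-normalize so the values are comparable to a standard normal.
    mu, sigma = mean(series), pstdev(series)
    if sigma == 0:
        normed = [0.0 for _ in series]
    else:
        normed = [(x - mu) / sigma for x in series]

    # Step 2: piecewise aggregate approximation, one mean per segment.
    seg_len = len(normed) // n_segments
    paa = [mean(normed[i * seg_len:(i + 1) * seg_len]) for i in range(n_segments)]

    # Step 3: breakpoints that split the standard normal into equiprobable regions.
    k = len(alphabet)
    breakpoints = [NormalDist().inv_cdf(i / k) for i in range(1, k)]

    # Step 4: each segment mean gets the symbol of the region it falls in.
    word = ""
    for value in paa:
        region = sum(value > b for b in breakpoints)
        word += alphabet[region]
    return word


print(sax([1, 2, 3, 4, 5, 6, 7, 8], n_segments=4))  # steadily rising series
```

A larger alphabet gives finer value resolution, and more segments give finer time resolution; both also make the resulting tree wider or deeper.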

Sequence Trees Set 1: Set 2: 000 100 010 101 001 101 010 101 000 100 010 101 001 101 010 101 010 110 010 101 011 111 010 101

Subsequence Trees

Subsequence Trees on Timeseries Sliding window Discretized using SAX Store the subsequence pattern Display as tree Thickness: frequency of pattern Color: grey if none exist
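The steps on this slide can be sketched roughly as follows, assuming each sliding window has already been discretized into a word. The nested-dict layout and the helper names `build_subsequence_tree` and `branch_frequency` are hypothetical, not VizTree's own data structure. A branch's count is what would drive its thickness; a count of zero corresponds to a grey branch.

```python
def build_subsequence_tree(words):
    """Count how many words pass through each branch of a symbol tree."""
    root = {"count": 0, "children": {}}
    for word in words:
        node = root
        node["count"] += 1
        for symbol in word:
            # Create the child branch on first use, then count the visit.
            node = node["children"].setdefault(
                symbol, {"count": 0, "children": {}}
            )
            node["count"] += 1
    return root


def branch_frequency(root, prefix):
    """Return the number of stored words that start with `prefix`."""
    node = root
    for symbol in prefix:
        node = node["children"].get(symbol)
        if node is None:
            return 0  # no subsequence follows this branch (drawn grey)
    return node["count"]


# Words as they might come out of discretizing four sliding windows.
tree = build_subsequence_tree(["abd", "abc", "abd", "cba"])
print(branch_frequency(tree, "ab"))   # thick branch
print(branch_frequency(tree, "dd"))   # absent branch
```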

Numerosity Reduction Avoiding Trivial matches (neighbors) Same as previous MINDIST No overlap None Non-monotonic Can make a big difference!!!
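Two of the options listed here, "same as previous" and "no overlap", can be sketched as below, assuming `words` holds one discretized word per sliding-window position. The function names are illustrative. The MINDIST and non-monotonic options are not shown.

```python
def reduce_same_as_previous(words):
    """Keep a word only if it differs from the last word that was kept."""
    kept = []
    for position, word in enumerate(words):
        if not kept or kept[-1][1] != word:
            kept.append((position, word))
    return kept


def reduce_no_overlap(words, window):
    """Keep only words whose windows do not overlap (step = window length)."""
    return [(position, words[position]) for position in range(0, len(words), window)]


words = ["aab", "aab", "aab", "abb", "abb", "aab"]
print(reduce_same_as_previous(words))
print(reduce_no_overlap(words, window=3))
```

Keeping the positions alongside the words lets the kept subsequences still be located in the original series. Runs of identical neighboring words (trivial matches) collapse to one entry under the first strategy, which is why the choice can change the pattern counts substantially.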

VizTree Demo Remember to note: - pixels required only depends on number of characters, depth…

Comparing Series Support: Difference in frequency of pattern Confidence: Average of how significant this pattern is Surprisingness: Support x Confidence Mention Bayesian method if the support is 0 (i.e., if the pattern does not occur in the time series) Maybe just talk about this during the demo (or do contrast portion of demo after this slide)
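A rough sketch of scoring patterns when contrasting two series. Support is taken as the difference in relative frequency and surprisingness as support x confidence, as on the slide. The slide does not give a formula for confidence, so the one used here (absolute difference divided by the larger of the two frequencies) is an assumption, as are the function and variable names. The Bayesian handling mentioned in the notes is not attempted.

```python
from collections import Counter


def compare_patterns(reference_words, test_words):
    """Score each pattern as (support, confidence, surprisingness)."""
    if not reference_words or not test_words:
        raise ValueError("both word lists must be non-empty")

    ref_counts, test_counts = Counter(reference_words), Counter(test_words)
    scores = {}
    for pattern in set(ref_counts) | set(test_counts):
        f_ref = ref_counts[pattern] / len(reference_words)
        f_test = test_counts[pattern] / len(test_words)
        # Support: difference in frequency of the pattern between the series.
        support = f_test - f_ref
        # Confidence: ASSUMED formula, relative size of that difference.
        confidence = abs(support) / max(f_ref, f_test)
        scores[pattern] = (support, confidence, support * confidence)
    return scores


scores = compare_patterns(["ab", "ab", "ba", "bb"], ["ab", "ba", "ba", "ba"])
for pattern, values in sorted(scores.items()):
    print(pattern, values)
```

With this sign convention a positive score marks a pattern that is more frequent in the test series and a negative score one that is more frequent in the reference series.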

Likes Tree structure visualization Mapping using color and thickness Automatic Pattern Identification Speed of computation Generality of patterns Simultaneous view of subsequences matching a pattern/motif Can zoom to different tree levels/sections Remember to mention the generality is especially nice because it

Wishlist Dynamic parameter response Sliders for window size/character parameters Automatic suggestions for parameters Select individual subsequences in simultaneous time-zoom panel HCIL-style selectors for tree focus Maybe a human-assisting time series clusterer? Allowing for different lengths of patterns Allow user to “kick a subsequence out” from a cluster Allow user to recluster the unclustered More possible patterns Functional Data Analysis? Allow “no pattern overlap” or fuzzy overlap…fuzzy length?...somehow? Okay, now I’m just dreaming… …when lots of neighbors (relative to window size) have the same pattern, perhaps that indicates that the window should be longer?

Calendar Clusters van Wijk, van Selow Office hours are followed strictly. Most people arrive between 8:30 and 9:00 am, and leave between 4:00 and 5:00 pm. Furthermore, in the morning the number of employees present is slightly higher than in the afternoon. On Fridays and in the summer fewer people are present (cluster 722); On Fridays in the summer even fewer people are present (cluster 718); In the weekend and at holidays only very few people are working (cluster 710): security and fire brigade; Holidays in the Netherlands in 1997 were January 1st, March 28th, March 31st, April 30th, May 5th, May 8th, May 19th, December 25th and 26th. School vacations are visible in Spring (May 3rd to May 11th), in Autumn (October 11th to October 19th), and in Winter (December 21st to December 31st); Many people take a day off after a holiday (cluster 721); On December 5th many people left at 4:00 PM. Dutch people will immediately know the explanation: On this day we celebrate Santa Claus and are allowed to leave earlier!

Spirals Marc Weber Marc Alexa Wolfgang Müller

Periodicity/Pattern Length Now: known in advance Find good candidates? spectral analysis? non-constant period? Visual insight scroll/slide through different window lengths see when patterns start to coalesce Note that even VizTree requires knowing the length of the subsequence motif…but it does allow for varying space between the subsequence occurrences
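One possible reading of "spectral analysis?" as a way to find candidate periods is sketched below: compute a plain discrete Fourier transform of the mean-centered series and rank periods by spectral power. This is an illustration only, not something VizTree or the spiral technique is stated to do, and the name `candidate_periods` is made up here. It assumes a roughly constant period, so it does not address the non-constant case raised on the slide.

```python
import cmath
import math


def candidate_periods(series, top=3):
    """Return the `top` periods (in samples) with the highest spectral power."""
    n = len(series)
    mu = sum(series) / n
    centered = [x - mu for x in series]  # remove the constant offset

    ranked = []
    for k in range(1, n // 2 + 1):
        # DFT coefficient at frequency k cycles per series length.
        coeff = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(centered)
        )
        ranked.append((abs(coeff) ** 2, n / k))  # (power, period)

    ranked.sort(reverse=True)
    return [period for _, period in ranked[:top]]


# A sine wave that repeats every 8 samples.
wave = [math.sin(2 * math.pi * t / 8) for t in range(64)]
print(candidate_periods(wave, top=1))
```

The returned periods could serve as starting window lengths to scroll around visually. The direct DFT here is quadratic in the series length, so a long series would call for an FFT instead.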