Similarity Measure Based on Partial Information of Time Series Advisor : Dr. Hsu Graduate : You-Cheng Chen Author : Xiaoming Jin Yuchang Lu Chunyi Shi.

Slides:



Advertisements
Similar presentations
Informatik 4 Lab 1. Laboratory Exercise Overview 1. Define size of 20 radius vectors 2. DCT transformation 3. Create Microsoft Excel spreadsheet 4. Create.
Advertisements

Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns Lu Shi Oct. 4, 2004.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Web-Page Summarization Using Clickthrough Data Advisor.
Video Table-of-Contents: Construction and Matching Master of Philosophy 3 rd Term Presentation - Presented by Ng Chung Wing.
New Image Encryption and Compression Method Based on Independent Component Analysis.
ADVISE: Advanced Digital Video Information Segmentation Engine
Clustering over Multiple Evolving Streams by Events and Correlations Mi-Yen Yeh, Bi-Ru Dai, Ming-Syan Chen Electrical Engineering, National Taiwan University.
Mining Hierarchical Decision Rules from Hybrid Data with Categorical and Continuous Valued Attributes Miao Duoqian, Qian Jin, Li Wen, Zhang Zehua.
Based on Slides by D. Gunopulos (UCR)
Video summarization by graph optimization Lu Shi Oct. 7, 2003.
The Effectiveness Study of Music Information Retrieval Arbee L.P. Chen National Tsing Hua University 2002 ACM International CIKM Conference.
Orthogonality and Least Squares
A Multiresolution Symbolic Representation of Time Series
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Spatial and Temporal Databases Efficiently Time Series Matching by Wavelets (ICDE 98) Kin-pong Chan and Ada Wai-chee Fu.
Presented by Arun Qamra
Notes Over 11.3 Geometric Sequences
2008/06/06 Y.H.Chang Towards Effective Browsing of Large Scale Social Annotations1 Towards Effective Browsing of Large Scale Social Annotations WWW 2007.
Presented by Tienwei Tsai July, 2005
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A novel genetic algorithm for automatic clustering Advisor.
Clustering Seasonality Patterns in the Presence of Errors Advisor : Dr. Hsu Graduate : You-Cheng Chen Author : Mahesh Kumar Nitin R. Patel Jonathan Woo.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Keng-Wei Chang Author : Anthony K.H. Tung Hongjun Lu Jiawei Han Ling Feng 國立雲林科技大學 National.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A Comparison of SOM Based Document Categorization Systems.
PMLAB Finding Similar Image Quickly Using Object Shapes Heng Tao Shen Dept. of Computer Science National University of Singapore Presented by Chin-Yi Tsai.
Intelligent Database Systems Lab 1 Advisor : Dr. Hsu Graduate : Jian-Lin Kuo Author : Silvia Nittel Kelvin T.Leung Amy Braverman 國立雲林科技大學 National Yunlin.
Notes Over 11.2 Arithmetic Sequences An arithmetic sequence has a common difference between consecutive terms. The sum of the first n terms of an arithmetic.
The Landmark Model: An Instance Selection Method for Time Series Data C.-S. Perng, S. R. Zhang, and D. S. Parker Instance Selection and Construction for.
A genetic approach to the automatic clustering problem Author : Lin Yu Tseng Shiueng Bien Yang Graduate : Chien-Ming Hsiao.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Learning Phonetic Similarity for Matching Named Entity.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
A Fuzzy k-Modes Algorithm for Clustering Categorical Data
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author: Manoranjan.
A New Temporal Pattern Identification Method for Characterization and Prediction of Complex Time Series Events Advisor : Dr. Hsu Graduate : You-Cheng Chen.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Presenter : Yu Cheng Chen Author: YU-SHENG.
8.3 Geometric Sequences and Series Objectives: -Students will recognize, write, and find the nth terms of geometric sequences. -Students will find the.
Introduction to Real Analysis Dr. Weihu Hong Clayton State University 10/7/2008.
CHAPTER Continuity Series Definition: Given a series   n=1 a n = a 1 + a 2 + a 3 + …, let s n denote its nth partial sum: s n =  n i=1 a i = a.
Content-Based Image Retrieval Using Block Discrete Cosine Transform Presented by Te-Wei Chiang Department of Information Networking Technology Chihlee.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Juan D.Velasquez Richard Weber Hiroshi Yasuda 國立雲林科技大學 National.
V. Clustering 인공지능 연구실 이승희 Text: Text mining Page:82-93.
Hierarchical Model-Based Clustering of Large Datasets Through Fractionation and Refractionation Advisor : Dr. Hsu Graduate : You-Cheng Chen Author : Jeremy.
A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Unsupervised Learning with Mixed Numeric and Nominal Data.
1 A Fuzzy Logic Framework for Web Page Filtering Authors : Vrettos, S. and Stafylopatis, A. Source : Neural Network Applications in Electrical Engineering,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A hierarchical clustering algorithm for categorical sequence.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Chien-Shing Chen Author : Jessica K. Ting Michael K. Ng Hongqiang Rong Joshua Z. Huang 國立雲林科技大學.
Effective Anomaly Detection with Scarce Training Data Presenter: 葉倚任 Author: W. Robertson, F. Maggi, C. Kruegel and G. Vigna NDSS
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author: Wei Xu,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology ACM SIGMOD1 Subsequence Matching on Structured Time Series.
Intelligent Database Systems Lab Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author : Yongqiang Cao Jianhong Wu 國立雲林科技大學 National Yunlin University of Science.
ViSOM - A Novel Method for Multivariate Data Projection and Structure Visualization Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Hujun Yin.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Sanghamitra.
CS654: Digital Image Analysis Lecture 11: Image Transforms.
Using category-Based Adherence to Cluster Market-Basket Data Author : Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen Graduate : Chien-Ming Hsiao.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Chun Kai Chen Author : Andrew.
Mining Top-n Local Outliers in Large Databases Author: Wen Jin, Anthony K. H. Tung, Jiawei Han Advisor: Dr. Hsu Graduate: Chia- Hsien Wu.
Holt McDougal Algebra 2 Introduction to Sequences Holt Algebra 2Holt McDougal Algebra 2 How do we find the nth term of a sequence? How do we write rules.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Intelligent Exploration for Genetic Algorithms Using Self-Organizing.
Web-based acquisition of Japanese katakana variants
Self-Organizing Maps for Content-Based Image Database Retrieval
Hybrid Architecture of DCT/DFT/Wavelet Transform (HWT)
Sequences and Series.
Dimension reduction : PCA and Clustering
Color Image Retrieval based on Primitives of Color Moments
Warmup Solve 3 sin θ - 2 cos 2 θ =0 for all values of θ if θ is measured in degrees. A 30° + 360°k, 120°+ 360°k B 30°+360°k, 150° +360°k C 150° + 360°k,
Geometric Sequences and series
Color Image Retrieval based on Primitives of Color Moments
Presentation transcript:

Similarity Measure Based on Partial Information of Time Series Advisor : Dr. Hsu Graduate : You-Cheng Chen Author : Xiaoming Jin Yuchang Lu Chunyi Shi

Motivation Objective Introduction Retrieval and Representation of partial Information System Setup Results and Discussion Conclusions Personal Opinion Outline

Motivation A “ good ” similarity measurement is determined by human.

Objective To propose a model for the retrieval and representation of the partial information in time series.

Introduction The model has three objects: Get the partial information Represent partial information in a compressed form Most similarity model could be applied

Retrieval and Representation of Partial Information 3.1 General Description Definition 1: Use a rule F to decompose X into a set of time series

3.1 General Description Definition 2: (1) Segment X into a set of sub-series (2) X ’ jk is the k-th F-based component of sub-series X j Use mapping rule T to map each X’ jk to a value R k (j)

3.1 General Description is the orders of all the representing sequences of interest. Definition 3: where A n is the degree of user ’ s interest to n-th component is portion of partial information of interest

3.1 General Description Definition 4: is the full representing sequence(FRS) of the partial information

Definition 5: Given two time series X,Y 3.1 General Description

Sum up, a representing model for partial information can be summarized by 3.1 General Description Decomposition method F Representation method T Distance measurement D

3.1 General Description Example 1

3.1 General Description Use F to decompose time series to two components (1) Local fluctuating movement S ’ 1 (2) Global movement S ’ 2 FRS(X)=R 1 and the length of the FRS(X)=200/8

3.2 Practial Method Let H is transform matrix of a given orthonormal discrete transform So T j =H*X j We denote the results of discrete transform of time Series Xj and Yj by DT(Xj)=XT j, DT(Yj)=YT j

3.2 Practial Method The k-th component of X is The k-th representing sequence is Then FRS(X) can be calculated as:

Here we use DCT(discrete Cosine transform) in our experiments 3.2 Practial Method

4. System Setup 4.1 Evaluation of Similarity Measurement Based on Partial Information We use hierarchical agglomerative clustering(HAC) to cluster FRSs.

5. Results and Discussion We used historical stock data and only considered the time series of closing price. Step 1: use DCT to decompose time series and to represent partial information. Step 2: E=(E 1, …,E r ) to represent the chosen portion. Step 3: E was used to calculate K and together with A Then FRSs of each time series were generated Step 4: calculating MD and clustering

5. Results and Discussion 11,15,14,10,19,10,14,17,14 3, 3, 3, 3, 2, 4, 4, 5, 5

5. Results and Discussion

Conclusions The experimental results could help designing a more effective and more efficient similarity measurement

Personal Opinion The similarity measurement can be improved better by increasing the weight of the meaningful component.