Presentation is loading. Please wait.

Presentation is loading. Please wait.

Date:2004/03/05 Mining Frequent Episodes for relating Financial Events and Stock Trends Anny Ng and Ada Wai-chee Fu PAKDD 2003 報告者: Ming Jing Tsai.

Similar presentations


Presentation on theme: "Date:2004/03/05 Mining Frequent Episodes for relating Financial Events and Stock Trends Anny Ng and Ada Wai-chee Fu PAKDD 2003 報告者: Ming Jing Tsai."— Presentation transcript:

1 date:2004/03/05 Mining Frequent Episodes for relating Financial Events and Stock Trends Anny Ng and Ada Wai-chee Fu PAKDD 2003 報告者: Ming Jing Tsai

2 Definition  Events : financial news,political …  e 1,e 2,e 3 ….,e k : event types  day record D i :{e i1,e i2,e i3 ….,e ik }  Episode:{e 1,e 2,e 3 ….,e k } , has at least two elements and at least one e j is a stock event type  Window = x days

3 Definition  Window frequency : number of windows that contains an event type  DB frequency : number of occurrences of an event type in DB  Frequency of an episode (ex) number of windows the first day of window contains at least one of the event types in episode.

4 Construct event tree  Header in descending db frequencies order  Event_set pair sorted in the descending db frequencies  node : E :event type,c :counts,b :binary bit

5 Pruning method  window frequencies < min_sup  Remove duplicate event type in both firstday part and remaining day part

6 daysevents 1b 2ac 3b 4d 5b 6ca 7d Window = 3,min_sup =3 An Event database Db frequencies

7 windows windowDay included Event_set pairs 11,2,3 22,3,4 33,4,5 44,5,6 55,6,7 66,7 77 Ordered frequent event type Window frequencies Window = 3,min_sup =3

8 {null} {b:1:0} {a:1:1} {c:1:1} {a:1:0} {c:1:0} {b:1:1} {d:1:1} b a c d

9 {null} {b:2:0} {a:1:1} {c:1:1} {a:1:0} {c:1:0} {b:1:1} b a c d {d:1:0} {b:1:1} {a:1:1} {c:1:1} {d:1:1}

10 {null} {b:3:0} {a:1:1} {c:1:1} {a:1:0} {c:1:0} {b:1:1} b a c d {d:1:0} {b:1:1} {a:1:1} {c:1:1} {d:1:1}

11 {null} {b:3:0} {a:2:1} {c:2:1} {a:1:0} {c:1:0} {b:1:1} b a c d {d:1:0} {b:1:1} {a:1:1} {c:1:1} {d:1:1}

12 {null} {b:3:0} {a:2:1} {c:2:1} {a:2:0} {c:2:0} {b:1:1} b a c d {d:1:0} {b:1:1} {a:1:1} {c:1:1} {d:1:1}

13 {null} {b:3:0} {a:2:1} {c:2:1} {a:2:0} {c:2:0} {b:1:1} b a c d {d:2:0} {b:1:1} {a:1:1} {c:1:1} {d:1:1}

14 Mining frequent episode  Header table{h 0,h 1, …..,h H }  Mining recursively each of the linked list kept at the header table from bottom to top  Conditional path can build conditional event tree  Object 1:found frequent episodes of form {a} ∪ {h i } first-part frequencies  Object 2:found frequent episodes that contain h i and at least two other event types Db frequencies

15 Traverse conditional path  Remove invalid event types  Adjust counts of nodes above hi in the path to be equal to that of hi  If hi is in the firstdays part, then move all event types in the remainingdays part to the firstdays part  Remove hi from the path

16 Generate frequent episode  When a conditional event tree contains only a single path Any subset of firstpart ∪ event base set Any Subsets of firstpart ∪ Any Subsets of remainingpart ∪ event base set

17 Mining Header d  event base set {d} db frequency:{ } First_part frequency:{ } Frequent episode :{bd,ad,cd} min_sup =3 WEvent_set pairs 1 2 3 4 5 6 7

18 Recursively Mining Header c  event base set {cd} db frequency:{ } First_part frequency:{ } Frequent episode :{bcd,acd}

19  Recursively Mining Header a event base set {acd} db frequency:{ } First_part frequency:{ } Frequent episode :{bacd}

20 Mining Header c  event base set {c} db frequency:{ } First_part frequency:{ } Frequent episode :{bc} min_sup =3 WEvent_set pairs 1 2 3 4 5 6 7

21 Recursively Mining Header a  event base set {ac} db frequency:{ } First_part frequency:{ } Frequent episode :{bac} min_sup =3

22 Mining Header a  event base set {a} db frequency:{ } First_part frequency:{ } Frequent episode :{ba} min_sup =3 WEvent_set pairs 1 2 3 4 5 6 7

23 Experiment (synthetic data)

24 Dataset 2 T20,I5,M1000,D3K

25 Experiment (real data)  News event from a internet 121 event types 757 days  Stock data Dow Jones,Nasdaq,Hang Seng, 12 top local companies

26 Experiment (real data)

27 episodesupport Nasdaq downs, PCCW downs151 Nasdaq ups, SHK properties flats, HSBC flats 178 China Mobile downs, Nasdaq downs, HK Electric flats 178


Download ppt "Date:2004/03/05 Mining Frequent Episodes for relating Financial Events and Stock Trends Anny Ng and Ada Wai-chee Fu PAKDD 2003 報告者: Ming Jing Tsai."

Similar presentations


Ads by Google