Presentation is loading. Please wait.

Presentation is loading. Please wait.

Random Subspace Feature Selection for Analysis of Data with Missing Features Presented by: Joseph DePasquale Student Activities Conference 2007 This material.

Similar presentations


Presentation on theme: "Random Subspace Feature Selection for Analysis of Data with Missing Features Presented by: Joseph DePasquale Student Activities Conference 2007 This material."— Presentation transcript:

1 Random Subspace Feature Selection for Analysis of Data with Missing Features Presented by: Joseph DePasquale Student Activities Conference 2007 This material is based upon work supported by the National Science Foundation under Grant No ECS- 0239090. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

2 Outline Motivation Motivation Missing feature algorithm Missing feature algorithm Selecting features for trainingSelecting features for training Finding usable classifiers for testingFinding usable classifiers for testing Impact of free parameters Impact of free parameters Number of features used for trainingNumber of features used for training Distribution update parameter βDistribution update parameter β

3 Motivation Missing data is a real world issue Missing data is a real world issue Failed equipmentFailed equipment Human errorHuman error Natural phenomenaNatural phenomena Matrix multiplication can not be used if a single data value is left out Matrix multiplication can not be used if a single data value is left out Missing Feature

4 Training

5

6 Training Xfifi Feature not used in trainingFeature used in training CiCi Usable classifier Usable Classifiers

7 Experimental Setup Research has been done for static selection of features used for training Research has been done for static selection of features used for training Dataset (f)Nof 1 nof 2 nof 3 nof 4 T VOC (12)3456200 PEN (16)6789250 ION (33)81012141000 WBC (30)101214161000

8 Volatile Organic Compound Database

9 Pen Digits Recognition Database

10 Ionosphere Database

11 Wisconsin Breast Cancer Database

12 Conclusions β does not significantly impact the algorithm, the number of features used for training does have an impact β does not significantly impact the algorithm, the number of features used for training does have an impact

13 References [1]Hussein, S., “Random feature subspace ensemble based approaches for the analysis of data with missing features,” Submitted Spring 2006. [2] Haykin, S., “Neural Networks A Comprehensive Foundation,” New Jersey: Prentice Hall, 1999. [3] “UCI repository,” [Online Document], Accessed: 25 Nov 2006. http://www.ics.uci.edu/~mlearn/MLRepository.html

14 Learn ++.MF Training Training Selecting features from distributionSelecting features from distribution Training the networkTraining the network Update likelihood of selecting featuresUpdate likelihood of selecting features Testing Testing Data corruptionData corruption Identify usable classifiersIdentify usable classifiers SimulationSimulation


Download ppt "Random Subspace Feature Selection for Analysis of Data with Missing Features Presented by: Joseph DePasquale Student Activities Conference 2007 This material."

Similar presentations


Ads by Google