Annual Income Prediction Modeling Using SVM

Slides:



Advertisements
Similar presentations
1 Image Classification MSc Image Processing Assignment March 2003.
Advertisements

Relevant characteristics extraction from semantically unstructured data PhD title : Data mining in unstructured data Daniel I. MORARIU, MSc PhD Supervisor:
ONLINE ARABIC HANDWRITING RECOGNITION By George Kour Supervised by Dr. Raid Saabne.
Particle swarm optimization for parameter determination and feature selection of support vector machines Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen,
IMAN SAUDY UMUT OGUR NORBERT KISS GEORGE TEPES-NICA BARLEY SEEDS CLASSIFICATION.
Vineel Pratap Girish Govind Abhilash Veeragouni. Human listeners are capable of extracting information from the acoustic signal beyond just the linguistic.
Face Recognition & Biometric Systems Support Vector Machines (part 2)
SUPPORT VECTOR MACHINES PRESENTED BY MUTHAPPA. Introduction Support Vector Machines(SVMs) are supervised learning models with associated learning algorithms.
Artificial Intelligence Statistical learning methods Chapter 20, AIMA (only ANNs & SVMs)
A Kernel-based Support Vector Machine by Peter Axelberg and Johan Löfhede.
SVMs Finalized. Where we are Last time Support vector machines in grungy detail The SVM objective function and QP Today Last details on SVMs Putting it.
SVM Support Vectors Machines
Minimal Neural Networks Support vector machines and Bayesian learning for neural networks Peter Andras
Optimization Theory Primal Optimization Problem subject to: Primal Optimal Value:
Introduction to Artificial Neural Network and Fuzzy Systems
July 11, 2001Daniel Whiteson Support Vector Machines: Get more Higgs out of your data Daniel Whiteson UC Berkeley.
An Example of Course Project Face Identification.
1 SUPPORT VECTOR MACHINES İsmail GÜNEŞ. 2 What is SVM? A new generation learning system. A new generation learning system. Based on recent advances in.
Feb 2007Alon Slapak 1 of 1 Classification A practical approach Classification Methods Training Set Classifier Example Definition Bibliography.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
Exploration of Instantaneous Amplitude and Frequency Features for Epileptic Seizure Prediction Ning Wang and Michael R. Lyu Dept. of Computer Science and.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Protein Fold Recognition as a Data Mining Coursework Project Badri Adhikari Department of Computer Science University of Missouri-Columbia.
RSVM: Reduced Support Vector Machines Y.-J. Lee & O. L. Mangasarian First SIAM International Conference on Data Mining Chicago, April 6, 2001 University.
Ohad Hageby IDC Support Vector Machines & Kernel Machines IP Seminar 2008 IDC Herzliya.
컴퓨터 과학부 김명재.  Introduction  Data Preprocessing  Model Selection  Experiments.
Support Vector Machines in Marketing Georgi Nalbantov MICC, Maastricht University.
Weka. Weka A Java-based machine vlearning tool Implements numerous classifiers and other ML algorithms Uses a common.
Support-Vector Networks C Cortes and V Vapnik (Tue) Computational Models of Intelligence Joon Shik Kim.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Miloš Kotlar 2012/115 Single Layer Perceptron Linear Classifier.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Support Vector Machines Optimization objective Machine Learning.
Learning by Loss Minimization. Machine learning: Learn a Function from Examples Function: Examples: – Supervised: – Unsupervised: – Semisuprvised:
A New Generation of Artificial Neural Networks.  Support Vector Machines (SVM) appeared in the early nineties in the COLT92 ACM Conference.  SVM have.
Parallel seasonal approach for electrical load forecasting. Presented by: Oussama Ahmia Authors : Oussama Ahmia & Nadir Farah ITISE2015 ITISE 2015, Granada,
A Brief Introduction to Support Vector Machine (SVM) Most slides were from Prof. A. W. Moore, School of Computer Science, Carnegie Mellon University.
Course Outline (6 Weeks) for Professor K.H Wong
PREDICT 422: Practical Machine Learning
Support Vector Machines
School of Computer Science & Engineering
Other Classification Models: Neural Network
Support Vector Machines and Kernels
Project 4: Facial Image Analysis with Support Vector Machines
Mixture of SVMs for Face Class Modeling
Animal Shelter Adoption
Schizophrenia Classification Using
Digit Recognition using SVMS
Final Year Project Presentation --- Magic Paint Face
An Introduction to Support Vector Machines
Machine Learning Week 1.
Classifying the Thyroid Disease
CSSE463: Image Recognition Day 11
Learning Gender with Support Faces
Minimax Probability Machine (MPM)
Shih-Wei Lin, Kuo-Ching Ying, Shih-Chieh Chen, Zne-Jung Lee
Applied Economic Analysis
Support Vector Machine I
Class #212 – Thursday, November 12
CSSE463: Image Recognition Day 15
Other Classification Models: Support Vector Machine (SVM)
COSC 4368 Machine Learning Organization
CSE 802. Prepared by Martin Law
CAMCOS Report Day December 9th, 2015 San Jose State University
1. What animal 2. Male or Female ? 4. Male or Female? Why? Why?
Modeling IDS using hybrid intelligent systems
Pattern Recognition ->Machine Learning- >Data Analytics Supervised Learning Unsupervised Learning Semi-supervised Learning Reinforcement Learning.
Support Vector Machines 2
Presentation transcript:

Annual Income Prediction Modeling Using SVM Xinjue YU 12/14/2010

Annual Income Prediction Why this problem? Useful in industries such as insurance, banking, marketing, etc Interested in the income distribution The goal: To predict whether a person has an annual income of more than $50,000 The information we have: Age, gender, education level, working hours per week, etc.

The Dataset The Adult dataset: 32561 total with 16281 for testing Extracted from the 1994 Census database. A set of reasonably clean records was extracted ((AAGE>16) && (AGI>100) && (AFNLWGT>1)&& (HRSWK>0))  http://archive.ics.uci.edu/ml/machine-learning- databases/adult/

Preparation There are 14 features in the raw dataset, using 4 out of 14 The 4 features that are used: gender, education level, aged and working hours per week Quantizing the features Education level: 1(<=high school), 2(<grad school) & 3(>=grad school) Gender: 0(Female) & 1(male) Age: 1(<30), 2(30-50) & 3(>50) Working hours per week: 1(<=40) & 2(>40)

The Approach Using Support Vector Machine in artificial neural network The data are supposed to be non-separable Using SVM for non-separable pattern classification Trying different kernels such as Linear RBF Polynomial Sigmoid Using 2-D feature pairs first gender & education level Age and working hours per week Using 4 features in further study (increased complexity)

The Expected Results Predict a person’s annual income is whether more than 50K by the result using SVM (classification/clustering involved) Using testing data to get the error rates of different kernels Comparison of the results of different kernels Linear kernels are supposed to have the highest error rate Try to limit the error rate within 20%-30%