CS6604 Project Ensemble Classification


CS6604 Project Ensemble Classification
Project Team: Kannan, Vijayasarathy; Soundarapandian, Manikandan; Alabdulhadi, Mohammed; Hamid, Tania
Project Client: Yinlin Chen
VT, Blacksburg
03/06/2014

Introduction
Project Objective: Develop classifiers to aid in transfer learning and to classify educational resources for the Ensemble portal.
Machine Learning (Text Classification)
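To make the text-classification setting concrete, here is a minimal sketch of a bag-of-words Multinomial Naive Bayes classifier in plain Python. It is not the project's Weka pipeline: the whitespace tokenizer, the add-one smoothing, and the toy documents and labels (`databases`, `networks`) are all assumptions chosen for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace (a stand-in for a real tokenizer)."""
    return text.lower().split()


class MultinomialNB:
    """Multinomial Naive Bayes over word counts with add-one smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)      # documents per class
        self.word_counts = defaultdict(Counter)  # word frequencies per class
        self.vocab = set()
        for doc, label in zip(docs, labels):
            tokens = tokenize(doc)
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, doc):
        # Words never seen in training are ignored.
        counts = Counter(t for t in tokenize(doc) if t in self.vocab)
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            score = math.log(n_docs / self.total_docs)  # log prior
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for word, n in counts.items():
                score += n * math.log((self.word_counts[label][word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy data; these are not records from the project's data set.
train_docs = [
    "query optimization for relational databases",
    "indexing and transaction processing in databases",
    "routing protocols for wireless networks",
    "congestion control in packet networks",
]
train_labels = ["databases", "databases", "networks", "networks"]

clf = MultinomialNB().fit(train_docs, train_labels)
print(clf.predict("transaction query processing"))
print(clf.predict("wireless routing"))
```

Each class is scored by its log prior plus the smoothed log likelihood of every known word in the document, and the highest-scoring class is returned.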

The Big Picture

Classification Algorithm Results – All Classes

Instance Size | No. of Classes | Filter | Classification Algorithm | % of Accuracy | Test Option
26695 | 54 | String to Word Vector, SMOTE, Randomize | Naïve Bayes Multinomial | 40 | Cross-validation (3 Folds)
 | | | | 52 | Use Training Set
 | | | J48 | 39 |
 | | | | 67.55 |
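The SMOTE filter named above oversamples a minority class by interpolating between existing minority examples. The sketch below is a simplified plain-Python version of that idea, not Weka's implementation; the function name `smote`, its parameters, and the three sample points are assumptions made for illustration.

```python
import random


def smote(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points, each on the line segment between a
    minority sample and one of its k nearest minority neighbours.
    Requires at least two minority samples."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # Rank the other minority points by squared Euclidean distance.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbour))
        )
    return synthetic


# Hypothetical minority-class feature vectors.
minority_points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0)]
new_points = smote(minority_points, n_new=4)
print(new_points)
```

Because every synthetic point is a convex combination of two real minority points, the new examples stay inside the region the minority class already occupies.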

Results – Reduced Classes

Instance Size | No. of Classes | Filter | Classification Algorithm | % of Accuracy | Test Option
10002 | 10 | String to Word Vector | Naïve Bayes Multinomial | 75.8 | Cross-validation (3 Folds)
12003 | 12 | | | 67.2 |
 | | | SMO | 76.8 |
 | | | | 65.66 |
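The "Cross-validation (3 Folds)" test option holds out each third of the data in turn, whereas "Use Training Set" (listed in the all-classes results) scores the model on the same examples it was trained on. The sketch below contrasts the two with a 1-nearest-neighbour classifier, which is an assumption picked because it memorizes its training data; the synthetic two-class data set is likewise hypothetical.

```python
import random


def nearest_neighbour_predict(train, point):
    """Return the label of the training example closest to `point` (1-NN)."""
    _, label = min(
        train,
        key=lambda ex: sum((a - b) ** 2 for a, b in zip(ex[0], point)),
    )
    return label


def accuracy(train, test):
    """Fraction of test examples whose predicted label matches the true one."""
    hits = sum(nearest_neighbour_predict(train, x) == y for x, y in test)
    return hits / len(test)


def cross_val_accuracy(data, k=3, seed=0):
    """Shuffle once, split into k folds, and average the held-out accuracy."""
    rng = random.Random(seed)
    shuffled = data[:]
    rng.shuffle(shuffled)
    folds = [shuffled[i::k] for i in range(k)]
    scores = []
    for i, held_out in enumerate(folds):
        train = [ex for j, fold in enumerate(folds) if j != i for ex in fold]
        scores.append(accuracy(train, held_out))
    return sum(scores) / k


# Hypothetical data: two overlapping Gaussian clusters in two dimensions.
rng = random.Random(1)
data = [((rng.gauss(0, 1), rng.gauss(0, 1)), "a") for _ in range(30)]
data += [((rng.gauss(1, 1), rng.gauss(1, 1)), "b") for _ in range(30)]

train_acc = accuracy(data, data)          # analogous to "Use Training Set"
cv_acc = cross_val_accuracy(data, k=3)    # analogous to 3-fold cross-validation
print(f"training-set accuracy: {train_acc:.2f}")
print(f"3-fold CV accuracy:    {cv_acc:.2f}")
```

With 1-NN every training point is its own nearest neighbour, so the training-set score is perfect regardless of how well the model generalizes. The cross-validated score is usually the more honest estimate for unseen data.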

Future Work
- Classifier accuracy improvement
- Adding more features
  - Conference name
  - Author name
  - Bibliographic references
- Include all classes of ACM CCS
- Single-class classifiers
- Transfer learning to the Ensemble portal

Challenges
- Size of the training data set
- Data filtering and preprocessing
- Pruning the taxonomy
- Classifier accuracy: in progress; improved from 45% to 67% using various combinations of filters
- Weka performance and reliability: a concern for large data sets; aiming to deploy it on a distributed platform

Questions?