Neal Kurande, WinaGodwin Anyanwu Jr., Adam Chau

Presentation transcript:

IRS-ML Neal Kurande, WinaGodwin Anyanwu Jr., Adam Chau

Team Members
  WinaGodwin Anyanwu Jr.: 3rd Year Computer Science Major. Experience: Java, C, Android, Python
  Adam Chau: 2nd Year Computer Science Major. Experience: Java, Python, JavaScript, SQL
  Neal Kurande: 3rd Year Computer Engineer. Experience: C/C++, Python, Java, JavaScript, MATLAB

What is IRS?
  The Intelligent Response System (IRS) aims to create a user interface that leverages the ITS database to provide feedback that TAs can use to improve student performance in target subject areas. It does this by accessing the SQL database from Python, pulling specific data, and converting it to a JSON file that is read by a GUI.
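The database-to-JSON step described above might look roughly like the sketch below. The slides do not give the database engine or schema, so this uses an in-memory `sqlite3` database as a stand-in, and the `responses` table, its columns, and the `pull_question_data` helper are all hypothetical names chosen for illustration.

```python
import json
import sqlite3

# Hypothetical schema: the real ITS tables and columns are not given in the slides.
conn = sqlite3.connect(":memory:")  # in-memory stand-in for the ITS SQL database
conn.execute(
    "CREATE TABLE responses (student_id INTEGER, question_id INTEGER, "
    "score REAL, rating INTEGER, duration REAL)"
)
conn.executemany(
    "INSERT INTO responses VALUES (?, ?, ?, ?, ?)",
    [(1, 101, 0.8, 4, 35.0), (2, 101, 0.5, 2, 80.5), (1, 102, 1.0, 5, 20.0)],
)


def pull_question_data(connection, question_id):
    """Return the score, rating, and duration rows for one question as dicts."""
    cursor = connection.execute(
        "SELECT student_id, score, rating, duration FROM responses "
        "WHERE question_id = ? ORDER BY student_id",
        (question_id,),
    )
    columns = [col[0] for col in cursor.description]
    return [dict(zip(columns, row)) for row in cursor.fetchall()]


rows = pull_question_data(conn, 101)
payload = json.dumps({"question_id": 101, "responses": rows}, indent=2)
print(payload)  # this text is what would be written to a .json file for the GUI
```

Using a parameterized query (`?` placeholders) keeps the selection criteria separate from the SQL text, which makes it easier to pull different slices of data with the same function.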

ITS Student Feedback Loop

Last Semester
  The IRS project was split into two teams
  The IRS-ML team worked on pulling data from the database
    Accessed question score, rating, and duration
  The IRS-GUI team worked on creating a GUI that could visualize this data
    This was done by generating JSON files on the backend, which the front-end team then converted using React

Last Semester

Last Semester

Semester Goals
  Analyze data using machine learning techniques
  Modularize the code to improve the experience of future developers
  Make the system dynamic, updating in real time based on the SQL database

General Improvements
  Code Modularized
    File structure changed
    Code separated into methods
  Implementation Improvements
    Can choose data by semester
    Can select data by pre-test and post-test (Chapters 1-7 and Chapter 8, respectively)
    JSON names are generated based on the data parameters
  Used the GitHub Wiki
    Wiki now exists
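As a rough illustration of selecting data by semester and test, and of naming the JSON output after those parameters, consider the sketch below. Only the chapter ranges come from the slide. The record layout, the `select_records` and `json_name` helpers, and the file-name pattern are assumptions, not the team's actual scheme.

```python
# Chapter ranges come from the slide; everything else is a hypothetical naming scheme.
TEST_CHAPTERS = {"pre": range(1, 8), "post": range(8, 9)}


def select_records(records, semester, test):
    """Keep records from one semester whose chapter belongs to the pre- or post-test."""
    chapters = TEST_CHAPTERS[test]
    return [r for r in records if r["semester"] == semester and r["chapter"] in chapters]


def json_name(semester, test, metric):
    """Build a file name from the data parameters."""
    slug = semester.lower().replace(" ", "")
    return f"{slug}_{test}_{metric}.json"


records = [
    {"semester": "Spring 2018", "chapter": 3, "score": 0.7},
    {"semester": "Spring 2018", "chapter": 8, "score": 0.9},
    {"semester": "Summer 2018", "chapter": 2, "score": 0.6},
]
pre = select_records(records, "Spring 2018", "pre")
print(json_name("Spring 2018", "pre", "score"), len(pre))
# prints: spring2018_pre_score.json 1
```

Deriving the file name from the same parameters used for selection means the GUI side can locate a file without a separate index.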

K-Means Clustering
  A common form of clustering that creates n clusters from a dataset
  K-Means is an iterative algorithm: it creates n clusters, finds the centroid of each, then remakes the clusters around those centroids
  K-Means needs data preprocessing
    First eliminate outliers from the dataset
    Normalize all dimensions of the dataset or create appropriate weights for each dimension
    Eliminate NaN data points
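The preprocessing steps and the assign-then-recompute loop above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the team's implementation: the slides do not say how outliers were detected, so a z-score cutoff (`z_max`) is assumed, min-max scaling is assumed for normalization, and the sample points are made up.

```python
import math
import random
import statistics


def preprocess(points, z_max):
    """Drop NaN rows, drop z-score outliers (assumed rule), then min-max normalize."""
    clean = [p for p in points if not any(math.isnan(v) for v in p)]
    dims = range(len(clean[0]))
    means = [statistics.mean(p[d] for p in clean) for d in dims]
    stdevs = [statistics.pstdev(p[d] for p in clean) or 1.0 for d in dims]
    kept = [
        p for p in clean
        if all(abs(p[d] - means[d]) / stdevs[d] <= z_max for d in dims)
    ]
    lows = [min(p[d] for p in kept) for d in dims]
    spans = [(max(p[d] for p in kept) - lows[d]) or 1.0 for d in dims]
    return [tuple((p[d] - lows[d]) / spans[d] for d in dims) for p in kept]


def kmeans(points, n_clusters, iterations=20, seed=0):
    """Assign each point to its nearest centroid, recompute centroids, repeat."""
    rng = random.Random(seed)
    centroids = rng.sample(points, n_clusters)  # initial centroids are random points
    labels = []
    for _ in range(iterations):
        labels = [
            min(range(n_clusters), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        updated = []
        for c in range(n_clusters):
            members = [p for p, label in zip(points, labels) if label == c]
            # Keep the old centroid if a cluster ends up empty.
            updated.append(
                tuple(sum(vals) / len(members) for vals in zip(*members))
                if members else centroids[c]
            )
        if updated == centroids:  # converged: assignments can no longer change
            break
        centroids = updated
    return labels, centroids


raw = [
    (1.0, 1.0), (1.2, 0.9), (0.8, 1.1),
    (5.0, 5.0), (5.2, 4.9), (4.8, 5.1),
    (100.0, 100.0),        # outlier
    (float("nan"), 2.0),   # NaN data point
]
data = preprocess(raw, z_max=2.0)
labels, centroids = kmeans(data, n_clusters=2)
print(len(data), labels)
```

Normalizing before clustering matters because the distance is Euclidean: without it, a dimension with a large numeric range (such as duration in seconds) would dominate one with a small range (such as a rating).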

K-Means Pictures

Agglomerative Clustering
  A form of hierarchical clustering that uses a bottom-up approach
  Clusters are grouped together using the Euclidean distance
  Data from Spring 2018 and Summer 2018 were used to make seven clusters for the graphs
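The bottom-up approach can be illustrated with a short plain-Python sketch: every point starts as its own cluster, and the two closest clusters are merged until the requested number remains. The slides do not state which linkage rule was used, so single linkage (the smallest Euclidean distance between members) is assumed here for brevity, and the sample points are invented.

```python
import math


def agglomerative(points, n_clusters):
    """Start with singleton clusters and merge the closest pair until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]  # each cluster holds point indices

    def linkage(a, b):
        # Single linkage (an assumption): smallest Euclidean distance between members.
        return min(math.dist(points[i], points[j]) for i in a for j in b)

    while len(clusters) > n_clusters:
        pairs = [
            (x, y)
            for x in range(len(clusters))
            for y in range(x + 1, len(clusters))
        ]
        x, y = min(pairs, key=lambda pair: linkage(clusters[pair[0]], clusters[pair[1]]))
        clusters[x] = clusters[x] + clusters[y]  # merge y into x
        del clusters[y]
    return clusters


points = [(0.0, 0.0), (0.1, 0.2), (4.0, 4.0), (4.3, 3.9), (9.0, 0.5)]
print(agglomerative(points, 3))
# prints: [[0, 1], [2, 3], [4]]
```

Unlike K-Means, this procedure needs no initial centroids and no random seed, so the same data always yields the same grouping for a given linkage rule.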

Agglomerative Clustering

Challenges
  Downloading and installing Ubuntu
  Version control
  Insufficient data
  Accessing more data streams
  Choosing relevant ML algorithms

Next Steps
  Incorporating other data streams into the clustering algorithms
  Using a different type of clustering or unsupervised learning
  Integrating with the cloud to run clustering in real time
  Displaying data via the IRS-GUI

DEMO!