CPS : Information Management and Mining

Slides:



Advertisements
Similar presentations
DATA MINING LECTURE 12 Link Analysis Ranking Random walks.
Advertisements

Logistics: –My office hours: T, Th 4-5pm or by appointment –Class Web page:
Spatial Outlier Detection and implementation in Weka Implemented by: Shan Huang Jisu Oh CSCI8715 Class Project, April Presented by Jisu.
Problem Addressed Attempts to prove that Web Crawl is random & biased image of Web Graph and does not assert properties of Web Graph Understanding the.
Introduction to SEG 5010 Hong Cheng 2009/10 Second Term.
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Welcome to CompSci 100! As You Arrive… Make sure you grab a syllabus packet. Read through it. I will be covering the most essential points in my talk,
1 Web Search and Advanced Internet Services 290N Class Introduction Tao Yang, 2014.
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
1 Information Retrieval and Advanced Internet Services 290N Class Introduction Tao Yang, 2015
X-Informatics Web Search; Text Mining B 2013 Geoffrey Fox Associate Dean for.
Distributed Computing Rik Sarkar. Distributed Computing Old style: Use a computer for computation.
An Example of Course Project Face Identification.
Introduction to CS Senior Design Project I / II Prof. Dr. H. Altay Güvenir.
Computing & Information Sciences Kansas State University Paper Review Guidelines KDD Lab Course Supplement William H. Hsu Kansas State University Department.
Resilient Distributed Datasets: A Fault- Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
CS 160: Software Engineering December 10 Class Meeting Department of Computer Science San Jose State University Fall 2014 Instructor: Ron Mak
1 1 COMP5331: Knowledge Discovery and Data Mining Acknowledgement: Slides modified based on the slides provided by Lawrence Page, Sergey Brin, Rajeev Motwani.
ITCS 6265 Details on Project & Paper Presentation.
1 CS 430: Information Discovery Lecture 5 Ranking.
MapReduceCS-4513 D-term MapReduce CS-4513 Distributed Computing Systems (Slides include materials from Operating System Concepts, 7 th ed., by Silbershatz,
Cluster Analysis Data Mining Experiment Department of Computer Science Shenzhen Graduate School Harbin Institute of Technology.
B. Prabhakaran1 Multimedia Systems Reference Text “Multimedia Database Management Systems” by B. Prabhakaran, Kluwer Academic Publishers. – Kluwer bought.
CS 157B: Database Management Systems II April 22 Class Meeting Department of Computer Science San Jose State University Spring 2013 Instructor: Ron Mak.
Autumn Web Information retrieval (Web IR) Handout #11:FICA: A Fast Intelligent Crawling Algorithm Ali Mohammad Zareh Bidoki ECE Department, Yazd.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
CSE6339 DATA MANAGEMENT AND ANALYSIS FOR COMPUTATIONAL JOURNALISM CSE6339, Spring 2012 Department of Computer Science and Engineering, University of Texas.
Getting & Running EdgeX Docker Containers
Department of Computer Science & Engineering
Big Data is a Big Deal!.
Introduction to CS Senior Design Project I / II
2017 Junior Exams Y9 Study timetable.
CS 201 – Data Structures and Discrete Mathematics I
Data Mining: Concepts and Techniques Course Outline
Introduction to CS Senior Design Project I / II
Introduction to CS Senior Design Project I / II
CS7280: Special Topics in Data Mining Information/Social Networks
Lecture 34 CSE 331 Nov 26, 2012.
2017 Junior Exams Y10 Study timetable.
Google Classroom To join use one the following codes:
Homework Assignment 1: Use the following data set to test the performance difference of three clustering algorithms: K-means, AP clustering and Spectral.
Dr. Clincy Professor of CS
Lecture 19 CSE 331 Oct 12, 2016.
Zhenjiang Lin, Michael R. Lyu and Irwin King
Physics 3 – Sept 13, 2018 Today’s Objective –
Physics 3 – Sept 27, 2018 Today’s Objective –
Dr. Clincy Professor of CS
Physics 3 – Sept 11, 2018 Today’s Objective –
Multimedia Systems Reference Text
Physics 3 – Sept 18, 2018 Today’s Objective –
9A HW: April 22-25, 2014 RESEARCH PAPER TOPIC DUE FRIDAY: your top 4 choices. Tuesday: Complete vocabulary unit 11 for next week. Wednesday: Study for.
Dept. of Computer Science University of Liverpool
Paper ID: XX Track: Track Name
Project Presentation 8 minutes min, 10 minutes max
KS4 PPE Timetable – Week Beginning 31st October 2016
Dr. Clincy Professor of CS
CSCE 1040.
Introduction to CS Senior Design Project I / II
UC ANR UCPath Network January 23, 2019 NR UCPath Implementation Plan
In April, we were working on... Mathematics Go-Math Chapter 11
ECE/CSE 576 Assignment 4 Course Project Spring 2019.
Technologies of Google Seminar Week 1
Directions… Sit down next to your group members. Log in.
Warm-ups Week of October 7-11, 2013.
Physics 3 – Sept 25, 2018 Today’s Objective –
Web Search and Advanced Internet Services
CSE 552 Intro Jay Lorch & Jon Howell.
CS144K An Introduction to Computer Networks
Sequences II Prof. Noah Snavely CS1114
CSCE 1040.
Presentation transcript:

CPS 196.03: Information Management and Mining Third programming project

Third Programming Project Three options: Clustering project PageRank project Your own topic 1-page project proposal due by Friday (April 10) 5.00 PM Project and report due on Tuesday April 21 Single demo for all three projects: April 21 and 22 30 minutes per team (team from Project 3) Should be prepared to run code on your laptop or by logging in to CS department machine Time slots will be determined through email

Clustering Project Implement BFR algorithm Notes www.cs.cornell.edu/Courses/cs678/2002sp/papers/bradley98scaling.ps Evaluate on one or more datasets from UCI repository http://kdd.ics.uci.edu/

PageRank Project Implement PageRank computation algorithm for large Web graphs How is the Web graph represented? Study running time, convergence properties, and robustness of the algorithm to spam/fraud Generate different types of Web graphs Paper from Google: The PageRank Citation Ranking: Bringing Order to the Web For discussion on Thursday (see readings page)