Term Project Proposal By J. H. Wang Apr. 7, 2017.

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Department of Computer Science Worcester Polytechnic Institute INTRODUCTION TO KNOWLEDGE DISCOVERY IN DATABASES AND DATA MINING.
Advertisements

Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CS583 – Data Mining and Text Mining
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Introduction to IR Research ChengXiang Zhai Department of Computer.
1 Data Mining Techniques Instructor: Ruoming Jin Fall 2006.
An Overview of Our Course:
Ch. Eick: Course Information COSC Introduction --- Part2 1. Another Introduction to Data Mining 2. Course Information.
Ch. Eick: Introduction Data Mining and Course Information 1 Introduction --- Part2 1. Another Introduction to Data Mining 2. Course Information.
COMP5331: Knowledge Discovery and Data Minig
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
Proposal for Term Project Operating Systems, Fall 2011 J. H. Wang Nov. 3, 2011.
CpSc 881: Machine Learning Introduction. 2 Copy Right Notice Most slides in this presentation are adopted from slides of text book and various sources.
Proposal for Term Project Operating Systems, Fall 2015 J. H. Wang Sep. 18, 2015.
Proposal for Term Project Operating Systems, Fall 2008 J. H. Wang Nov. 5, 2008.
Mehdi Ghayoumi MSB rm 132 Ofc hr: Thur, a Machine Learning.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
Overview of CS Class Jiawei Han Department of Computer Science
Instructor: Jinze Liu Spring 2014 CS 685 Special Topics in Data mining.
Proposal for Term Project Operating Systems, Fall 2012 J. H. Wang Nov. 13, 2012.
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
Proposal for Term Project Information Security, Fall 2014 J. H. Wang Sep. 25, 2014.
The Interplay Between Mathematics/Computation and Analytics Haesun Park Division of Computational Science and Engineering Georgia Institute of Technology.
1 1 MSCIT 5210: Knowledge Discovery and Data Mining Acknowledgement: Slides modified by Dr. Lei Chen based on the slides provided by Jiawei Han, Micheline.
Lecture 01 – Introduction to DM
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
Proposal for Term Project Information Security, Fall 2013 J. H. Wang Nov. 5, 2013.
Proposal for Term Project Compilers, Fall 2015 J. H. Wang Nov. 2, 2015.
CSC 4740 / 6740 Fall 2016 Data Mining Instructor: Yubao Wu Fall 2016.
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
CS & CS ST: Probabilistic Data Management Fall 2016 Xiang Lian Kent State University Kent, OH
Data Mining IS698 Min Song.
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining: Concepts and Techniques
MSCIT BD 5002/IT 5210: Knowledge Discovery and Data Mining
Why Data Mining? What Is Data Mining?
Overview on Data Mining
Proposal for Term Project Information Security, Fall 2016
Proposal for Term Project
CS583 – Data Mining and Text Mining
Eick: Introduction Machine Learning
Introduction to IR Research
Data Mining: Concepts and Techniques — Chapter 1 — — Introduction —
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
中国计算机学会学科前沿讲习班:信息检索 Course Overview
Course Summary (Lecture for CS410 Intro Text Info Systems)
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining: Concepts and Techniques — Chapter 1 — — Introduction —
Jiawei Han Computer Science University of Illinois at Urbana-Champaign
Jiawei Han, Computer Science, Univ. Illinois at Urbana-Champaign, 2017
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CS & CS Probabilistic Data Management
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Introduction --- Part2 Another Introduction to Data Mining
CS6220: Data Mining Techniques
©2012 Han, Kamber & Pei. All rights reserved.
Proposal for Term Project Operating Systems, Fall 2018
CS583 – Data Mining and Text Mining
CS & CS ST: Probabilistic Data Management
CS583 – Data Mining and Text Mining
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Christoph F. Eick: A Gentle Introduction to Machine Learning
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CSCE 4143 Section 001: Data Mining Spring 2019.
CS583 – Data Mining and Text Mining
Promising “Newer” Technologies to Cope with the
First 2-3 Lectures (Intro to DS/DM)
Presentation transcript:

Term Project Proposal By J. H. Wang Apr. 7, 2017

About the Term Project Two options for term project Either team-based system development e.g. extension to exercises Or individual academic paper presentation Only one person per team allowed Tentative schedule for all teams Proposal: *required* one week after midterm (Apr. 25, 2017) Presentations: *required* in the last four weeks (starting from Jun. 2, 2017) Final report: *required* before the end of the semester (Jun. 26, 2017)

Proposal for System Projects A one-page description of ideas about your system Introduction: an overview of the problem and possible issues Methods: major ideas of your design to solve the problem Team members: the names and responsibilities of each member must be clearly identified Preferred time slots: your preferred time slot to present A 20-minute presentation that focuses on the major ideas of your design, including demo and Q&A System functionality and presentation will be counted in the score

Proposal for Paper Presentation A one-page description of your selected paper Introduction: an overview of the paper related to big data analytics Keypoints: methods, major contribution, experimental results in your selected paper Preferred time slots: your preferred time slot to present A 20-minute presentation that focuses on the major ideas in your selected paper Paper quality and presentation will be counted in the score Recommended sources: (in the next two slides)

Conferences and Journals on Data Mining KDD Conferences ACM SIGKDD Int. Conf. on Knowledge Discovery in Databases and Data Mining (KDD) SIAM Data Mining Conf. (SDM) (IEEE) Int. Conf. on Data Mining (ICDM) European Conf. on Machine Learning and Principles and practices of Knowledge Discovery and Data Mining (ECML-PKDD) Pacific-Asia Conf. on Knowledge Discovery and Data Mining (PAKDD) Int. Conf. on Web Search and Data Mining (WSDM) Other related conferences DB conferences: ACM SIGMOD, VLDB, ICDE, EDBT, ICDT, … Web and IR conferences: WWW, SIGIR, WSDM ML conferences: ICML, NIPS PR conferences: CVPR, Journals Data Mining and Knowledge Discovery (DAMI or DMKD) IEEE Trans. On Knowledge and Data Eng. (TKDE) KDD Explorations ACM Trans. on KDD

Where to Find References? DBLP, CiteSeer, Google Data mining and KDD (SIGKDD: CDROM) Conferences: ACM-SIGKDD, IEEE-ICDM, SIAM-DM, PKDD, PAKDD, etc. Journal: Data Mining and Knowledge Discovery, KDD Explorations, ACM TKDD Database systems (SIGMOD: ACM SIGMOD Anthology—CD ROM) Conferences: ACM-SIGMOD, ACM-PODS, VLDB, IEEE-ICDE, EDBT, ICDT, DASFAA Journals: IEEE-TKDE, ACM-TODS/TOIS, JIIS, J. ACM, VLDB J., Info. Sys., etc. AI & Machine Learning Conferences: Machine learning (ML), AAAI, IJCAI, COLT (Learning Theory), CVPR, NIPS, etc. Journals: Machine Learning, Artificial Intelligence, Knowledge and Information Systems, IEEE-PAMI, etc. Web and IR Conferences: SIGIR, WWW, CIKM, etc. Journals: WWW: Internet and Web Information Systems, Statistics Conferences: Joint Stat. Meeting, etc. Journals: Annals of statistics, etc. Visualization Conference proceedings: CHI, ACM-SIGGraph, etc. Journals: IEEE Trans. visualization and computer graphics, etc.

Resources on distributed computing and parallel algorithms (and OS!) Example topics: Efficient scheduling algorithms in MapReduce framework efficient matrix computation in Spark Tensorflow for large-scale machine learning Performance analysis of a Tensorflow Processing Unit (TPU) Using deep learning techniques for classification, pattern recognition, image recognition, speech recognition, … …

Proposal Submission Due: Apr. 25, 2017 How: Homework submission site The presentation schedule will be arranged in the last four (3.5) weeks Starting from Jun. 2, 2017 No other time slots available for presentation No presentation, no score for term project

About the Score The score you’ll get depends on the functionality, difficulty, and quality of your project For system development: System functions and correctness You can either write your own program, or call existing open source code or library (but NOT executing existing binary only) For academic paper presentation Quality and your presentation of the paper is critical Major methods/experimental results *must* be presented Papers from top conferences are strongly suggested Proposals, presentations, and reports are *required* for each team, and will be counted in the score Big Data Analytics, Spring 2017 NTUT CSIE

Thanks for Your Attention!