Jiawei Han Computer Science University of Illinois at Urbana-Champaign

Slides:



Advertisements
Similar presentations
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Advertisements

CS583 – Data Mining and Text Mining
Web Search and Mining Course Overview 1 Wu-Jun Li Department of Computer Science and Engineering Shanghai Jiao Tong University Lecture 0: Course Overview.
2008 © ChengXiang Zhai Dragon Star Lecture at Beijing University, June 21-30, Introduction to IR Research ChengXiang Zhai Department of Computer.
2015/6/1Course Introduction1 Welcome! MSCIT 521: Knowledge Discovery and Data Mining Qiang Yang Hong Kong University of Science and Technology
CS583 – Data Mining and Text Mining
CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.
1 Data Mining Techniques Instructor: Ruoming Jin Fall 2006.
An Overview of Our Course:
CS583 – Data Mining and Text Mining
August 25, 2015 Data Mining: Concepts and Techniques 1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and.
1 1 Data Mining: Concepts and Techniques (3 rd ed.) — Chapter 1 — Jiawei Han, Micheline Kamber, and Jian Pei University of Illinois at Urbana-Champaign.
CS523 INFORMATION RETRIEVAL COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
CS598CXZ (CS510) Advanced Topics in Information Retrieval (Fall 2014) Instructor: ChengXiang (“Cheng”) Zhai 1 Teaching Assistants: Xueqing Liu, Yinan Zhang.
Course Title Database Technologies Instructor: Dr ALI DAUD Course Credits: 3 with Lab Total Hours: 45 approximately.
CS525 DATA MINING COURSE INTRODUCTION YÜCEL SAYGIN SABANCI UNIVERSITY.
Overview of CS Class Jiawei Han Department of Computer Science
Data Warehousing/Mining 1 Data Warehousing/Mining Comp 150DW Course Overview Instructor: Dan Hebert.
Database Design CS562 Fall CS562 Database Design Instructor : Professor Chin-Wan Chung Office : Rm 3406 Tel : 3537
Information Retrieval and Data Mining (AT71.07) Comp. Sc. and Inf. Mgmt. Asian Institute of Technology.
General Information 439 – Data Mining Assist.Prof.Dr. Derya BİRANT.
CSCE 5073 Section 001: Data Mining Spring Overview Class hour 12:30 – 1:45pm, Tuesday & Thur, JBHT 239 Office hour 2:00 – 4:00pm, Tuesday & Thur,
DATA MINING: LECTURE 1 By Dr. Hammad A. Qureshi Introduction to the Course and the Field There is an inherent meaning in everything. “Signs for people.
1 SBM411 資料探勘 陳春賢. 2 Lecture I Class Introduction.
CSE6339 DATA MANAGEMENT AND ANALYSIS FOR COMPUTATIONAL JOURNALISM CSE6339, Spring 2012 Department of Computer Science and Engineering, University of Texas.
CSC 4740 / 6740 Fall 2016 Data Mining Instructor: Yubao Wu Fall 2016.
CS & CS ST: Probabilistic Data Management Fall 2016 Xiang Lian Kent State University Kent, OH
CS583 – Data Mining and Text Mining
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
MSCIT BD 5002/IT 5210: Knowledge Discovery and Data Mining
Term Project Proposal By J. H. Wang Apr. 7, 2017.
Why Data Mining? What Is Data Mining?
CS510 Advanced Topics in Information Retrieval (Fall 2017)
Introduction to Operating Systems
Course Introduction 공학대학원 데이타베이스
CS583 – Data Mining and Text Mining
Introduction to IR Research
CS583 – Data Mining and Text Mining
中国计算机学会学科前沿讲习班:信息检索 Course Overview
CS598CXZ (CS510) Advanced Topics in Information Retrieval (Fall 2016)
Course Summary (Lecture for CS410 Intro Text Info Systems)
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CS410: Text Information Systems (Spring 2018)
Jiawei Han, Computer Science, Univ. Illinois at Urbana-Champaign, 2017
Data Mining: Concepts and Techniques Course Outline
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CS & CS Probabilistic Data Management
CS583 – Data Mining and Text Mining
Computer Networks CNT5106C
CS7280: Special Topics in Data Mining Information/Social Networks
Exploring the Power of Links in Data Mining
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CS510 (Fall 2018) Advanced Topics in Information Retrieval
Introduction --- Part2 Another Introduction to Data Mining
CS6220: Data Mining Techniques
©2012 Han, Kamber & Pei. All rights reserved.
CS583 – Data Mining and Text Mining
CS & CS ST: Probabilistic Data Management
Course Summary ChengXiang “Cheng” Zhai Department of Computer Science
CS583 – Data Mining and Text Mining
ECE 8823: GPU Architectures
Fundamental of Artificial Intelligence (CSC3180)
Dept. of Computer Science University of Liverpool
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 1 —
CSCE 4143 Section 001: Data Mining Spring 2019.
Information Retrieval CIS-462
CS583 – Data Mining and Text Mining
Information Retrieval and Data Mining (AT71. 07) Comp. Sc. and Inf
First 2-3 Lectures (Intro to DS/DM)
Presentation transcript:

An Overview of CS512 @Spring 2018 Jiawei Han Computer Science University of Illinois at Urbana-Champaign September 17, 2018

Data and Information Systems (DAIS) Course Structures at CS/UIUC Three main streams: Database, data mining and text information systems Seminar: Yahoo!-DAIS Seminar: (Not CS591 seminar, no credit given) Database Systems: Database management systems (CS411: Fall + Spring) Advanced database systems (CS511: Fall) Human-in-the-loop Data Management (CS 598: Aditya Parameswaran) Data mining Intro. to data mining (CS412: Fall + Spring) Data mining: Principles and algorithms (CS512: Spring (Han)) Text information systems Introduction to Text Information Systems (CS410: Spring (Zhai)) Advance Topics on Information Retrieval (CS 598 or CS510: Fall (Zhai)) Social & Economic Networks (CS 598: Hari Sundaram)

Coursera Data Mining Specialization Data Visualization: John Hart Pattern Discovery in Data Mining: Jiawei Han Text Retrieval and Search Engines: ChengXiang Zhai Cluster Analysis in Data Mining: Jiawei Han Text Mining and Analytics: ChengXiang Zhai Data Integration and Data Warehousing: Kevin Chang Capstone Data Mining Capstone (6 weeks) Online MCS Data Science Master program (https://online.illinois.edu/mcs-ds)

Topic Coverage: CS512 @ 2017 Class introduction (0.5 wk) An overview on recent data mining research (0.5 wk) Text mining: phrase mining and entity typing (2 weeks) Textcube construction and exploration (2 weeks) 1st midterm exam (0.5week) — 1st Lect. of 6th week Mining heterogeneous information networks (3 weeks) Truth finding (1 week) Mining social media and spatiotemporal data (1 week) Stream data mining (1 week) if time permits Selected class survey presentation (1 week) 2nd midterm exams (0.5 week) —2nd Lect. of 15th week Class research project presentation (final week + exam week)

Class Information Instructor: Jiawei Han (www.cs.uiuc.edu/~hanj) Lectures: Tues/Thurs 9:30-10:45am (0216 SC) Office hours: Tues/Thurs 10:45-11:30am (2132 SC) Teach Assistants: Ahmed El-Kishky (25%), Carl Yang (25%), Qi Zhu (50%, online TA), Honglei Zhuang (50%, lead TA) Prerequisites (course preparation) CS412 (offered every semester) or consent of instructor General background: Knowledge on statistics, machine learning, and data and information systems will help understand the course materials Course website (bookmark it since it will be used frequently!) https://wiki.cites.illinois.edu/wiki/display/cs512/Lectures Textbook: Jialu Liu, Jingbo Shang and Jiawei Han,  Phrase Mining from Massive Text and Its Applications, Morgan & Claypool, 2017 Yizhou Sun and Jiawei Han, Mining Heterogeneous Information Networks: Principles and Methodologies, Morgan & Claypool, 2012 A set of recent published research papers (see course syllabus) J. Han, M. Kamber, J. Pei, Data Mining: Concepts and Techniques, 3rd ed., Morgan Kaufmann, 2011

Textbook & Recommended Reference Books Yizhou Sun and Jiawei Han, Mining Heterogeneous Information Networks: Principles and Methodologies, Morgan & Claypool, 2012 Jiawei Han, Micheline Kamber, Jian Pei, Data Mining: Concepts and Techniques, 3rd ed., Morgan Kaufmann, 2011 Recommended reference books Mark Newman, Networks: An Introduction, Oxford Univ. Press, 2010 Albert-László Barabási, Network Science, Cambridge Univ. Press, Aug. 2016 M. J. Zaki and W. Meira, Jr. “Data Mining and Analysis: Fundamental Concepts and Algorithms”, Cambridge University Press, 2014 K. P. Murphy, "Machine Learning: a Probabilistic Perspective", MIT Press, 2012 Reference papers A list of reference papers will be made available at course “resource” page

Course Work: Assignments, Exams and Course Project Assignments: (2 assignments, equal weight) 15% total Two midterm exams (equal weight): 40% in total Research project proposal (one-page): 0% (due at the end of 4th week) Class attendance (3%): Max misses w/o penalty: 3, then −0.3% for each miss For online students, 3% will be folded into survey report Class presentation and/or research survey (12% total) Survey report [expect to be comprehensive and in high quality, ≈ 20 pages] Encourage to align with your research project topic domain Report plus companion presentation slides [due at the end of 12th week] Class presentation: May use 10 min. class survey presentation to replace the survey report (consent of instructor)—contents must closely aligned with the class content and in very high technical quality Final course project: 30% (due at the end of semester) Evaluated by class (50%) and TA + instructor (50%) collectively!

Research Projects Evaluation Final course project: 30% (due at the end of semester) The final project will be evaluated based on (1) technical innovation, (2) thoroughness of the work, and (3) clarity of presentation The final project will need to hand in: (1) project report (length will be similar to a typical 8-12 page double-column conference paper), and (2) project presentation slides (which is required for both online and on-campus students) Each course project for every on-campus student will be evaluated collectively by instructor (plus TA) and other on-campus students in the same class The course project for online students will be evaluated by instructors and TA only Group projects (both survey and research): Single-person project is OK, also encouraged to have two as a group, and team up with other senior graduate students, and will be judged by them

Where to Find Reference Papers? Course research papers: Check reading list and list of papers at the end of each set of chapter slides Major conference proceedings that will be used DM conferences: ACM SIGKDD (KDD), ICDM (IEEE, Int. Conf. Data Mining), SDM (SIAM Data Mining), ECMLPKDD (Principles KDD), PAKDD (Pacific-Asia) DB conferences: ACM SIGMOD, VLDB, ICDE ML conferences: NIPS, ICML IR and Web conferences: SIGIR, CIKM, WWW, WSDM Social network confs: ASONAM Other related conferences and journals IEEE TKDE, ACM TKDD, DMKD, ML Use course Web page, DBLP, Google Scholar, Citeseer

From Data to Networks to Knowledge: An Evolutionary Path! Han, Kamber and Pei, Data Mining, 3rd ed. 2011 Yu, Han and Faloutsos (eds.), Link Mining, 2010 Sun and Han, Mining Heterogeneous Information Networks, 2012 Y. Sun: SIGKDD’13 Dissertation Award Wang and Han, Mining Latent Entity Structures, 2015 C. Wang: SIGKDD’15 Dissertation Award