Eick : First Lecture COSC 7362 Overview Advanced Machine Learning Hyla-Tree Frog Outlier and Anomaly Detection Spatio-Temporal Clustering Ensemble Learning.

Slides:



Advertisements
Similar presentations
Learning in ECE 156A,B A Brief Summary Li-C. Wang, ECE, UCSB.
Advertisements

Welcome to the seminar course
John Hurley Cal State LA
EE 791D Data Converter Design 2 Introduction 1. Data Converters Signal Conditioning ADC MCU DAC Post Processing -Amplification -Anti-aliasing filter ex.
Project Proposal.
How to Give a Journal Club Talk
CS112: Course Overview George Mason University. Today’s topics Go over the syllabus Go over resources – Marmoset – Blackboard – Piazza – Textbook Highlight.
CSE 5522: Survey of Artificial Intelligence II: Advanced Techniques Instructor: Alan Ritter TA: Fan Yang.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
MACHINE LEARNING 1. Introduction. What is Machine Learning? Based on E Alpaydın 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Need.
CS/CMPE 535 – Machine Learning Outline. CS Machine Learning (Wi ) - Asim LUMS2 Description A course on the fundamentals of machine.
ICS 324 Database Systems Second Semester (072) Information & Computer Science Department.
June 13, Introduction to CS II Data Structures Hongwei Xi Comp. Sci. Dept. Boston University.
CS 232 Geometric Algorithms: Lecture 1 Shang-Hua Teng Department of Computer Science, Boston University.
CSCD 555 Research Methods for Computer Science
CSE 322: Software Reliability Engineering Topics covered: Course outline and schedule Introduction, Motivation and Basic Concepts.
Using MyMathLab Features You must already be registered or enrolled in a current MyMathLab class in order to use MyMathLab. If you are not registered or.
PROBABILITY AND STATISTICS FOR ENGINEERS Session 1 Dr Abdelaziz Berrado MTH3301 —Fall 09.
COMP 14 – 02: Introduction to Programming Andrew Leaver-Fay August 31, 2005 Monday/Wednesday 3-4:15 pm Peabody 217 Friday 3-3:50pm Peabody 217.
INTRODUCTION TO Machine Learning ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
July 16, Introduction to CS II Data Structures Hongwei Xi Comp. Sci. Dept. Boston University.
INTRODUCTION TO Machine Learning 3rd Edition
Introduction to Programming Environments for Secondary Education CS 1140 Dr. Ben Schafer Department of Computer Science.
CS Machine Learning. What is Machine Learning? Adapt to / learn from data  To optimize a performance function Can be used to:  Extract knowledge.
Computer Science 102 Data Structures and Algorithms V Fall 2009 Lecture 1: administrative details Professor: Evan Korth New York University 1.
CS211: Course Overview George Mason University. Today’s topics Go over the syllabus Go over resources – Marmoset – Piazza – Textbook Highlight important.
What you need to know about this class A powerpoint syllabus.
Ch. Eick Project 3 COSC Christoph F. Eick Last updated: October 21, 2014.
Autonomous Learning Proficiency: Getting students to think about their learning Lynn Grinnell College of Business.
COMP 465W: Software Engineering Fall Components of the Course The three main components of this course are: The study of software engineering as.
Strategies for Interpreting a Prompt and Succeeding at the In-Class Timed Writing Essay.
COMP 875 Machine Learning Methods in Image Analysis.
MACHINE LEARNING 張銘軒 譚恆力 1. OUTLINE OVERVIEW HOW DOSE THE MACHINE “ LEARN ” ? ADVANTAGE OF MACHINE LEARNING ALGORITHM TYPES  SUPERVISED.
COMP 111 Programming Languages 1 First Day. Course COMP111 Dr. Abdul-Hameed Assawadi Office: Room AS15 – No. 2 Tel: Ext. ??
CSE 501N Fall ‘09 00: Introduction 27 August 2009 Nick Leidenfrost.
COMP Introduction to Programming Yi Hong May 13, 2015.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 10a-11:30a Instructor: Christoph F. Eick Classroom:AH123
SE-308 Software Engineering-II 7th Term SE University of Engineering & Technology Taxila, Pakistan Software Engineering Department.
CS598CXZ (CS510) Advanced Topics in Information Retrieval (Fall 2014) Instructor: ChengXiang (“Cheng”) Zhai 1 Teaching Assistants: Xueqing Liu, Yinan Zhang.
Understanding Student Expectations & Effective Teaching Presented by: Dr. Laura Marler Associate Professor of Management College of Business Mississippi.
COMP 465W Software Engineering Fall Components of the Course The three main components of this course are: The study of software engineering as.
Teaching Thermodynamics with Collaborative Learning Larry Caretto Mechanical Engineering Department June 9, 2006.
CSCI 51 Introduction to Computer Science Dr. Joshua Stough January 20, 2009.
Dana Nau: CMSC 722, AI Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Research and Writing Seminar Thursday, – 16 35, room C To find an up-to-date version of the schedule and to read the papers check the website
ICS 6B Boolean Logic and Algebra Fall 2015
Yazd University, Electrical and Computer Engineering Department Course Title: Advanced Software Engineering By: Mohammad Ali Zare Chahooki 1 Introduction.
An Orientation: General Psychology Online. The Course Menu Shown on the far left is the menu used to navigate our Psychology course.
COP3502: Introduction to Computer Science Yashas Shankar.
Eick : First Lecture COSC 7363 Overview Advanced AI Hyla-Tree Frog History of AI thr AI and the Web Traditional Clustering Graph Mining Spatial and.
Most of contents are provided by the website Introduction TJTSD66: Advanced Topics in Social Media Dr.
G5BAIM Artificial Intelligence Methods Dr. Graham Kendall Course Introduction.
C Programming Lecture 1 : Introduction Bong-Soo Sohn Associate Professor School of Computer Science and Engineering Chung-Ang University.
1 CS 381 Introduction to Discrete Structures Lecture #1 Syllabus Week 1.
CS112: Course Overview George Mason University. Today’s topics Go over the syllabus Go over resources – Marmoset – Blackboard – Piazza – Textbook Highlight.
Data Structures and Algorithms in Java AlaaEddin 2012.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
Course Overview Stephen M. Thebaut, Ph.D. University of Florida Software Engineering.
{ COMI Thursday night 5:30 – 10:30 Room 6054.
Introduction to CSCI 1311 Dr. Mark C. Lewis
CSc 1302 Principles of Computer Science II
Eick: Introduction Machine Learning
Objectives of the Course and Preliminaries
CS5040: Data Structures and Algorithms
COMI Friday 9:00 – 1:50 Room 2108.
Tentative Teaching Through October 14
Christoph F. Eick: A Gentle Introduction to Machine Learning
CS 6640 Sample Presentation
Welcome! Knowledge Discovery and Data Mining
Presentation transcript:

Eick : First Lecture COSC 7362 Overview Advanced Machine Learning Hyla-Tree Frog Outlier and Anomaly Detection Spatio-Temporal Clustering Ensemble Learning Deep Learning Density Estimation / Model-based Approches Research Methodology: How to be successful in the field of Machine Learning

Eick : First Lecture COSC What is Machine Learning? Machine Learning –Study of algorithms that –improve their performance –at some task –with experience Optimize a performance criterion using example data or past experience. Role of Statistics: Inference from a sample Role of Computer science: Efficient algorithms to –Solve the optimization problem –Representing and evaluating the model for inference

Eick : First Lecture COSC What We Talk About When We Talk About“Learning ” Learning general models from a data of particular examples Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and scarce. Example in retail: Customer transactions to consumer behavior: People who bought “Da Vinci Code” also bought “The Five People You Meet in Heaven” ( Build a model that is a good and useful approximation to the data.

Eick : First Lecture COSC Why Should Computers Learn to Learn? Machine learning is programming computers to optimize a performance criterion using example data or past experience. Learning is used when: –Human expertise does not exist (navigating on Mars), –Humans are unable to explain their expertise (speech recognition) –Solution changes in time (routing on a computer network) –Solution needs to be adapted to particular cases (user biometrics, costumer preferences)

Eick : First Lecture COSC Data Mining/KDD/Data Analytics/BigData Retail: Market basket analysis, Customer relationship management (CRM) Finance: Credit scoring, fraud detection Manufacturing: Optimization, troubleshooting Medicine: Medical diagnosis Telecommunications: Quality of service optimization Bioinformatics: Motifs, alignment Web mining: Search engines... Definition := “KDD is the non-trivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data” (Fayyad ) Applications:

Eick : First Lecture COSC AI Is learning is part of human intelligence, machine learning is also a major focus of AI

Eick : First Lecture COSC 7362 Growth of Machine Learning Machine learning is preferred approach to –Speech recognition, Natural language processing –Computer vision / automatically creating models maps of cities –Medical outcomes analysis –Robot control –Computational biology –Understanding the world using sensor data This trend is accelerating –Improved machine learning algorithms –Improved data capture, networking, faster computers –Software too complex to write by hand –New sensors / IO devices –Demand for self-customization to user, environment –It turns out to be difficult to extract knowledge from human experts  failure of expert systems in the 1980’s –New applications: individualized services, recommender systems,… 7

Eick : First Lecture COSC 7362 General Thoughts and Teaching Philosophy I Focus of the course is providing more in-depth knowledge in the areas mentioned on the previous slide and to learn how to read, summarize, present, and evaluate scientific papers. Interactive discussion of papers, research topics, research methodology and homework solutions. No cheating! No cheating! No cheating! Teaching Philosophy: You will have to face some criticism; otherwise, you will not learn anything. Learning without exposing yourself to errors is impossible!! If you do not know what you do wrong, it is hard to improve! No matter if you like it are not, you will have to talk a lot in this course. During the course you will also make several informal, unstructured presentations. The ideal class size for this course is 12!! You do Something Feedback Learn

Eick : First Lecture COSC 7362 General Thoughts and Teaching Philosophy II No projects and only 2 quizzes: We.: October 21 and likely Mo., November 23; there still will be student presentations and maybe discussions on Nov. 30/Dec.2… Learning by doing!! I am aware that most of you are not too experienced in these matters; consequently, my expectations are initially quite low; if you do not understand something write down it down as a question. The workload of the course is not that bad; however, you will need to work continuously…; different students might have different workloads/tasks but Dr. Eick will do his best to ensure that each student’s total workload is approximately the same. Please, always bring hard-copy/labtop with softcopy of the paper(s) we are discussing to the lecture!! Not reading papers that will be discussed on a particular day is not acceptable! Paper will be covered: –Paper Walkthroughs (there will be 4-5 of those) — see later slide –Student Presentations

Eick : First Lecture COSC 7362 General Thoughts and Teaching Philosophy III One objective if this course is to describe what other do in your own words – consequently, no copying from any sources (web or class mates) or if you do reference the source and if you use actual text properly quote it. If you face particular or unusual problems when taking this course: talk to me during my office hours or send me an . During the course you will also make 2 more formal presentations — this is tentative: the exact requirements depend on how many students are enrolled in the course; by September 9, we should know exactly what the requirements are. We also will cover (machine learning) research methodology; you will learn about the main ingredients of conducting a successful (machine learning) project. Finally, about 20% of the lecture time will be allocated to discussions and answering student questions.

Eick : First Lecture COSC 7362 General Thoughts and Teaching Philosophy IV You will also get some exposure concerning writing abstracts, summaries, introductions, white paper, and conclusions. Learning to write included to know what people expect concerning what you write and how what you write will be judged. In this course, we will try it several teaching strategies, some of which will be revised or even abandoned as the course progresses. Is a by product you will hopefully get a better understanding on how to conduct a scientific project and on how to summarize and present its results. The will be a few (6-9) lectures by Dr. Eick that typically give an introduction to a few of the five research themes that are covered in addition to machine learning methodology this semester. The course website plays an important role when teaching the course:

Eick : First Lecture COSC 7362 General Thoughts and Teaching Philosophy V Prerequisites: COSC 6342 or Data Mining and co-enrolled in COSC 6342 or … or consent of the instructor. If you are a non-Computer Science student and plan to take the course, see Dr. Eick during his office hour today or make an appointment with Dr. Eick after today’s lecture. More emphasis in this course is put on attendance and on leading and participating in course discussions. As the way the course is actually taught depends on the number of students who takes this course, the teaching of this course will be finalized early October. As the schedule / content of the course is not written into stone, (particularly what will be discussed after October 15, 2015) your input with respect to interesting papers / topics to be discussed in the second half of the course is encouraged.

Eick : First Lecture COSC 7362 Technical Topics Covered in Fall 2015 Anomaly and Outlier Detection Density Estimation and Model-based Approaches to Machine Learning Deep Learning Ensemble Learning Spatial Temporal Clustering Maybe a sixth topic!-feel free to propose a subarea of machine learning you might be interested in by September 3, 2015.

Eick : First Lecture COSC 7362 Tentative Teaching Plan Next 6 Weeks Aug. 24: Course Overview ( ) Aug. 26: Lecture: Introduction to Anomaly Detection + Overview Density Est. Aug. 31: Overview Density Estimation continued Sept. 2: Paper-walkthrough: Survey Book Chapter Paper Anomaly Detection Sept. 9: Paper-walkthrough: Silverman’s Classical paper on Density Estimation; maybe “Thoughts on Research” Sept. 14: Student Presentation: Using GMM for Anomaly Detection Sept. 16: Maybe: Density-estimation Tools in R (a lot of presenters!) Sept. 20: Likely more Student Presentations Anomaly Detection Sept. 22: Student Presentation(s): Research papers centering on Non-Parametric Density Estimation September 27: Lecture and Discussion: How to read scientific papers September 29: Paper-walkthrough Overview paper Deep Learning October 4: Lecture and Discussion: How to write scientific papers October 6: Homework1: Writing Abstracts / Introduction and Leftovers October 11: Student Presentation: Deep Learning … October 21: Quiz1 Remark: Student Presentation Topics through Sept. 30 will be assigned approx. Sept. 3, 2015

Eick : First Lecture COSC 7362 Course Activities A lot of informal presentations and discussions 1-3 formal presentations about a paper covered in the course Writing abstracts, introductions, conclusions and paper reviews --- learning by doing; there will be two homeworks that center on those issues 2 Quizzes that ask questions about papers we have read Discussions of research topics as well as of homework solutions of your fellow students. Learning how to read, summarize, present, and review papers. Background knowledge on ‘how to perform a research project’ and on ‘how to be successful in your research / career’. A few activities might be conducted in groups; e.g. reviewing Discussing many other, entertaining things---such as the ‘Giant Squid’, life of xyz-- most of which are still related to one of the above activities.

Eick : First Lecture COSC 7362 Forms of Covering Papers in the Course Papers will be discussed, presented in many different forms in this course: Slow Walk Through (I only plan to have 3-5 of those!!) Guided Walk Through Other Walk Throughs (I did not consider yet!) By answering a given set of questions. Just Discussion 1-Page (5-page) Summary of a Paper Professional Powerpoint Presentation Profession Paper Review (  November 2015) …

Eick : First Lecture COSC 7362 (Slow) Paper Walk Throughs Papers will be discussed paragraph by paragraph Very slow!! Therefore, there will be only 3-5 of those… Course participants are responsible for sections of the paper. Responsibilities include: –Lead discussion –Present short summaries for boring sections to speed up things –Ask questions about things they do not understand –Prepare review questions for the other students that will be discussed either immediately or after a delay. Everybody should read the paper carefully including the sections you are not responsible for. It might be a good idea to create brief summaries for the read sections and to capture, what you do not understand, in form of questions. If you finished reading the paper try to come up with your own evaluation of the paper We will not only discuss the contents of the paper, but also address the question “why an author writes a paper in a particular way” and “how the presentation of the discussed paper could be improved / made more convincing. Additionally, issues on how to write a paper will be discussed during slow walk throughs discussions --- these matters are Dr. Eick’s responsibility..

Eick : First Lecture COSC 7362 Course Objectives Upon completion of this course, students will know what the goals and objectives of machine learning are will know how to read, understand, summarize, evaluate machine learning papers will have sound knowledge of particular subfields of machine learning, namely Anomaly and Outlier Detection, Deep Learning, Density Estimation, Ensemble Learning, and Spatial- Temporal Clustering will learn the main ingredients to conduct a machine learning project successfully will learn how to make presentations and to lead discussions

Eick : First Lecture COSC 7362 Course Content 1.Goals and Objectives of Machine Learning 2.Anomaly and Outlier Detection 3.Density Estimation & Model-based Approaches to Machine Learning 4.Deep Learning 5.Machine Learning Research Methodology 6.Ensemble Learning 7.Spatial-Temporal Clustering 8.How to Read, Understand, Summarize, and Evaluate Machine Learning Papers with Practical Exercises

Eick : First Lecture COSC 7362 Course Elements 7-10 lectures 2 Quizzes 4-5 Paper Walkthroughs 8-12 Student Presentations 2-3 Discussions 2 Homeworks

Eick : First Lecture COSC 7362 Grading 2 Quizzes: 40% Student presentations and Leading Course Discussions: 28% Homeworks: 17% Class Participation: 15% Remark: These percentages are preliminary and subject to change.

Eick : First Lecture COSC 7362 Consultation Instructor: Dr. Christoph F. EickDr. Christoph F. Eick office hours (573 PGH): M 3:15-4:45p W 2:30-3p