Data Science Dianna Xu Bryn Mawr College 1. The Course 300-level Computer Science elective CS majors and minors Pre-reqs: CS1, CS2 (Data Structures),

Slides:



Advertisements
Similar presentations
Department of Statistics R.W. Doerge West Lafayette, IN Developing statistics programs for majors outside of statistics and.
Advertisements

1 Undergraduate Curriculum Revision Department of Computer Science February 10, 2010.
1 Welcome to the Kernel-Class My name: Max (Welling) Book: There will be class-notes/slides. Homework: reading material, some exercises, some MATLAB implementations.
Teaching Courses in Scientific Computing 30 September 2010 Roger Bielefeld Director, Advanced Research Computing.
Department of Mathematics and Computer Science
Panel: Mathematics in CS Iowa Undergraduate Computer Science Consortium October 2005.
New Geometric Methods of Mixture Models for Interactive Visualization PIs: Jia Li, Xiaolong (Luke) Zhang, Bruce Lindsay Department of Statistics College.
Collaborative Filtering in iCAMP Max Welling Professor of Computer Science & Statistics.
Principal Component Analysis
University of Hartford Computer Science Program The purpose of this slide set is to provide an overview of the Computer Science Program at the University.
Financial Engineering Club Career Path and Prep. Entry Level Career Paths Type 1: Research based Background: Physics, Electrical Engineering, Applied.
Data Analytics Program at Drake Brad C. Meyer, Chair Information Management and Business Analytics.
Overview of the MS Program Jan Prins. The Computer Science MS Objective – prepare students for advanced technical careers in computing or a related field.
Computer Science Graduate Programs at UTSA Dr. Weining Zhang.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
General Information Course Id: COSC6342 Machine Learning Time: MO/WE 2:30-4p Instructor: Christoph F. Eick Classroom:SEC 201
Department of Mathematics, Statistics and Computer Science Degree Requirements Contact Information Begin Exit.
1 By The Numbers. 2 One of largest programs of its kind at a major research institution Replication is underway at other universities. US Department of.
Graduate Programs in Dept of Computer Science Univ. of Texas at San Antonio Dr. Weining Zhang.
CS525: Big Data Analytics Machine Learning on Hadoop Fall 2013 Elke A. Rundensteiner 1.
Anomaly detection with Bayesian networks Website: John Sandiford.
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 10a-11:30a Instructor: Christoph F. Eick Classroom:AH123
J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive Datasets, 2.
Mathematics in Computer Science at Loras College Steve Mosiman.
Proposal for Background Requirements Changes For the current MS/PhD programs, background requirements are expressed in the "Background Preparation Worksheet"
Dr. John Lowther, Associate Professor of CS Adjunct Associate Prof. of Cognitive and Learning Sciences Computer Graphics:
CSE543T: Algorithms for Nonlinear Optimization Yixin Chen Department of Computer Science & Engineering Washington University in St Louis Spring, 2011.
Bioinformatics Education at North Central College Caroline St.Clair Professor of Computer Science.
Database Systems Carlos Ordonez. What is “Database systems” research? Input? large data sets, large files, relational tables How? Fast external algorithms;
What’s Right with Undergraduate Statistics? Exciting Course Options.
Advanced Analytics on Hadoop Spring 2014 WPI, Mohamed Eltabakh 1.
Jonathan Crabtree Assistant Director of Computing and Archival Research UNC Chapel Hill, Odum Institute Vision for Sociometric Analysis of Long-tail Science.
Learning from Positive and Unlabeled Examples Investigator: Bing Liu, Computer Science Prime Grant Support: National Science Foundation Problem Statement.
GTES-CS Georgia Tech Emerging Scholars in Computer Science.
CS Curriculum Changes Fall, BS Computer Science 2015 COMPUTER SCIENCE COURSES—64 Hours COMPUTER SCIENCE CORE (48 Hours) CS 258Intro to Object-Oriented.
CSE 312 Foundations of Computing II Instructor: Pedro Domingos.
CSC 478 Programming Data Mining Applications Course Summary Bamshad Mobasher DePaul University Bamshad Mobasher DePaul University.
The Level-2 Projects for Course Clusters Haojun Sun College of Engineering Shantou University.
Mining of Massive Datasets Edited based on Leskovec’s from
The Fulton Experience “Education for the Changing World”
General Information Course Id: COSC6342 Machine Learning Time: TU/TH 1-2:30p Instructor: Christoph F. Eick Classroom:AH301
Department of Mathematics and Computer Science
Messy Data: Teaching Students Early on About the Realities of Data.
Observations from Large Programs September 27, 2013 Guidelines for Undergraduate Statistics Programs Workgroup Webinar Series
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Knowledge Discovery in a DBMS Data Mining Computing models and finding patterns in large databases current major challenge in database systems & large.
Beginning 1956  Associate of Science Degree included 27 credits of mathematics  Math 12 Plane Trigonometry  Math 13 Analytical Geometry  Math 91 Calculus.
CSE543T: Algorithms for Nonlinear Optimization Yixin Chen, PhD Professor Department of Computer Science & Engineering Washington University in St Louis.
GCSE Computer Science Content Overview
Usman Roshan Dept. of Computer Science NJIT
Department of Mathematics, Statistics and Computer Science
BEng(CompSc) Curriculum Structure & Highlights
Computer Science Courses
Business analytics Lessons from an undergraduate introductory course
Information about the MSE in Data Science
René Vidal Time/Place: T-Th 4.30pm-6pm, Hodson 301
Deep Learning with TensorFlow online Training at GoLogica Technologies
CSE543T: Algorithms for Nonlinear Optimization Yixin Chen, PhD Professor Department of Computer Science & Engineering Washington University in St Louis.
Machine Learning & Data Science
CS7280: Special Topics in Data Mining Information/Social Networks
CMPT 733, SPRING 2016 Jiannan Wang
CS 179 Project Intro.
Session 2:50 PM - 4:20 PM; Saturday, Nov 18, 2017
Data Science with Python
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Usman Roshan Dept. of Computer Science NJIT
Computer Science Courses in the Major
Presented by: Ava Meredith, Seattle Central College
Computer Science Dr Hwang Chair, Computer Science Department
Presentation transcript:

Data Science Dianna Xu Bryn Mawr College 1

The Course 300-level Computer Science elective CS majors and minors Pre-reqs: CS1, CS2 (Data Structures), Discrete Math and Linear Algebra Unstructured data and explorative data analysis Iterative processes that require programming skills and and knowledge of algorithms 2

200-level Data Visualization Course Taught Spring 2014 at Haverford – Stats basics, linear regression – Clustering – Baby network analysis – PageRank – Visualization with D3 50% overlap of students 3

Topics Statistical methods (2-3 weeks) – basics, regression analysis, Bayesian methods Machine learning background (2-3 weeks) – multivariate regression and logistic regression Dimensionality reduction (3 weeks) – PCA, SVD, Kernel PCA, other non-linear methods Topological data analysis (2-3 weeks) – manifold learning, intro to TDA Network analysis (2-3 weeks) – collaborative filtering, community detection 4

Project-Oriented Students will team up for a semester-long project on data analysis for local "data clients" – Faculty members who have "real life" data sets and research questions – Library, registrar and institutional research 5

Data Sets Digital Du Chemin – repertory of polyphonic songs from 16th-century France Dark Reactions – chemical experiments with associated reactants and results Maine athletes Anil's bio data? 6

Machine Learning Modules Focused on the process of doing (good) machine learning i.e. – (step 1) Pose a problem in the language of machine learning – (step 2) Gather data – (step 3) Choose a potential method for solving the problem – (step 4) Setup an experiment to properly evaluate your method – (step 5) Evaluate experiment and possibly go to step 2 or 3 Possible toolsets: – iPython notebook – Scikit-learn 7

iPython Notebook Modules 8 cture_1/Machine%20Learning%20Lecture%201.html arning_lecture_2/Machine%20Learning%20Lecture%202.html

9 Open-source, python-based package for machine learning. Principal strength is a consistent API and enforcement of "good" machine learning practices