Cse 344 May 30th – analysis.

Slides:



Advertisements
Similar presentations
Semester wrap-up …the review.. Course ReCap To make you notice interfaces, good and bad – You’ll never look at doors the same way again To help you realize.
Advertisements

Lecture-8/ T. Nouf Almujally
Data Structures and Programming.  John Edgar2.
CSC 105 Introduction to Computer Science Professor Batchelor.
 A set of objectives or student learning outcomes for a course or a set of courses.  Specifies the set of concepts and skills that the student must.
Last Words COSC Big Data (frameworks and environments to analyze big datasets) has become a hot topic; it is a mixture of data analysis, data mining,
CS461: Principles and Internals of Database Systems Instructor: Ying Cai Department of Computer Science Iowa State University Office:
Info Systems Fall 2013 . The modern role of often not-so-modern database technology  We will look at MySQL SQL PHP  NoSQL DBs Mongo and GUIs for it.
CSEP 545 Transaction Processing for E-Commerce Course Information Spring (March – May) 2007.
What else is there? CMPT 454: Database Systems II. – Transaction Management. – Query Planning. – Optional topics, e.g. data mining, information retrieval,
Most of contents are provided by the website Introduction TJTSD66: Advanced Topics in Social Media Dr.
Introduction.  Administration  Simple DBMS  CMPT 454 Topics John Edgar2.
1 Seattle University Master’s of Science in Business Analytics Key skills, learning outcomes, and a sample of jobs to apply for, or aim to qualify for,
HPHC - PERFORMANCE TESTING Dec 15, 2015 Natarajan Mahalingam.
What Business Analytics Can Do For You!
Science Fair Information Night
Summer 2017 Hub meeting.
Course Overview - Database Systems
GCSE science End of Year 10 tests
CSE 102/ISE 102 Introduction to Web Design and Programming
COMP 135: Human-Computer Interface Design
Presenter Date | Location
Course Introduction 공학대학원 데이타베이스
CS122B: Projects in Databases and Web Applications Spring 2017
CS122B: Projects in Databases and Web Applications Winter 2017
Final exam: Wednesday, March 20, 2:30pm
Unit 2.5 Translators and Facilities of Languages – Lesson 2
Bridging the Data Science and SQL Divide for Practitioners
SQL – Transactions in SQLite
March 27 – Course introductions; Adts; Stacks and Queues
ACC 349 Course Inspiring Minds/ tutorialrank.com
Developing Information Systems
AP Computer Science Principals Course Importance and Overview
September 27 – Course introductions; Adts; Stacks and Queues
Temple Analytics Challenge
Introduction of Week 3 Assignment Discussion
ACC 349 Competitive Success-- snaptutorial.com
INF 620 Enthusiastic Study/snaptutorial.com
ACC 349 Education for Service-- snaptutorial.com
ACC 349 Teaching Effectively-- snaptutorial.com
COSC 6340 Projects & Homeworks Spring 2002
March 21st – Transactions
February 7th – Exam Review
Topics Covered in COSC 6340 Data models (ER, Relational, XML (short))
CSE255 Final Exam Advice and Hints
Stat 414 – Day 15 Exam 1 Review.
DATA MINING.
CS122B: Projects in Databases and Web Applications Winter 2018
Machine Learning with R
CS 456 Interactive Software.
Data Science introduction.
Choosing the Proper Querying Course
Bell-ringer: 4 minutes Choose the most appropriate units to measure the weight of a textbook a) Pounds b) Grams c) Ounces d) Kilograms Copy CCSS/Objectives/HW.
Do I really make a difference?
CS122B: Projects in Databases and Web Applications Winter 2019
Cse 344 June 1st – Final Review.
What could you invent to measure something in a new way…….
CS122B: Projects in Databases and Web Applications Spring 2018
Train the Trainer Your Name.
CSE 444 Database Management Systems Spring 1999 University of Washington Introduction and Welcome © 1999 UW CSE 4/4/2019.
AP Computer Science Principals Course Importance and Overview
Business concentration, minor and certificate programs
Building a better APUSH DBQ
Designing Your Performance Task Assessment
Welcome! Knowledge Discovery and Data Mining
Lecture 1a- Introduction
Sarah Diesburg Operating Systems CS 3430
Presented by: Ava Meredith, Seattle Central College
R for Data Science Data science Data science is a booming field in today’s world. Since Artificial Intelligence is the main focus of today’s technology,
CSE 444 Database Management Systems Autumn 1997 University of Washington Introduction and Welcome © 1997 UW CSE 12/12/2019.
Presentation transcript:

Cse 344 May 30th – analysis

Administrivia HW8 Due Friday, 11:30 OQ7 Due Tonight, 11:00 Course evaluations! Exam review Section tomorrow Problems from previous midterms Friday June 1, in-class Topics list

Exam June 6th, 8:30 – 10:20 am Please don’t show up at 9:30! Cumulative final Focus on second half of the course Database Design Transactions Covers through the end of isolation last Friday Practice exam + solutions out Friday

Exam One sheet of notes Front and back Written or typed Length Roughly 50% longer Twice the time Watch out for tricky questions

Exam Free to meet Friday Now is the time to discuss grades, not after the final All posted grades will be final at end of day on Friday HW7 Graded before final HW8 Graded by end of quarter

Databases Two types of data base usage Transactional Analytical How does the usage differ?

Databases Transactional Maximizing throughput Ensuring consistency No “right way” to deploy a DB

Databases Transactional Maximizing throughput Ensuring consistency No “right way” to deploy a DB Things to consider Relational v. Semi-structured Parallelism and Multi-national distribution Durability and analytical accumulation

Analytical DB usage Web DB are primarily transactional Need to be processed to be analyzed Facebook

Analytical DB usage Web DB are primarily transactional Need to be processed to be analyzed Facebook Transactional element: free-user service Analytical element: paid-user service

Analytical DB usage Web DB are primarily transactional Need to be processed to be analyzed Facebook Transactional element: free-user service Analytical element: paid-user service Which do you think is most important?

Analytical DB usage Data analysis is big business Machine learning Data mining “Captcha” tests

Analytical DB usage Data analysis is big business Machine learning Data mining “Captcha” tests Analysis in SQL? Very limited capacity Combine with other data processing

Data analysis Data analysis is big business Machine learning Data mining “Captcha” tests Analysis in SQL? Very limited capacity Combine with other data processing

Analytical Tools Common data analysis tools

Analytical Tools Common data analysis tools Python -- great for those with CS background R – great for those with social science background

Analytical Tools Common data analysis tools Python -- great for those with CS background R – great for those with social science background “Common” data analysis tools Excel – see “Growth in a Time of Debt” Tableau – graphs are not analysis

Analytical Tools Python Good scripting language to learn Interfaces well with other languages R Designed for data science Well supported public packages Very pretty graphs

Data analytics vs Data science Analysis is often heuristic If you cannot explain “why” your model predicts behavior, then you don’t understand it Data science Combination of statistical analysis and cooperation with relevant experience

A note about ethics Computer aided decision making affects lives ”Growth in a time of debt” NYC CompStat Essential to be certain in the quality of your work and the impacts that your decisions will have Present data ethically

A note about ethics Computer aided decision making affects lives ”Growth in a time of debt” NYC CompStat “95% accuracy” can be a very misleading statement Cancer screening Precision

Analytical JObs Statistics Data cleaning Reconciling multiple data sources Verification Visualization

Conclusion While most DB usages need to be optimized for transactional usage, they must also be ready for analytical tools This is separate from most of the course as demonstrated Important and common application of data Big money for analytics, but also important to understand error and verification