Data and Data Science Some Final Thoughts. Scientific Programming Basically always follows the same structure: – Formatted reading in of the data and.

Slides:



Advertisements
Similar presentations
aims By the end of the session you will be able to: Explain how images are represented in Processing Manipulate images using the pixels member Use for.
Advertisements

Guy Griffiths. General purpose interpreted programming language Widely used by scientists and programmers of all stripes Supported by many 3 rd -party.
Procedures and Functions. What are they? They are both blocks of code that can be reused to perform specific task. However there is a difference: Function-
Successful College Writing 5 th Edition Kathleen T. McWhorter ©2012 Bedford/St. Martin’s ISBN-10: ISBN-13: Successful College.
SCOPE! Simple Review For Your Reference If Needed.
Personal Reading Procedures Reading for High School and Beyond By; Holden Stengel.
AP Computer Science. Google Interview Question You are given 8 identical looking balls. One of them is heavier than the rest of the 7 (all the others.
Final Thoughts. When you get data… Check for Viruses Lock the files – Right click> properties>click on read only Assume the data has not been cleaned.
CS101- Lecture 11 CS101 Fall 2004 Course Introduction Professor Douglas Moody –Monday – 12:00-1:40 – – –Web Site: websupport1.citytech.cuny.edu.
Math 105: Problem Solving in Mathematics. Course Description This course introduces students to the true nature mathematics, what mathematicians really.
Lecture 7 Sept 19, 11 Goals: two-dimensional arrays (continued) matrix operations circuit analysis using Matlab image processing – simple examples Chapter.
Lecture 6 Sept 15, 09 Goals: two-dimensional arrays matrix operations circuit analysis using Matlab image processing – simple examples.
ENGR 3 rocks. Debugging Skills Read the error message. Try to decipher which line of C code is causing the problem. Read that line of code carefully for.
School of Computer ScienceG53FSP Formal Specification1 Dr. Rong Qu Introduction to Formal Specification
Writing an Essay Career Fair Paper.
Test Preparation Strategies
index.php Palmyra Area High School 1.
Homework 1 Generally done well – but remember all is an illusion.
Programming For Nuclear Engineers Lecture 12 MATLAB (3) 1.
Researching a Persuasive Essay How to Formulate and Support An A+ Argument.
The Writing Process Steps in the Writing Process.
Lists in Python.
Spring Learning Statement #1 I am learning that the use of technology is not always engaging for students and does not always support student learning.
Numerical Approximation1  You have some Physics equation or equations which need to be solved  But: You can’t or don’t want to do all that mathematics,
Debugging !!!  ► Believe it or not, errors happen ► The most frustrating part of programming ► Hunting down an error can be harder and more time consuming.
Elementary School Science: Emphasizing the Basics Presented by Frank H. Osborne, Ph. D. © 2015 EMSE 3123 Math and Science in Elem. Ed.
Welcome to IIT and cs115!.
Time Management and Organization CLC Workshop. Identify What are the specific issues that prevent you from completing work? Motivation Unrealistic expectations.
NOTE TO INSTRUCTORS I don’t really have “clicker” questions on the material in Chapter 2 – Except to have them figure out how some of the Excel features.
Introduction to MATLAB 7 Engineering 161 Engineering Practices II Joe Mixsell Spring 2010.
Math 105: Problem Solving in Mathematics
Chapter 7 File I/O 1. File, Record & Field 2 The file is just a chunk of disk space set aside for data and given a name. The computer has no idea what.
Current Assignments Homework 2 is available and is due in three days (June 19th). Project 1 due in 6 days (June 23 rd ) Write a binomial root solver using.
Introduction to Computer Application (IC) MH Room 517 Time : 7:00-9:30pm.
Problem-Based Learning Meets Web 2.0 Frances A. May University of North Texas 2009 LOEX Conference May 1, 2009 – Albuquerque, NM.
Hirophysics.com Steven Jeffrey K irkup. Hirophysics.com What is Object Oriented Programming (OOP)? Essentially, OOP is a protocol that can be followed.
(As if you don’t have enough to do!) This presentation was created by J. Sheppard and adapted by S. Russell Introduction to Assessment.
Skills That Go Beyond the Single Word Level Inferencing/prediction Cohesion Main idea Summarizing Drawing conclusions.
Lecture 20 The last function of reason is to recognize that there are an infinity of things which surpass it. Blaise Pascal.
Howto use Eureka ICS 105 Research Team Fall 2000.
How to Rock the Chairman’s Award By: The Killer Bees.
Comparative Civilizations 12 Introduction. Course Structure This is very much a web-based course. We also use plenty of text-based material from the Library,
If the music works, it has no real significance just a little background noise for your ears and reading enjoyment.
The TGD Project Daniel Hajas and David Turner. What is a Tactile Graphics Display? A screen for your fingers! Our TGD will use Taxels rather than pixels.
Homeschooling vs. The Public School System BY TAYLOR SPILKER.
David Streader Computer Science Victoria University of Wellington Copyright: David Streader, Victoria University of Wellington Debugging COMP T1.
COMP1927 Course Introduction 16x1
How the Session Works Outline Practical on arrival Talk 1 Reflect on practical Clarify concepts Practical exercises at your own pace Talk 2: Further concepts.
10 Chapter 101 Using Menus and Validating Input Programming Logic and Design, Second Edition, Comprehensive 10.
Version Control and SVN ECE 297. Why Do We Need Version Control?
9/29. Grammar Practice Perfect Paragraph – Summary Edition Practice! End Goal – Reinforce grammar and understand how to create a perfect paragraph.
Introduction to Literate Programming in Matlab 2WN50 – Week programming-in-matlab.pptx?dl=0.
An array example: the transpose A 2-D array corresponds to a mathematical matrix: One simple matrix operation transposes the matrix.
Structured Programming (COIT 29222) Term 2, 2009, Final Exam Date And Time  As advised by the university Open/Closed Book (3 hours)  Structured Programming,
Day 26 Arrays Episode 2. Array Lengths To determine the length of an array, use the command: array_Name.length.
Importance of formative literacy experiences Daniella Ramos.
Cover Letter This year I have learned about the basics of science. I was able to obtain knowledge about the rock cycle, the ocean floor, and the periodic.
MDFP Mathematics and Statistics 1. Univariate Data – Today’s Class 1.STATISTICS 2.Univariate (One Variable) Data 1.Definition 2.Mean, Median, Mode, Range.
CIT 590 Intro to Programming Lecture 6. Vote in the doodle poll so we can use some fancy algorithm to pair you up You.
How Might Classroom Climate Support Mathematical Discourse? Productive Struggle? Reasoning? Physical Space?
Debugging using By: Samuel Ashby. What is debugging?  A bug is an error in either a program or the hardware itself.  Debugging is first locating and.
AP CSP: Cleaning Data & Creating Summary Tables
Year 11 Subject Selection 2018 VCE Mathematical Methods CAS Units 1&2
Procedures Programming Guides.
Data and Data Science Some Final Thoughts.
MATLAB – What Is It ? Name is from matrix laboratory Powerful tool for
MATLAB – What Is It ? Name is from matrix laboratory Powerful tool for
CMSC201 Computer Science I for Majors Final Exam Information
Tour of NCL Website Modified by R. Grotjahn
Problem Solving Fall 2016.
Presentation transcript:

Data and Data Science Some Final Thoughts

Scientific Programming Basically always follows the same structure: – Formatted reading in of the data and convert to arrays (data initially is probably in an array) – Data sorting and possible reformat/transform – Looping through the data – Conditional If statements everywhere – Mathematical operations on pieces of the data – often calls to library subroutines – Output for analysis

Things you should now recognize after experiencing this “class” Preparing raw data for analysis is a lot more time consuming than you think Preparation is best done via formatted read and formatted write operations. Prepare subsets of data to operate on. Always try to array the data Attempt to write your own mathematical operation filters and not use “libraries” Always keep track of local minimization when fitting data;

More things Plot the data early and often – get a feeling for the noise. Use powerful command line procedures for certain data sorts or global edits. Build command line pipeline that produces a graphical output whenever your re-run code on data Search for existing tools to adapt – especially for visualization needs Practice, practice, practice; mistakes, mistakes, mistakes is the best way to learn.

Data Mining The third exercise in the last homework assignment is effectively a data mining exercise. Temp. Gradient Find Global Max

Books To Read

So you wanna be a data scientist?

Skills Needed (adapted from Burtch Works Executive Recruiting) Python Easily Trumps R

How to practice:

Advice from Paco Nathan (google him) Scalding:

Summary We didn’t hold your hand and teach you specific code for a reason  that won’t help you – you can either be pissed off at that or realize that to make progress you have to go forth and practice. Your career as a data scientist is much more probable than that as a physicist  either your believe that or you don’t