Big data camp intro Jerry Davis 17 June 2019 1.

Slides:



Advertisements
Similar presentations
Copyright ©: SAMSUNG & Samsung Hope for Youth. All rights reserved Tutorials Software: Building apps Suitable for: Advanced.
Advertisements

Jordan Maxwell ADVANCED PROGRAMMING. DEFINITIONS PHP: A server side Programming language often used in websites. API: ( Application programming interface.
Unit 1 – Improving Productivity Instructions ~ 100 words per box.
ITIS 1210 Introduction to Web-Based Information Systems Chapter 27 How Internet Searching Works.
Unit 1 – Improving Productivity Instructions ~ 100 words per box.
ICOS big data camp May and 29, 2014 Co-sponsored by ICOS, ARC, and LSA.
Online Safety 6th grade. Rules to know Protect your privacy Protect your password Protect the privacy of others Beware of contests, clubs, prizes, & gifts.
Win32 Programming Lesson 2: The Tools of the Trade.
NOTE: To change the image on this slide, select the picture and delete it. Then click the Pictures icon in the placeholder to insert your own image. Cyber.
Revolutionary War Project. You will be doing a research project about the Revolutionary War. You will research people, battles, and events. This project.
OpenWetWare: Share your science Erik Dassi Centre for Integrative Biology University of Trento Mattarello
WHAT YOU NEED TO KNOW ABOUT MLA. HOW TO USE MLA IN YOUR WRITING. THESE NOTES WILL BE CHECKED FOR A CLASSWORK GRADE EVERY WEDNESDAY. MLA Format.
Let’s Get Savvy about Technology Mrs. George Dondero School.
Introduction to Programming and App Inventor. Introduction What is a computer program? Introducing App Inventor Getting hands on with App Inventor.
Introduction to CSCI 1311 Dr. Mark C. Lewis
Internet Etiquette or Netiquette
Cyber Info Gathering Techniques
1/25/17 – List 5 activities you could do to be an activist - check in with grades - all late work due Monday, January 30 Today: What is activism? How can.
Creating your online identity
Social Network.
Introduction to gathering and analyzing data via APIs Gus Cavanaugh
Professional Practices
June 5-9, 2017 Co-sponsored by ICOS and MIDAS
Cleveland SQL Saturday Catch-All or Sometimes Queries
We will watch a short news clip on the topic of ‘FORENSIC ENTOMOLOGY’
How to use the internet safely and How to protect my personal data?
JavaScript: Build "Rock, Paper, Scissors"
Journalism/ Yearbook Week 10.
How to use the internet safely and How to protect my personal data?
Acquirly Review Generation Setup and Overview
101.
Capitalizing on Social Media
Wading Through the Web Conducting Research on the Internet
Let’s Go Surfing: Internet Safety
Software Engineering Process
A POWERPOINT MADE TO KEEP YOU SAFE ONLINE!
INTRODUCTION.
Coding - The Ultimate Survival Skill
STOP. THINK. CONNECT. Online Safety Quiz.
INF 103 Education for Service-- snaptutorial.com.
INF 103 Teaching Effectively-- snaptutorial.com
INF 103 Education for Service-- tutorialrank.com
entrance survey Why are you completing an entrance survey?
Ian Ramsey C of E School GCSE ICT Smart working Software choices.
Course Overview - Database Systems
Finding work experience can be a tricky business
Research 101 Just the Basics!.
Lesson 2 5th Grade WELCOME & INTRODUCTION 1:30 CLASS WELCOME
11/29/2018 Department of Software Engineering and IT Engineering
Louisiana: Our History.
Tips & Tricks to make your life easier
Sr. Manager, Global Talent Acquisition
The dark web By Riley. Sampare.
Why do you attend school?
Student Success Services
PLAGIARISM AND CITATION BASICS
E - safety How e-safe are you?.
Tonga Institute of Higher Education IT 141: Information Systems
The Perfect Search Engine Is Not Enough
Unit 3 Introduction.
Software Engineering Process
CS 2530 Intermediate Computing Dr. Schafer
Staying safe online Use privacy settings Dangers – identity theft, appearing rude/angry, Think before you share pictures or personal information Use.
HU-JRNL 1: Ethics Prof. Vaccaro Jobs, Resumes & Branding
Technologies of Google Seminar Week 1
Cyber Literacy Week 9.
Software Engineering Process
Tonga Institute of Higher Education IT 141: Information Systems
CSCI 203: Introduction to Computer Science I
Open data in teaching and education
Presentation transcript:

Big data camp intro Jerry Davis 17 June 2019 1

What are we up to this week? Monday: overview, python, project group formation Tuesday: Machine Learning, Visualization, APIs & Scraping Wednesday: Networks, Text, CLI Thursday: Buffet of advanced topics Friday: write “Social capital asset pricing model (SCAPM)” app“Social capital asset pricing model (SCAPM)” app

What does social life look like today? Consultant running meeting on Google Hangouts Real estate agent checking listings Journalist applying for job Student writing paper for class Activist uploading files to Wikileaks Professor grading papers

The job description for 90% of the people at the University of Michigan: “Stare at a screen and

Thanks to ICTs, economics today is “roughly where astronomy was when the telescope was invented or where biology was when the microscope was invented.” (Robert Shiller, certified smart guy) 6

How should the pervasive “mediation” of contemporary social life affect social science?

Google Trends: the gateway drug for big data

New insights into traditional topics

Does racism influence voting?

Does racism influence voting?

New insights into new topics

If only someone would come up with a way to gather horrifyingly intrusive personal information online…

ICTs and social movements

One Facebook post

A deep philosophical point: A web page does not exist until you perceive it.

How big is big data? Visit your favorite website (e.g., www.umich.edu) Right-click and “View page source” Wait, what is all this stuff? Search for http Is there some convenient way to search through all this junk online, copy it, and drop it into a database for future use? (Will the site’s owner get mad?)

19

20

Some big data questions Where do I get “big data”? Is there some secret handshake I need? What does it look like? How do I make gigabytes of words and numbers into something meaningful?

A method and three tools to start The method: learning in groups (cf. “agile software development”) The tools: SQL: how to manipulate those databases underlying what you see on the Web Python: a pretty good open-source programming language

Be Not Afraid: lessons from “computer science” Plagiarized from the estimable Prof. Brian Noble

We’re All Charlatans Computer science: not a science Few “natural laws” because it is a human construction Exception: “This sentence is false.” (1/3 of EECS 376) Software engineering: not an engineering discipline Engineering: static/dynamic modeling, safety margins, etc. Software: “Recovery-oriented computing” (1/5 of EECS 582) A culture of decentralized collaborative tinkering Facebook: likely the most successful company run this way

MOVE FAST AND BREAK THINGS (like democracy) Facebook Rule #1 MOVE FAST AND BREAK THINGS (like democracy)

Don’t be afraid to make a mistake Everyone makes mistakes! I (Brian Noble) make programming mistakes all the time Students who actually do things make mistakes as well Professional staff at Facebook do too (obviously!) Fundamental to the process These are formal languages (vs. natural) Mortals aren’t inherently great at this

Stay focused and keep shipping Facebook Rule #2 Stay focused and keep shipping

Don’t Wait to Find Your Mistakes Build a little, test a little “You keep using that word. I do not think it means what you think it means.” --Inigo Montoya You have an important advantage! CS students believe they are really good at this But, no one is really good at this, just shades of bad

Done is better than perfect Facebook Rule #3 Done is better than perfect

Never Fly Solo Two people per keyboard, always Everyone is bad at this, but in different ways Only one of you needs to see the problem Trade hands-on-keyboard frequently It’s tempting to let one person “do the work” You lose much of the benefit this way Talk about what you are doing as you do it Forces you to reveal hidden assumptions Catch some mistakes even before you make them

Fortune favors the bold Facebook Rule #4 Fortune favors the bold

Practical tips There are no new problems under the sun Check Google Ask your physical neighbors Ask your virtual neighbors Steal, do not invent! Large community with a strong culture of sharing Before writing something, see if someone else has Keep versions of things around: your Lab Notebook Explains how you got there In case you have to “go backwards” In case you accidentally delete tons of work

CS Professor (at another institution)

What would you do if you were not afraid? Facebook Rule #5 What would you do if you were not afraid?

A Few Caveats You can do almost anything, but should you? Intellectual property restrictions on code Terms of Service restrictions on data providers Lots of personally-identifiable information (IRB) Computers allow you to make bigger mistakes more quickly Get a sense for how this work is received elsewhere Check with advisor(s)

The deliverable Find one interesting true thing to say about your group’s topic by one week from Thursday afternoon, and explain how you got there