Introduction to Data Science – INFO 480 – Drexel University’s iSchool Sean P. Goggins, PhD April 30, 2013 Week Five.



What is Data Science?
- Storytelling
- Database Theory – How you organize your data has a big influence on what you can do with it.
- Agile Manifesto – Key thing is iterative development; it’s a technology value system.
- Spiral Dynamics – What we view as fact and what we desire emerges from the data presented to us.
Credit:

Tonight
- Share software for transformation on GitHub
- Share how you approached the assignment with the class (individually)
- Ask questions
- Make sure you understand everyone’s approach
- Help each other – The result, not the language or technique used to transform the data, is what matters
- Use the network scripts from week one to transform your transformed data (that’s right!) into networks
- Groups of 3

Week Five
- Software Sharing #1: share the scripts produced in week 3 using an open source software configuration management tool.
- Students will refine their scripts and then share them with other students.
- The assignment includes a 500-word explanation of how the script could be improved, optimized and adapted to other data of a similar type.
- The “read me” file distributed with the script will explain to another user how to apply the script to the data distributed in assignment one. This will include specific, technical specifications.

Using GitHub for Software Sharing
- Creating a GitHub Account
- Creating a GitHub Project
- Using the GitHub Desktop client
- Committing & Syncing
- The Pull Request
- Sharing Your Software!
- For my repository:
  - Create a directory with your name under “student Files”
  - Put your assignment in there
  - Create a “pull request”

Discuss Homework
Analysis Questions. Write up a short essay, with tables or graphs if needed, to describe how you would build a network from this sample data using the scripts from week 1: first from the mention connections, then from the Reply-To connections. What transformations are required? How would you filter the data? Use the actual data to ground your thinking. Feel free to write or modify the R code samples from the first two weeks to experiment. Some of you will be more comfortable doing this; some will be more comfortable addressing the question conceptually. This is OK.
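For anyone who wants to experiment before writing the essay, here is a minimal sketch of the kind of transformation the homework asks about: turning tweet records into a weighted, directed edge list. It is written in Python rather than the R used in the course samples, and it is not the week 1 script. The record layout (`user`, `text`, `in_reply_to`), the sample tweets, and the function names are all hypothetical; the real data will need its own column mapping and filtering.

```python
import re
from collections import Counter

# Hypothetical tweet records. The field names ("user", "text", "in_reply_to")
# are assumptions for illustration, not the columns of the course data set.
SAMPLE_TWEETS = [
    {"user": "alice", "text": "Great race! @bob @carol", "in_reply_to": None},
    {"user": "bob", "text": "@alice thanks", "in_reply_to": "alice"},
    {"user": "carol", "text": "RT @bob: what a finish", "in_reply_to": None},
    {"user": "alice", "text": "no mentions here", "in_reply_to": None},
]

# Simplistic pattern: an "@" followed by word characters. It will also match
# the domain part of an e-mail address, so real data may need a stricter rule.
MENTION_PATTERN = re.compile(r"@(\w+)")


def mention_edges(tweets):
    """Count directed (author -> mentioned user) edges."""
    edges = Counter()
    for tweet in tweets:
        source = tweet["user"].lower()
        for target in MENTION_PATTERN.findall(tweet["text"]):
            target = target.lower()
            if target != source:  # filter: drop self-mentions
                edges[(source, target)] += 1
    return edges


def reply_edges(tweets):
    """Count directed (author -> user being replied to) edges."""
    edges = Counter()
    for tweet in tweets:
        target = tweet.get("in_reply_to")
        if target:  # filter: skip tweets that are not replies
            edges[(tweet["user"].lower(), target.lower())] += 1
    return edges


if __name__ == "__main__":
    # Print one "source target weight" row per edge.
    for (source, target), weight in sorted(mention_edges(SAMPLE_TWEETS).items()):
        print(source, target, weight)
```

The two functions differ only in where the target comes from, which is the heart of the essay question: the mention network is parsed out of the text, while the reply network comes from a metadata field. Decisions such as lower-casing names, dropping self-mentions, or treating retweets separately are filtering choices worth justifying against the actual data.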

Individual Presentations Informally by you!

Remembering Networks

Underpants Gnomes With much discourtesy from the US TV Program “South Park”

Underpants Gnomes

Addressing The Underpants Gnome Postulate

Methodological Approach
Discussion Post Read Response
Classification
- Open Coding
- Axial Coding
Identification of Coordination Events
- Time proximity
- Topical proximity
Aggregation of Posts by Topic
Weighted Network Analysis of Interactions
- Weight Connections Based on Time Distance, Grouped By Topic and informed by analysis of time distance between posts.
Identify Key Information Brokers
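The weighting step, connections weighted by time distance and grouped by topic, could look something like the sketch below. This is only one possible reading of the slide. The decay function `1 / (1 + hours)`, the rule that each post links to the post immediately before it in the same topic, and the sample posts are all assumptions made for illustration; the slide does not specify any of them.

```python
from collections import defaultdict
from datetime import datetime

# Hypothetical posts as (author, topic, ISO timestamp) tuples.
POSTS = [
    ("ann", "grading", "2013-04-30T10:00:00"),
    ("ben", "grading", "2013-04-30T10:30:00"),
    ("ann", "grading", "2013-04-30T14:30:00"),
    ("cat", "github", "2013-04-30T11:00:00"),
    ("ben", "github", "2013-04-30T11:00:00"),
]


def time_weight(hours_apart):
    """Assumed decay: 1.0 for simultaneous posts, shrinking as the gap grows."""
    return 1.0 / (1.0 + hours_apart)


def weighted_edges(posts):
    """Link each post to the previous post in the same topic, weighted by time gap."""
    # Group posts by topic so that only topically related posts are connected.
    by_topic = defaultdict(list)
    for author, topic, stamp in posts:
        by_topic[topic].append((datetime.fromisoformat(stamp), author))

    edges = defaultdict(float)
    for topic_posts in by_topic.values():
        topic_posts.sort()  # chronological order (ties broken by author name)
        for earlier, later in zip(topic_posts, topic_posts[1:]):
            earlier_time, earlier_author = earlier
            later_time, later_author = later
            if earlier_author == later_author:
                continue  # ignore an author following up on their own post
            hours = (later_time - earlier_time).total_seconds() / 3600.0
            # Direction: the later poster is treated as responding to the earlier one.
            edges[(later_author, earlier_author)] += time_weight(hours)
    return dict(edges)


if __name__ == "__main__":
    for edge, weight in sorted(weighted_edges(POSTS).items()):
        print(edge, round(weight, 3))
```

Summing the weights on each person's edges gives a rough first ranking of who sits in the middle of the conversation. Identifying information brokers more carefully would normally call for a network measure such as betweenness centrality computed on the resulting weighted graph.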

Network Transformation Activity

Week Six
Week 6: Sharing Data Preparation Results and Tools
Readings and Assignments Due:
- Presentation involves sharing data with other people in a way that is visually insightful. Students will be asked to bring an example of a visualization of data from a website or news organization, and make a short presentation about what makes the visualization insightful.
- Data Visualization Example Presentation
- Chapters 4-7 of “The Anarchist in the Library: How the Clash Between Freedom and Control is Hacking the Real World and Crashing the System”.