2017 OTN-TOOLBOX Presented by Marta Mihoff and Alex Nunes

Slides:



Advertisements
Similar presentations
CTER Orientation Tutorial The Use of the WebBoard.
Advertisements

Using the SmartPLS Software
Using Basic FormulasUsing Basic Formulas Lesson 4 © 2014, John Wiley & Sons, Inc.Microsoft Official Academic Course, Microsoft Word Microsoft Excel.
HQ Workshop 2014 OTN SandBox Presented by Marta Mihoff OTN Database/Data Process Manager.
Windows XP Basics OVERVIEW Next.
Course Introduction and Getting Started with C 1 USF - COP C for Engineers Summer 2008.
® IBM Software Group © 2006 IBM Corporation The Eclipse Data Perspective and Database Explorer This section describes how to use the Eclipse Data Perspective,
1 Excel Lesson 3 Organizing the Worksheet Microsoft Office 2010 Introductory Pasewark & Pasewark.
Lecture Excel: Macros & Pivot Tables. Macros A macro is a series of commands that are stored and can be run whenever you need to perform the task.
P366: Lecture #1 Use of Excel for analysis Lei Chen, MD Jan 6, 2002.
1 Working with MS SQL Server Textbook Chapter 14.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 Working with MSSQL Server Code:G0-C# Version: 1.0 Author: Pham Trung Hai CTD.
McGraw-Hill/Irwin The Interactive Computing Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Excel 2002 Lesson 1 Introduction.
Go to your school’s web locker site school name.schoolweblockers.com) Your user name is the first letter of your first name, the first 4.
PowerTeacher with Web Grade Book Semester Classes School Year August 2012.
Select Reports Console. Type in Progress, Click Search.
Chapter 17 Creating a Database.
OTN Workshop 2015 OTN SandBox Presented by Marta Mihoff OTN Database/Data Process Manager.
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 19 Organizing and Enhancing Worksheets 1 Morrison / Wells / Ruffolo.
MS Excel Lesson 1. Starting Excel Excel opens to a list of templates and in most cases you choose Blank workbook or open a previous file. Think of a workbook.
OTN Workshop 2014 OTN SandBox Presented by Marta Mihoff OTN Database/Data Process Manager.
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
2016 OTN-TOOLBOX Presented by Marta Mihoff and Alex Nunes Assisted by Brian Jones, Sean Carey, Sara Colborne, Lenore Bajona.
Reporter Training for High School RIO TM
McGraw-Hill/Irwin The Interactive Computing Series © 2002 The McGraw-Hill Companies, Inc. All rights reserved. Microsoft Excel 2002 Using Macros Lesson.
How to create a website from scratch.  You should have an internet access.  Visit  You need to create a new account OR.
AdisInsight User Guide July 2015
Innotas Reports, Dashboards, and Filters
Note to trainers The videos in this presentation provide closed-caption files, and they’re on by default because it’s often easier for viewers of all.
Windows 7 and file management
Helping Yourself in PD2 SPS Spotlight Series July 2015.
Development Environment
Web-based Information Science Education
TEST PREP SEMINARS 2017 PayPal Buttons Lets Begin!
Excel: Macros & Pivot Tables
CMS I – BASIC WEB EDITING INTRODUCTION TO THE CMS
Release Numbers MATLAB is updated regularly
Setting up Categories, Grading Preferences and Entering Grades
Practical Office 2007 Chapter 10
Focal Point Responsibilities
Lesson 3: Customizing Document Elements
Data-Basics Training & Support
An Introduction to Computers and Visual Basic
Basic Training May 2016.
Microsoft Excel 2003 Illustrated Complete
CCA Skill Certification
EZ Speed with SINAMICS S120 Starter Script File
A Course Management System for Math Teachers
Other Features – Filter Options
Creating and Modifying Queries
Social Media And Global Computing Introduction to Visual Studio
CIS16 Application Development Programming with Visual Basic
SSI Toolbox Status Workbook Overview
Microsoft Excel 101.
Managing Rosters Screener Training Module Module 5
Training course Part 2: Administration tasks
Microsoft Official Academic Course, Access 2016
Microsoft Official Academic Course, Microsoft Word 2013
Reporter Training for High School RIOTM
Innotas Reports, Dashboards, and Filters
Grauer and Barber Series Microsoft Access Chapter One
Running a Java Program using Blue Jay.
Review of Previous Lesson
Lab 8: GUI testing Software Testing LTAT
Unit J: Creating a Database
A brief introduction to the nutrient tool-kit, getting R Studio to work and checking the data Martyn Kelly
Advanced Speed Control with SINAMICS S120 Starter Script File
Create, Upload and Use Data Extensions (Lists)
Presentation transcript:

2017 OTN-TOOLBOX Presented by Marta Mihoff and Alex Nunes Assisted by Brian Jones, Jon Pye, Lenore Bajona More informal. We are going to work today.

Start up Toolbox Open CMD window Navigate to the install folder (Desktop/OTN-toolbox) Execute command “vagrant up” This should remain visible. Maybe with a scribble poster.

URLS New R-Studio (user change to vagrant pw otn123) R notebooks http://localhost:18787/auth-sign-in New R notebooks http://localhost:18888/tree/r_notebooks Python notebooks http://localhost:18888/tree/py_notebooks This should remain visible. Maybe with a scribble poster.

Rstudio Changes Cosmetic only User changed to “vagrant”, password is the same “otn123” Removed the Virtual Machine GUI which none of you will notice File structure: programs are in folder “otn-toolbox” “data” folder accessible from inside “otn-toolbox” or on its own. While we are waiting for everyone to get there toolboxes started I’ll give you a brief overview of what is changed and new

File Structure Home folder otn-toolbox folders Some new “notebooks” folders which you should ignore

Code Exists in folders The code is PUBLIC. You can see the code and change it in any way you want Changing files in these folders could break everything. You can recover by installing a new copy Recommend you change “copies”

DATA Folder The “data” folder exists independently from all the code It is accessible from RStudio or from Desktop/OTN-Toolbox NEVER delete or rename the data folder Copy files into the data folder to make them accessible to programs. In RStudio files should be saved into the “data” folder Folders will be lost or overwritten on an update if not in “data” folder.

R and PY Notebooks New wrappers for same code executed from RStudio GUI May find easier to use r-notebooks offer same set of functions available in Rstudio py-notebooks offer same set plus new functions In future all new functions developed will be done for py-notebooks

New Tools Available in PY-Notebooks only data_subsetting.ipynb Creates a subset of an input file based on a date range or a column value Useful when input file and run time are extremely large and long residence_index.ipynb Offers four methods to choose from. Mix and Match. interactive_residence_index.ipynb same as previous, different map visual_detection_timeline.ipynb Creates an interactive time series from a detection file.

File Preparation OTN detection extracts are ready to go as is. VUE CSV export needs preparation: Latitude and longitude columns must be filled in Rename column receiver  station Rename column transmitter  catalognumber Rename column datetime  datecollected Column unqdetecid can be added with function add_uniquecid Data Subset If your file is very large use the subset tool: py_notebooks/data_subsetting.ipynb The data subset tool and add unique id tools are self explanatory . Stick your hand up if you need help.

Data Sub-setting: Demo http://localhost:18888/notebooks/py_notebooks/data_subsetting.ipynb

Notebook: Execution Current cell is highlighted with a blue or green bar on LHS. When a cell is highlighted clicking the run button will execute the code in the cell.

Exercise: Filter suspect detections (45 min) Copy your detection file into your “data” folder Choose one of the three urls In py-notebooks open load_and_filter_detections.ipynb In r-notebooks open filter_driver.ipynb In RStudio open filter_driver.r Need to do this to get a distance matrix Everyone needs to do the first step of this one, to get a distance matrix

Filter tool What to fill in These are the parameters you need to fill in Filename detection_radius (use 400)

Filter Tool Output Step 1 File of suspect detections File of calculated distances between stations (Distance Matrix) Step2 File of filtered detections (suspects removed) Distance Matrix (adjusted) Look at records 14 to 17 in suspect file Look at distance matrix

Exercise: Interval or Cohort data (15 min) For Interval data (one step) In py-notebooks or r-notebooks open interval_data_driver.ipynb In RStudio open interval_data_driver.r For Cohort Data (two steps) In py-notebooks open detection_compression.ipynb first then cohort_data.ipynb In r-notebooks open compress_driver.ipynb first then cohort_driver.ipynb In RStudio open compress_driver.r then cohort_driver.r Choose which one your data is more appropriate or interesting for you

Interval/ Cohort What to fill in Interval: use outputs from Filter step detection_file <- 'detections.csv' #Detection file input name distance_matrix <-'detections_distance_matrix_v00.csv‘ OR for Cohort Compression: detection_file <- 'detections.csv‘ Cohort (need output from compression step) time_interval <- 6 compressed_file <- 'compressed_detections.csv' File names will appear in messages. Cut them from messages and paste into current look. You will need the .csv suffix

Residence / Visual Timeline http://localhost:18888/notebooks/py_notebooks/residence_index.ipynb http://localhost:18888/notebooks/py_notebooks/visual_detection_timeline.ipynb

Teach yourself to program Free open software Extremely powerful Standardized Python Python(x,y): rival to MATLAB and Rstudio PostgreSQL One of the best things you can do to further your career is teach yourself to program You will be way ahead of your colleages who have not bothered to do this You may think you don’t have time But consider how much time you would spend doing these very simple, common, everyday tasks if we had not provided these programming solutions. With some basic programming skills much more complex questions can be answered In reality you do not have time not to learn to program

How? Coursera and Code Academy Code Academy Python course: http://www.codecademy.com/en/tracks/python Rice University: An Introduction to Interactive Programming in Python Next session Sep 15 https://www.coursera.org/course/interactivepython University of Michigan: Programming for Everybody Next Session Oct 6 https://www.coursera.org/course/pythonlearn Johns Hopkins: R Programming Part of the "Data Science" Specialization Next session Oct 6 https://www.coursera.org/course/rprog There are some wonderful online courses. For these ones no programming experience is required. Coursera is an education platform that partners with top universities and organizations worldwide, to offer courses online for anyone to take, for free. I can attest to the quality of instructors and course sylabus They are first class These ones listed do not require any programming experience

Python solutions for common Science questions. Data Science from Scratch Joel Grus O’Reilly Media Inc 2015