Foundations of Data Science


JupyterHub in Data Science Instruction
John DeNero, Ryan Lovett, Jeff Anderson-Lee, et al.
Cloud-Hosted Browser-Based Software as a Service for Data Science Instruction

Other Features
Notebooks are delivered to students via nbinteract, a notebook extension that clones notebooks from a git repository into the students' accounts. Students initiate this process by clicking "INTERACT" buttons inside the Data 8 online textbook at inferentialthinking.com.
Data 8 uses okpy for grading, in conjunction with Gradescope.
We have also developed a handful of other tools:
nbserverproxy, a generic web service proxy written to accommodate pycortex's WebGL-based views;
nbrsessionproxy, which proxies RStudio Server sessions launched from Jupyter;
nbgdrive, which backs up notebooks to Google Drive.

Foundations of Data Science
Foundations of Data Science is an innovative introduction to core concepts of computer programming and statistics using Jupyter notebooks and a custom Python library. The Foundations course is complemented by other “connector courses”, which introduce diverse subjects through the lens of data science:
CIV ENG 88B • COGSCI 88 • CS 88 • ESPM 88A • ESPM 88B • GEOG 88 • HIST 88 • INFO 88 • L&S 88-5 • Legal Studies 88 • MCB 88 • STAT 88 • STAT 89A • CS/Stat C100 • Stat 140 • Stat 28

Jupyter Notebooks
“Notebook documents [contain] the inputs and outputs of a[n] interactive session as well as additional text that accompanies the code but is not meant for execution. In this way, notebook files can serve as a complete computational record of a session, interleaving executable code with explanatory text, mathematics, and rich representations of resulting objects. These documents are internally JSON files and are saved with the .ipynb extension. Since JSON is a plain text format, they can be version-controlled and shared with colleagues.”

JupyterHub
Hosting notebooks in JupyterHub eliminates the need to replicate software installation on personal or lab devices. Students only need to visit a website to be granted access to their own pre-built instance of Jupyter. It also reduces the demand on physical lab resources, gives each student the same starting point, provides results that are more reproducible, and, more importantly, frees up time for both learning and teaching.

Future Directions
Scale up (and down) to support more students using auto-scaling in Kubernetes.
Enable instructors to more easily create custom class hubs.
Add more Google Drive or Box integration for notebook storage and distribution.
Provide roster-based account provisioning via the okpy and Canvas APIs.

Your Courses
Are you interested in using Jupyter notebooks for your course, workshop, or intensive? We'd like to hear about it! Please contact: ds-infrastructure@lists.berkeley.edu

Deployments
The pilot course in Fall 2015 had 80 students using Intel-donated hardware. In Spring 2016, 450 students were hosted on Microsoft-donated Azure nodes. These deployments were based on Jessica Hamrick's JupyterHub for COGSCI 131. In Spring 2017 we migrated from Docker Swarm to Kubernetes on Google Compute Engine. The new software stack lets us scale easily, and we quickly deployed three hubs for 900 students.

Further Reading
http://data.berkeley.edu
http://data8.org/
https://www.inferentialthinking.com/
https://github.com/data-8/jupyterhub-k8s
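
Example: a notebook file as plain JSON
The poster's quoted description of notebook documents says that .ipynb files are internally JSON. The sketch below illustrates that point with Python's standard json module only. The notebook content is invented for illustration and the summarize helper is hypothetical, not part of Jupyter; the field names follow the version 4 notebook layout as commonly documented, which is assumed here rather than taken from the poster.

```python
import json

# A hand-written, minimal notebook in the version 4 layout.
# The cell contents are invented purely for illustration.
notebook = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {},
    "cells": [
        {
            "cell_type": "markdown",
            "metadata": {},
            "source": ["# Example heading\n", "Some explanatory text."],
        },
        {
            "cell_type": "code",
            "execution_count": 1,
            "metadata": {},
            "source": ["1 + 1"],
            "outputs": [
                {
                    "output_type": "execute_result",
                    "execution_count": 1,
                    "metadata": {},
                    "data": {"text/plain": ["2"]},
                }
            ],
        },
    ],
}


def summarize(nb_text):
    """Hypothetical helper: list (cell type, source text) for each cell."""
    nb = json.loads(nb_text)
    return [(cell["cell_type"], "".join(cell["source"])) for cell in nb["cells"]]


# Serializing gives the plain text that would be stored in an .ipynb file.
nb_text = json.dumps(notebook, indent=1)
summary = summarize(nb_text)
for kind, source in summary:
    print(kind, repr(source))
```

Because the serialized form is ordinary text, it can be committed to a git repository and diffed like any other source file, which is what makes the git-based distribution described under Other Features possible.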