4CeeD: Private Cloud and Data Cyber-Infrastructure for Scientific Instruments Steve Konstanty, Senior Research Programmer, CSL.



Overview
- What are the 4CeeD Storage Facility and 4CeeD Services?
- Why are 4CeeD Services important to MRL/MNTL researchers?
- How do MRL/MNTL researchers work with 4CeeD Services?
- How do MRL/MNTL researchers sign up for 4CeeD Services?

4CeeD Storage Facility and Services Goals
Address scientific digital data acquisition, curation, and sharing prior to scientific publication of results, via a private cloud storage facility.

Scientific Digital Data Acquisition and Workflow Challenges
- Data come in various types and multimodal formats.
- Each type requires a different data processing workflow.
[Figure: sample output data from SEM microscopy, alongside a process diagram. Process labels: SiO2 mask deposition, diffusion, plasma etching, oxidation, device characterization, metallization, lithography, SiNx deposition, SiNx removal. Characterization labels: profilometry, ellipsometry, SIMS, SEM, optical microscopy, SPA.]
What are the main challenges for building an advanced cyberinfrastructure for long-tail scientific data?
- Long-tail data is often small or medium in size, but heterogeneous and multimodal in format.
- Analysis of long-tail data often requires real-time response.

4CeeD System Architecture
[Figure: architecture diagram. Labels: scientific data input/output, scientific devices, user office, Uploader, Curator, client, LAN/WAN, Uploader/Curator server, Task/Resource Coordinator service, private cloud compute/storage services.]
github.com/4ceed

4CeeD Private Cloud Compute/Storage Service
What we looked for:
- Redundancy
- Availability
- Scalability
Storage layer (4CeeD Cloud Storage):
- 40 TB (20 TB per investor)
- Replicated for redundancy
Compute layer (4CeeD Cloud Compute):
- Docker container orchestration (Kubernetes)
- Single master (highly available masters in future)

4CeeD Uploader Service (Simple and Speed-Up Usage at Microscopes)
Simple steps, with support for advanced usage:
1. Choose or select a collection.
2. Load a template and enter user-defined metadata to create a dataset.
3. Upload files to the cloud coordinator.
Accurately and quickly record metadata through the use of templates. Users can create, edit, and share templates.
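The three uploader steps can be sketched in Python as follows. This is only an illustration of the template idea, not 4CeeD's actual API: the `Template` and `Dataset` classes, the `create_dataset` helper, and all field names and values are hypothetical, and no upload is performed.

```python
from dataclasses import dataclass, field


@dataclass
class Template:
    """A reusable recipe listing the metadata fields a user fills in (hypothetical)."""
    name: str
    required_fields: list[str]


@dataclass
class Dataset:
    """A dataset inside a collection, carrying user-defined metadata and file names."""
    collection: str
    metadata: dict[str, str]
    files: list[str] = field(default_factory=list)


def create_dataset(collection: str, template: Template, values: dict[str, str]) -> Dataset:
    """Step 2: build a dataset from a template, rejecting incomplete metadata."""
    missing = [name for name in template.required_fields if not values.get(name)]
    if missing:
        raise ValueError(f"missing metadata fields: {', '.join(missing)}")
    return Dataset(collection=collection, metadata=dict(values))


# Step 1: choose a collection. Step 2: load a template and enter metadata.
sem_template = Template(name="SEM imaging", required_fields=["sample_id", "operator", "beam_kv"])
dataset = create_dataset(
    "Etch study",
    sem_template,
    {"sample_id": "S-017", "operator": "jdoe", "beam_kv": "15"},
)

# Step 3: record the files that would be uploaded (no network call is made here).
dataset.files.extend(["S-017_001.tif", "S-017_002.tif"])
```

Because the template is a plain object, it could be shared and reused across users, which is the property the slide emphasizes.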

4CeeD Curator Service (Speed-Up Curation)
- File view
- Dashboard view
- Previewer
- Download, delete, or subscribe for updates
- Hierarchy of files
- Add tags
- Owner information
- Example of system-generated metadata (cropped from a list of hundreds)
[Screenshot captions: preview, annotate, download, extracted metadata; dashboard management]

4CeeD Smart Data Management
The 4CeeD Data Model organizes projects into collections, datasets, and files. These can then be shared in spaces.
1. Picture of the process used by researchers to collect data from a sample.
2. Example of how the 4CeeD data model stores data and metadata. There is no reason for users to have to convert their data now.
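The hierarchy described above (spaces sharing collections, which hold datasets, which hold files) can be pictured with a few Python dataclasses. This is a rough sketch under assumptions: the class names mirror the slide's vocabulary, but the attributes, the helper method, and the sample file name are hypothetical and not taken from the 4CeeD code base.

```python
from dataclasses import dataclass, field


@dataclass
class DataFile:
    """A file kept in its original format, paired with its metadata."""
    name: str
    metadata: dict[str, str] = field(default_factory=dict)


@dataclass
class Dataset:
    name: str
    files: list[DataFile] = field(default_factory=list)


@dataclass
class Collection:
    name: str
    datasets: list[Dataset] = field(default_factory=list)

    def file_names(self) -> list[str]:
        """Flatten the hierarchy to list every file name in this collection."""
        return [f.name for d in self.datasets for f in d.files]


@dataclass
class Space:
    """A sharing context that exposes collections to a group of users."""
    name: str
    collections: list[Collection] = field(default_factory=list)


# Build a small hierarchy: one space, one collection, one dataset, one file.
scan = DataFile("film_A_scan1.raw", {"instrument": "SEM"})
collection = Collection("Thin films", [Dataset("Film A", [scan])])
space = Space("Research group", [collection])
```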

Why 4CeeD?
An easy-to-use tool for collecting data and metadata for storage and sharing with a research group.

Current Situation in MRL Facilities
- Experimental data currently relies on manual processes for data capture and storage, leading to poor documentation of results.
- Data transfer is often done via “sneaker-net” techniques using flash drives or email.
- No data file conversion is available.
- “Best” results and images are kept, but what is “best” is determined by a narrow, specific scientific objective.
- “Imperfect” data is often discarded or not available for others to review.

Current State of Data Capture
Workflow:
1. Fabricate experimental sample
2. Prepare analytical sample
3. Bring sample to instrument for analysis
4. Extract data (file conversion)
5. Transport data to office computer
6. Analyze data
7. Repeat per iteration
Problems:
- Limits of home analysis
  - Only what you have
  - Connect notes to data (after time delay)
  - Retroactive, not reactive
- ‘Sneakernet’
  - Security risk at both ends
  - Very limited transport space
  - Lost or forgotten flash drives
- Metadata loss
  - Excessive file name schemes
  - Manual notes
  - What metadata is important?
[Diagram labels: instrument (MRL/MNTL), flash drives, office]

4CeeD Data Capture
Workflow:
1. Fabricate experimental sample
2. Prepare analytical sample
3. Bring sample to instrument for analysis
4. Extract data (NO FILE CONVERSION)
5. Transport data to office computer (DIRECT)
6. Analyze data (REAL TIME)
7. Repeat per iteration
Benefits:
- Direct web interface
  - Highly secure data storage
  - Large transport volume
  - Easy to use and annotate
- Real time
  - Metadata paired with data
  - Reactive interpretation
  - Easy interpretation and searching
- Lossless data
  - Simpler file names
  - Automatic note taking
  - All metadata included
[Diagram labels: instrument (MRL/MNTL), laptop, collaborators, campus PC, office]

Access to 4CeeD in MRL
- Access to 4CeeD is through a web browser (https://4CeeD.illinois.edu).
- Metadata extraction is performed automatically.
- Data is stored on a RAID storage cloud maintained by Engr-IT.
- All researchers of MRL Facilities (and their collaborators) will be allowed access to the MRL space.

4CeeD advantages
- One-stop location for all of your research data
- Sharable between you, your advisor, and your research partners
- Visual interpretation of data without having to open specialty software
- Cloud based: you can access your data anywhere
- Provides machine-specific metadata on measurements for future replication of measurements
- Provides templates which allow for the entry of metadata specific to YOUR project

How does it work?
Web-based application for uploading, sharing, downloading, and working with data.
Extractors for: TEM/SEM/AFM/SIMS/XRAY
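One common way to organize per-instrument extractors is a registry that maps an instrument type to a function. The sketch below assumes that design purely for illustration; the slides do not describe how 4CeeD's extractors are implemented. The `register` decorator, the `extract_metadata` helper, and the placeholder SEM extractor are all hypothetical.

```python
from typing import Callable

# An extractor takes raw file bytes and returns metadata key/value pairs.
Extractor = Callable[[bytes], dict[str, str]]

# Registry keyed by upper-cased instrument type, e.g. "SEM" (hypothetical design).
EXTRACTORS: dict[str, Extractor] = {}


def register(instrument: str) -> Callable[[Extractor], Extractor]:
    """Decorator that adds an extractor to the registry under an instrument name."""
    def decorator(func: Extractor) -> Extractor:
        EXTRACTORS[instrument.upper()] = func
        return func
    return decorator


@register("SEM")
def extract_sem(raw: bytes) -> dict[str, str]:
    # Placeholder: a real extractor would parse the instrument's file header.
    return {"instrument": "SEM", "size_bytes": str(len(raw))}


def extract_metadata(instrument: str, raw: bytes) -> dict[str, str]:
    """Run the matching extractor, or return no metadata for unknown instruments."""
    extractor = EXTRACTORS.get(instrument.upper())
    return extractor(raw) if extractor else {}
```

With this layout, supporting another instrument type means registering one more function, without touching the upload path.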

Process of signing up and confirming through email. You must have an account to share.

Dashboard. File tree view. Copy, Move, Hover Preview, Delete (multi) coming soon.

View files in dataset. Share, copy, download.

View dataset user metadata and make changes.

View file preview, and system extracted metadata. Apply tags if desired.

Select a collection. Create or use an existing dataset. Templates enable shareable and reusable recipes to collect user metadata. Drag files into the window.

For archive or complete automation, zip up your files, and we’ll take care of everything for you.
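Preparing such an archive can be done with Python's standard `zipfile` module. This is a minimal sketch of the zipping step only; the `bundle_for_upload` name is hypothetical, and what the service does with the archive after upload is not shown.

```python
import zipfile
from pathlib import Path


def bundle_for_upload(files: list[Path], archive: Path) -> Path:
    """Write the given files into one compressed zip archive and return its path."""
    with zipfile.ZipFile(archive, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for path in files:
            # Store each file under its bare name; files in different folders
            # that share a name would collide, so keep names unique.
            zf.write(path, arcname=path.name)
    return archive
```

For example, `bundle_for_upload([Path("scan1.tif"), Path("notes.txt")], Path("session.zip"))` would produce a single `session.zip` to drag into the uploader, assuming those two files exist.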

Standard free text search. Faceted filtered search coming.

Create virtual spaces to share out data and assign role-based permissions to users.
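Role-based permissions in a space can be illustrated with a small Python sketch. The role names, the actions attached to each role, and the `Space` class below are assumptions made for the example; the slides do not list the roles 4CeeD actually offers.

```python
from enum import Enum


class Role(Enum):
    VIEWER = "viewer"
    EDITOR = "editor"
    ADMIN = "admin"


# Hypothetical mapping from role to the actions it allows.
PERMISSIONS: dict[Role, set[str]] = {
    Role.VIEWER: {"view", "download"},
    Role.EDITOR: {"view", "download", "upload", "edit_metadata"},
    Role.ADMIN: {"view", "download", "upload", "edit_metadata", "manage_members"},
}


class Space:
    """A virtual space whose members each hold exactly one role."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.members: dict[str, Role] = {}

    def assign(self, user: str, role: Role) -> None:
        self.members[user] = role

    def can(self, user: str, action: str) -> bool:
        """Return True only if the user is a member whose role allows the action."""
        role = self.members.get(user)
        return role is not None and action in PERMISSIONS[role]


space = Space("MRL group")
space.assign("advisor", Role.ADMIN)
space.assign("student", Role.EDITOR)
space.assign("collaborator", Role.VIEWER)
```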

Advanced Materials Characterization Workshop June 6 & 7, 2017

Partner with Engineering IT for Maintenance
- 4CeeD private cloud for MRL/MNTL
- Production cloud with Engineering IT staff maintenance
- 40 TB cloud for materials research and semiconductor research at UIUC
- Purchase and installation started in Spring 2017
- Extensive testing during Spring/Summer 2017
- Educational materials for using the 4CeeD facility are available on the 4CeeD website: https://4ceed.github.io/ and http://t2c2.csl.illinois.edu/
- Workshops on using the 4CeeD facility will be offered in Summer and Fall 2017
- 4CeeD Services are available now
- 4CeeD Services will be in full production mode in August 2017 (beginning of Fall semester)
- 24 Dev Ops Center, the full power and support of Engineering IT
- Upcoming 4CeeD workshops and educational materials
SIGN UP NOW!!!