Scale Fail or, how I learned to stop worrying and love the downtime.

Slides:



Advertisements
Similar presentations
Advanced Troubleshooting with Debug Diagnostics on IIS 6
Advertisements

Capacity Planning for LAMP Architectures John Allspaw Manager, Operations Flickr.com Web Builder 2.0 Las Vegas.
Advanced Oracle DB tuning Performance can be defined in very different ways (OLTP versus DSS) Specific goals and targets must be set => clear recognition.
Presentation Date Top Down Performance Management with OEM Grid Control Or how I learned to stop worrying and love OEM Grid Control 10/1/2010 John Darrah.
ITIS 3110 Jason Watson. Replication methods o Primary/Backup o Master/Slave o Multi-master Load-balancing methods o DNS Round-Robin o Reverse Proxy.
Network+ Guide to Networks, Fourth Edition
VxWorks Real-Time Kernel Connectivity
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
  Copyright 2003 by SPAN Technologies. Performance Assessments of Internet Systems By Kishore G. Kamath SPAN Technologies Testing solutions for the enterprise.
1 ICS 214B: Transaction Processing and Distributed Data Management Lecture 15: Data Replication with Network Partitions & Transaction Processing Systems.
Server Architecture Models Operating Systems Hebrew University Spring 2004.
Proxy Cache Leonid Romanovsky Olga Fomenko Winter 2003 Instructor: Konstantin Sinyuk.
Windows Azure for scalable compute and storage SQL Azure for relational storage for the cloud AppFabric infrastructure to connect the cloud.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
Performance Engineering and Debugging HPC Applications David Skinner
Google AppEngine. Google App Engine enables you to build and host web apps on the same systems that power Google applications. App Engine offers fast.
CS 4700 / CS 5700 Network Fundamentals Lecture 17.5: Project 5 Hints (Getting a job at Akamai) Revised 3/31/2014.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
I/O Systems ◦ Operating Systems ◦ CS550. Note:  Based on Operating Systems Concepts by Silberschatz, Galvin, and Gagne  Strongly recommended to read.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
Scalability By Alex Huang. Current Status 10k resources managed per management server node Scales out horizontally (must disable stats collector) Real.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
Network+ Guide to Networks, Fourth Edition Chapter 1 An Introduction to Networking.
IDK0040 Võrgurakendused I Building a site: Implementation Deniss Kumlander.
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
Ideas to Improve SharePoint Usage 4. What are these 4 Ideas? 1. 7 Steps to check SharePoint Health 2. Avoid common Deployment Mistakes 3. Analyze SharePoint.
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
Imperial College - Department of Computing Continuous Performance Testing in Virtual Time Nikos Baltas & Tony Field Department of Computing Imperial College.
ECE200 – Computer Organization Chapter 9 – Multiprocessors.
Twitter Copes with Web2.0rhea The Twits are catching up… Alex Carian ISM 158 May 6, 2010.
XOberon Operating System CLAUDIA MARIN CS 550 Fall 2005.
“Load Testing Early and Often” By Donald Doane Presentation to the Rockville MDCFUG.
Srik Raghavan Principal Lead Program Manager Kevin Cox Principal Program Manager SESSION CODE: DAT206.
Process Architecture Process Architecture - A portion of a program that can run independently of and concurrently with other portions of the program. Some.
1 MSRBot Web Crawler Dennis Fetterly Microsoft Research Silicon Valley Lab © Microsoft Corporation.
© 2002 Global Knowledge Network, Inc. All rights reserved. Windows Server 2003 MCSA and MCSE Upgrade Clustering Servers.
Cloud Computing is a Nebulous Subject Or how I learned to love VDF on Amazon.
NETWORK LOAD BALANCING (NLB) Microsoft Windows Server 2003 By Mohammad Alsawwaf ITEC452 Supervised By: Dr. Lee RADFORD UNIVERSITY.
Code Development for High Performance Servers Topics Multithreaded Servers Event Driven Servers Example - Game Server code (Quake) A parallelization exercise.
Lecture 27 Multiprocessor Scheduling. Last lecture: VMM Two old problems: CPU virtualization and memory virtualization I/O virtualization Today Issues.
(re)-Architecting cloud applications on the windows Azure platform CLAEYS Kurt Technology Solution Professional Microsoft EMEA.
NCBI Grid Presentation. NCBI Grid Structure NetCache NetSchedule Load Balancer (LBSM) Load Balancer (LBSM) Worker Nodes CGI Gateway.
Galaxy in Production Nate Coraor Galaxy Team Penn State University.
Using Correlated Tracing to Diagnose Query Level Performance What’s slowing down my app? Jerome Halmans Senior Software Development Engineer Microsoft.
BIG DATA/ Hadoop Interview Questions.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
No more waiting. Sponsors About me  Database Technology Specialist  MVP  Blogger  Author 3
MySQL HA An overview Kris Buytaert. ● Senior Linux and Open Source ● „Infrastructure Architect“ ● I don't remember when I started.
Performance Tuning Renegade
Understanding and Improving Server Performance
Deploying Web Application
Lecture 18: Scalable Web Services
Reactive Android Development
Affinity Depending on the application and client requirements of your Network Load Balancing cluster, you can be required to select an Affinity setting.
Service Restore Flow Receives/retrives input of list of server involved in the process Flow Performs multi level health check like process status, replication.
Nebula A cloud-based back end for
R-GMA as an example of a generic framework for information exchange
APA-OTS WordPress Multi-Site HTTPS Migration: a Case Study
Software Architecture in Practice
Distributed Systems - Comp 655
Scaling and Performance
SpiraTest/Plan/Team Deployment Considerations
ColdFusion Performance Troubleshooting and Tuning
Network+ Guide to Networks, Fourth Edition
First slide Rest of project 2 due next Friday Today:
Jean Joseph DBA\DEVELOPER
February 11-13, 2019 Raleigh, NC.
SEDA: An Architecture for Well-Conditioned, Scalable Internet Services
MapReduce: Simplified Data Processing on Large Clusters
Client/Server Computing and Web Technologies
Presentation transcript:

Scale Fail or, how I learned to stop worrying and love the downtime

Downtime is Sexy

1. Be Trendy

Resource Monitoring Performance Testing Traffic Monitoring Load Testing Tuning Analysis

Let History Be Your Guide Was database to slow last time? The database is too slow this time. Did the network cards fry last time? They're frying this time. Did Apache choke on memory leaks last time? It's Apache this time. Fixed things by using queues at your last job? Then fix them with queues at this job!

Barn Door Troubleshooting

Parallel Programming NOT SEXY

Don't Worry About It Program- ming SEXY

How to Single-Thread Your Code Locks, Locks, Locks! Ignore scope and memory address space Frequently-updated single-row table Master Control Server process Single Master Queue Blocking threads which wait while checking on a remote data source

The Old Way: Caching DB App Server Web Server Web Server Web Server Connection Caching Pool Data Cache Object Cache HTTP Cache

The New Way: Direct2DB DB Web Server Web Server Web Server HTTP Cache

Easy to Scale For Wimps! Web servers Caches Shared-nothing hosts Simple App Servers

Hard to Scale For Real Developers! Databases Queues Shared Filesystems Web Frameworks

SPoF

UserDB App Server Web Server Web Server Web Server PageDB Single Point of Failure Load Balancer Task Manager Load Balancer

UserDB App Server Web Server Web Server Web Server PageDB Cascading Failure Load Balancer Task Manager Load Balancer

Be Sexy: Go Down!