TNPM v1.3 Flow Control. 2 High Level Instead of each component having flow control settings that govern only its directory, we now have a set of flow.

Slides:



Advertisements
Similar presentations
Networking Essentials Lab 3 & 4 Review. If you have configured an event log retention setting to Do Not Overwrite Events (Clear Log Manually), what happens.
Advertisements

FILEMAKER SERVER SOFTWARE & REMOTE ADMINISTRATION
Lectures on File Management
Unauthorized Reproduction Prohibited SkyPoint Alarm Integration Add-On Using OnGuard Alarms to create events in SkyPoint Also called ‘SkyPoint V0’ CR4400.
Chapter 12 - Backup and Disaster Recovery1 Ch. 12 – Backups and Disaster Recovery MIS 431 – Created Spring 2006.
Managing Your Mailbox Facilities IS Presents:. Is your mailbox getting too big? Managing Your Mailbox An overstuffed mailbox can cause problems. You won’t.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 7: Advanced File System Management.
Hands-On Microsoft Windows Server 2003 Networking Chapter 7 Windows Internet Naming Service.
Hands-On Microsoft Windows Server 2003 Administration Chapter 6 Managing Printers, Publishing, Auditing, and Desk Resources.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 7: Advanced File System Management.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
Backup and Recovery Part 1.
Check Disk. Disk Defragmenter Using Disk Defragmenter Effectively Run Disk Defragmenter when the computer will receive the least usage. Educate users.
Chapter 5 Configuring the RMAN Environment. Objectives Show command to see existing settings Configure command to change settings Backing up the controlfile.
Agenda  Overview  Configuring the database for basic Backup and Recovery  Backing up your database  Restore and Recovery Operations  Managing your.
1 Chapter Overview Creating User and Computer Objects Maintaining User Accounts Creating User Profiles.
Sharepoint Portal Server Basics. Introduction Sharepoint server belongs to Microsoft family of servers Integrated suite of server capabilities Hosted.
NovaBACKUP 10 xSP Technical Training By: Nathan Fouarge
Module 8: Designing Active Directory Disaster Recovery in Windows Server 2008.
CSC103: Introduction to Computer and Programming
Basic Concepts of Computer Networks
Troubleshooting Guide for Network Hard Disk. Model - NH-200.
Batch Import/Export/Restore/Archive
Course 6425A Module 9: Implementing an Active Directory Domain Services Maintenance Plan Presentation: 55 minutes Lab: 75 minutes This module helps students.
Test Review. What is the main advantage to using shadow copies?
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 7: Advanced File System Management.
Love V4.0 a GREAT freeware program available as app Love V4.0 a GREAT freeware program available as app.
Lesson 7-Creating and Changing Directories. Overview Using directories to create order. Managing files in directories. Using pathnames to manage files.
Copyright ®xSpring Pte Ltd, All rights reserved Versions DateVersionDescriptionAuthor May First version. Modified from Enterprise edition.NBL.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 7: Advanced File System Management.
V 0.1Slide 1 Security – System Configuration How to configure WebSAMS? Access Control Other Information Configuration  system customization  system configuration.
Chapter 7 Making Backups with RMAN. Objectives Explain backup sets and image copies RMAN Backup modes’ Types of files backed up Backup destinations Specifying.
Managing User Accounts. Module 2 – Creating and Managing Users ♦ Overview ► One should log into a Linux system with a valid user name and password granted.
Problem Determination Your mind is your most important tool!
Sliding Window. Sliding window - Sender side Cumulative Acknowledgments Not sentSent, no ACKACK:edFree Sending buffer at the sender: Old data sent that.
Linux Operations and Administration
BZUPAGES.COM Disk Quotas on Microsoft Windows Server 2003 BS-IT 6 th Proudly Presents.
FTP Server and FTP Commands By Nanda Ganesan, Ph.D. © Nanda Ganesan, All Rights Reserved.
Installation Overview Lab#2 1Hanin Abdulrahman. Installing Ubuntu Linux is the process of copying operating system files from a CD, DVD, or USB flash.
1 Interface Two most common types of interfaces –SCSI: Small Computer Systems Interface (servers and high-performance desktops) –IDE/ATA: Integrated Drive.
Chapter 4. INTERNAL REPRESENTATION OF FILES
Computer Emergency Notification System (CENS)
Module 7: Resolving NetBIOS Names by Using Windows Internet Name Service (WINS)
CS 149: Operating Systems April 9 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
Introduction to AFS IMSA Intersession 2003 AFS Servers and Clients Brian Sebby, IMSA ‘96 Copyright 2003 by Brian Sebby, Copies of these.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
This document gives one example of how one might be able to “fix” a meteorological file, if one finds that there may be problems with the file. There are.
1 © 2003, Cisco Systems, Inc. All rights reserved. Proprietary and Confidential Unity Connection 7.0(1) Messaging Redundancy TOI July 2008 Jane Rygg
The Diagnostic Pathfinder System Introduction Getting Started.
File Systems cs550 Operating Systems David Monismith.
SQL SERVER 2008 Installation Guide A Step by Step Guide Prepared by Hassan Tariq.
FTP COMMANDS OBJECTIVES. General overview. Introduction to FTP server. Types of FTP users. FTP commands examples. FTP commands in action (example of use).
Integrity Check As You Well Know, It Is A Violation Of Academic Integrity To Fake The Results On Any.
CSC414 “Introduction to UNIX/ Linux” Lecture 6. Schedule 1. Introduction to Unix/ Linux 2. Kernel Structure and Device Drivers. 3. System and Storage.
Lecture 02 File and File system. Topics Describe the layout of a Linux file system Display and set paths Describe the most important files, including.
1 Chapter Overview Understanding Shared Folders Planning, Sharing, and Connecting to Shared Folders Combining Shared Folder Permissions and NTFS Permissions.
MINIX Presented by: Clinton Morse, Joseph Paetz, Theresa Sullivan, and Angela Volk.
HSC IT Center Training University of Florida Novell GroupWise: Reducing your Mailbox Size Health Science Center IT Center – Training
ALSMS Upgrade Configuration Example Alcatel-Lucent Security Products Configuration Example Series.
Your current Moodle 1.9 Minimum Requirements Ability to do a TEST RUN! Upgrading Moodle to Version 2 By Ramzan Jabbar Doncaster College for the Deaf By.
CACI Proprietary Information | Date 1 PD² v4.2 Increment 2 SR13 and FPDS Engine v3.5 Database Upgrade Name: Semarria Rosemond Title: Systems Analyst, Lead.
Linux Filesystem Administration
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 7: Advanced File System Management.
Is 221: Database Administration
Introduction to Computers
Intuit has launched QuickBooks File Doctor tool (QBFD) in QuickBooks File Doctor is a tool that has been designed to recover the damaged company.
Performing Database Recovery
File System Management
Overview Multimedia: The Role of WINS in the Network Infrastructure
Internal Representation of Files
Presentation transcript:

TNPM v1.3 Flow Control

2 High Level Instead of each component having flow control settings that govern only its directory, we now have a set of flow control settings for each datachannel root directory including all components that live in that directory Components no longer monitor their own space usage. Instead, inside the AMGR there is a Disk Usage Server (DUS) that monitors the space for each datachannel root directory on that host Components ask the DUS if there is enough space to write to disk and stop processing when there is not enough space When the overall space consumed in a datachannel root directory becomes too low, the DUS tells all components that live in that root directory to free up some space (or all available space) Components try harder to not overuse space by only acquiring a few hours of data before processing it and stopping when there are a few hours of data waiting to be picked up

3 High Level Components can still become flow controlled (stopped) because there is not enough space or the quota for the datachannel root directory has been exceeded Components still store old data that is no longer needed in their done directory and delete this data when more space is required

4 Flow Control Overview AMGR DiskUsageServer /dc /dc/CME.1.1 /dc/FTE.1.1 /dc/LDR.1 /dc/DLDR.1 CME.1.1 DiskUsageClient FTE.1.1 DiskUsageClient LDR.1 DiskUsageClient DLDR.1 DiskUsageClient Components ask DUS if they can use more disk space DUS tells components to free disk space when necessary

5 Managing Consumed Space When disk consumption is < 80%  DiskUsageServer will continue to answer yes to space requests When disk consumption is >= 80%  DiskUsageServer will contact all components who reside in this root directory and tell them to free up some space as they see fit. For example, each component may delete only 5 hour directories or only 50 files, etc.  DiskUsageServer will continue to answer yes to space requests

6 Managing Consumed Space When disk consumption is >= 90%  DiskUsageServer will contact all components who reside in this root directory and tell them to free up all space that they can  DiskUsageServer will answer no to space requests which will stop all components (except LDR & DLDR components) in this root directory. LDR and DLDR components are allowed to run because the system cannot unblock itself unless these components run. The LDR and DLDR components are given 9% of the total quota to operate and load data which can unblock the system if there are no errors happening. When disk consumption is >= 99%  DiskUsageServer will answer no to space requests from LDR & DLDR components in this root directory

7 Managing Free Space When free disk space <= FS_LL  DiskUsageServer will contact all components who reside in this root directory and tell them to free up all space that they can  DiskUsageServer will answer no to space requests which will stop all components in this root directory

8 Good Citizen Components try to behave as good citizens by:  Only acquiring and buffering a few hours of data in advance in their do directory (default if 4 hours). Can be configured at the component level by modifying FC_MAX_DO_HOURS  Only producing a few hours of data in their output directory and stopping if this data is not picked up by downstream components (default is 4 hours) Can be configured at the component level by modifying FC_MAX_OUTPUT_HOURS  Honoring their retention interval and only keeping a certain number of hours of data in the done directory even if space is available. This has not changed from the previous release. Can be configured at the component level by modifying FC_RETENTION_HOURS

9 Supported Configurations  Single datachannel root directory  Component directories on the same disk (not mounted or linked) Disk 1 Datachannel Root Datachannel Root FTE.1

10 Supported Configurations  Multiple datachannel root directories (can be on different disks)  Component directories are NOT mounted or linked  Can create a root directory for each channel or for all FTEs or any other organization you choose Disk 1 Datachannel Root 1 Datachannel Root 1 FTE.1.1 Disk 2 Datachannel Root 2 Datachannel Root 2 FTE.2.1

11 New Restrictions Previously if you were running low on disk space you could mount or link a component directory (say CME.1.1) from another file system. This is no longer allowed. Instead of mounting or linking a component directory, you can mount another datachannel root directory and put some components in this new datachannel root directory. This new datachannel root directory must have its own DUS configuration settings.

12 Unsupported Configurations  Datachannel root and component directories are on different disks  To do this they use mounted or linked component directories  This is NOT SUPPORTED and will cause problems Disk 1 Datachannel Root 1 Datachannel Root 1 FTE.1.1 Disk 2 link or mount

13 Example DUS Configuration AMGR.DC1C.DUS.1.FC_FSLL= AMGR.DC1C.DUS.1.FC_QUOTA= AMGR.DC1C.DUS.1.LOCAL_ROOT_DIRECTORY=/opt/datachannel AMGR.DC1C.DUS.1.REMOTE_PASSWORD=CACCDHDBCCCJ AMGR.DC1C.DUS.1.REMOTE_ROOT_DIRECTORY=/opt/datachannel AMGR.DC1C.DUS.1.REMOTE_USERNAME=pvuser AMGR.DC1C.DUS.1.USE_SECURE_FILE_TRANSFER=TRUE AMGR.DC1C.DUS.1.PORT_NUMBER=21

14 DUS Configuration Settings – FC_FSLL is the free space low limit. When the disk has less than this amount of space available (in bytes), components will become flow controlled (stopped) – FC_QUOTA is the amount of space (in bytes) you wish to allocate to the components running in this datachannel root directory. – LOCAL_ROOT_DIRECTORY is the full local path to the datachannel root directory – REMOTE_ROOT_DIRECTORY is the path to the datachannel root directory when accessing this directory via ftp or sftp – REMOTE_USERNAME is the username to use when accessing this datachannel root directory via ftp or sftp – REMOTE_PASSWORD is the password to use when accessing this datachannel root directory via ftp or sftp – USE_SECURE_FILE_TRANSFER allows you to say that you want to use sftp when accessing this datachannel root directory from another host – PORT_NUMBER is the port number to use for ftp or sftp

15 DUS Configuration in Topology Editor

16 Log Messages V1: UTC AMGR.DC1C- 4673:8272 FLOW_CTRL_STATE 1 Dir=/opt/datachannel Actual free space = 416,288,768 Free space low limit = 150,000,000 Actual consumed space = 237,341,696 Space quota = 2,800,000,000 Consumed space calc milliseconds =91 The DUS inside AMGR will log this message so you can see how much space is currently used and available on the filesystem

17 Log Messages UTC DG :2515 FLOW_CTRL_ON 1 Flow control is being asserted – Components will log this message when the system is low on available disk space and the DUS is answering no to components space requests. This means the component is flow controlled (stopped) until more space becomes available UTC DG :2515 FLOW_CTRL_OFF 1 Flow control has been deasserted – Components will log this message when space has become available and they are returning to normal processing. This means the component is no longer flow controlled (stopped) because more space has become available.

18 Log Messages UTC FTE :7706 FLOW_CTRL_PROCESSING_PAUSED GYMDC39209W Processing paused because output at maximum – Components will log this message when there is too much data in the output directory waiting to be acquired by downstream components UTC CME :1784 FOW_CTRL_PROCESSING_UNPAUSED GYMDC39211I Processing unpaused because no longer at max output – Components will log this message when enough output data has been acquired

19 Log Messages UTC AMGR.DCAIX :4888 FLOW_CTRL_PURGE_SOME 1 Notifying components in dir (/opt/proviso/datachannel) to purge some – DUS will log this message when it is telling components to delete some data from their done directory. This is normal and should not cause worry UTC CME :15281 FLOW_CTRL_PURGE_SOME 1 Server requests I purge some – Components will log this message when they are told to delete some data from their done directory. This is normal and should not cause worry.

20 Log Messages UTC AMGR.DC1C-4673:11897 FLOW_CTRL_PURGE_ALL 1 Notifying components in dir (/opt/proviso/datachannel) to purge all – DUS will log this message when it is telling components to delete all data from their done directory UTC CME :5271 FLOW_CTRL_PURGE_ALL 1 Server requests I purge all – Components will log this message when they are told to delete all data from their done directory

21 Log Messages UTC AMGR.DC1C-4673:11897 FLOW_CTRL_QUOTA_FAILURE GYMDCDC10111 Error: Some error. Unable to get disk consumption for dir: /opt/datachannel – DUS will log this message when it encounters an error while running the du command UTC AMGR.DC1C-4673:11897 FLOW_CTRL_FS_FAILURE GYMDCDC10157 Error: Some error. Unable to get free disk space for dir: /opt/datachannel – DUS will log this message when it encounters an error while calculating the amount of free space available on this filesystem

22 Troubleshooting Tips Grep the log for FLOW_CTRL log messages Run the du command manually on the root directory to make sure it works Run the df command manually to see how much free space is available If your system is catching up after some components were stopped it is normal to see components log FLOW_CTRL_PROCESSING_PAUSED and FLOW_CTRL_PROCESSING_UNPAUSED as they rush ahead and downstream components are unable to keep up with the output of new data. BCOL and LDR have FLOW_CTRL_SKIP log messages that describe why BCOL or LDR is skipping the acquisition of data. Usually it is because too much data has already been acquired and buffered. CME logs NOT_ACQUIRING_TUPLES for a number of reasons. It could flow controlled or it could have already acquired and buffered too much data. This could also indicate a problem with CME receiving input from some inputs but not other inputs caused by a down collector or stopped FTE or CME.

23 Troubleshooting Tips The system depends on LDR and DLDR being able to load data into the database and then delete that data from the disk. This means that LDR and DLDR are allowed to run even if other components are stopped because the system is low on disk space. When flow control problems happen, components will back up from right to left (see diagram below). If your LDR is crashing it will eventually cause CME then FTE then UBA to flow control. This means when you notice a problem, start looking at components on the right to see if they are the cause. UBAFTECM E LDRDLDRstart Flow control problems cause backups upstream

24 Upgrade All installations before upgrade should have one datachannel root directory per host Check that there are no linked or mounted component directories under the datachannel root directory. If there are, they need to be reconfigured so that they are local directories under the main root directory or a new mounted root directory The Topology Editor will sum up component quotas and set the default root directory quota to this sum. Check that this sum is not greater than the amount of disk space available

25 Environment Design Guidelines Never link or mount a component directory under a datachannel root directory FC_QUOTA for a root directory should not exceed the amount of actual space available on the filesystem FC_FSLL should be large enough to be useful. Setting this number too low will make it very hard to recover if the system runs out of space. Think of this number as the buffer of space that will be available to recover from running out of space.