Tech Operation Operation’s KPI & Tasks Processes Team Architecture
KPI & Tasks Ensure the availability of production services Set up monitoring and incident handling process Build High Availability feature into the production architecture Responsible for deployment and change of production services Set up deployment/change process Set up information database (CMDB) including information of all the items from infrastructure, OS, middleware to application Set up management process of network resources, such as Ips, bandwidth, etc Set up management process of hardware resources, such as servers, CPU, HDDs, etc Audit for unauthorized access or change to the systems Set up Audit process of server access and changes
Deployment/Change Process
Change Ticket System
Monitoring System (4 level monitoring)
Monitoring Process(with incident auto- handling)
Integrated Dashboard
Detailed Instant Values of Server/Application
List of Incident Tickets
Information of Private IP
Information of Public IP
Information of Physical Servers
Information of Network Device
Information of Virtual Machine
Deployment Map
Information of Backup Tasks
Information of Database
Statistics of Availability
Statistics of Availability and incidents
Audit of Access to Servers
Audit of Changes to Network Device
Team ResponsibilitiesMembers System Engineer 1.24x7 standby for incident handing 2.Change & Deployment, Allocation of Resources 3.Routine audit process 4.DBA 4 System Engineer Tool DeveloperDevelopment of operation tools and scripts for operation information database and auto data collection, monitoring platform, ticket system, automation of incident handling and deployment, etc 1 developer IDCInstallation, maintenance of IDC network, server hardware 2 IDC Engineer