I am wondering why the system can be down, outage, or slow. If we build better codes with DevOps. Nowadays, we can improve cloud operations through a process namely System Operations (SysOps). SysOps provides five main activities namely deploy, monitor, fortify, secure, optimize, and deploy. When doing complex system operations, we should have the baseline. The baseline on the cloud era namely AWS Well-Architected Framework.
Based on the well-architected framework, we can develop a process that helps system operations better through these five main principles.
- Securing the system access through integrated authentication and authorization such as using Azure AD, or Amazon Cognito
- Managing the policy for the organization through automation
- Auditing system resources through System manager and system configuration
- Having deployment and upgrading strategies
- Tagging resources and naming conventions