Operations & Process
Technique asking “why” repeatedly to find root cause.
Managing who can access systems and data.
Using tools to reduce manual work.
Controlling changes to reduce risk.
Written guides, references, and knowledge base.
Process to handle outages or major issues.
Central repository of help articles and internal docs.
Regular schedule of meetings for alignment.
Routine for setting and reviewing OKRs.
Rotation where engineers respond to incidents.
A practical guide for how to execute a repeatable activity.
Review after an incident to learn and prevent recurrence.
A repeatable sequence of steps to achieve an outcome.
Internal process of purchasing tools/services.
Quality Assurance: testing to ensure product works as expected.
Framework defining Responsible, Accountable, Consulted, Informed.
Planning and controlling software releases.
Finding the underlying cause of a problem.
Step-by-step guide for operational tasks and incidents.
Evaluation of security controls and risks.
Service Level Agreements defining uptime and support response.
Standard Operating Procedure: documented step-by-step process.
Internal tools and software used to run operations.
Selecting and managing third-party providers.
Flow of tasks through steps and people.
