PROFESSIONAL-CLOUD-DEVOPS-ENGINEER Question #120: Real Exam Question with Answer & Explanation
The correct answer is B: "Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized ..." (option text truncated). In SRE culture, the correct response to an outage is a blameless postmortem that gives stakeholders what they need to act on remediations.
Question
Your organization wants to implement Site Reliability Engineering (SRE) culture and principles. Recently, a service that you support had a limited outage. A manager on another team asks you to provide a formal explanation of what happened so they can action remediations. What should you do?
Options
- A. Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized ...
- B. Develop a postmortem that includes the root causes, resolution, lessons learned, and a prioritized ...
- C. Develop a postmortem that includes the root causes, resolution, lessons learned, the list of ...
- D. Develop a postmortem that includes the root causes, resolution, lessons learned, the list of ...
Explanation
In SRE culture, the correct response to an outage is a blameless postmortem. Answer B is correct because it covers the root causes, the resolution, the lessons learned, and a prioritized action plan, which is what stakeholders need in order to act on remediations. Answers A and B appear identical in the truncated option text shown here; this explanation takes B to be the complete version, the one that includes owner assignments and due dates. Answers C and D are wrong because they likely include a "list of people responsible," which violates the blameless postmortem principle central to SRE: blame discourages transparency and hinders learning from incidents.
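The elements named in the explanation can be sketched as a small data structure. This is only an illustration, not a standard SRE template: the class names, field names, the numeric priority scale, and the sample incident values are all hypothetical. Note that the record has no field for people responsible for the outage; `owner` names who follows up on an action item, not who is to blame.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class ActionItem:
    description: str
    priority: int   # hypothetical scale: 1 = most urgent
    owner: str      # who follows up on the item, not who caused the outage
    due: date


@dataclass
class Postmortem:
    title: str
    root_causes: List[str]
    resolution: str
    lessons_learned: List[str]
    action_items: List[ActionItem] = field(default_factory=list)

    def prioritized_actions(self) -> List[ActionItem]:
        # Most urgent first; ties are broken by the earliest due date.
        return sorted(self.action_items, key=lambda a: (a.priority, a.due))

    def missing_sections(self) -> List[str]:
        # Names of the sections that are still empty.
        sections = {
            "root_causes": self.root_causes,
            "resolution": self.resolution,
            "lessons_learned": self.lessons_learned,
            "action_items": self.action_items,
        }
        return [name for name, value in sections.items() if not value]


# Hypothetical example values, for illustration only.
pm = Postmortem(
    title="Limited outage (hypothetical)",
    root_causes=["Config push skipped the canary stage"],
    resolution="Rolled back the config",
    lessons_learned=[],
    action_items=[
        ActionItem("Add a canary stage", 2, "release-team", date(2024, 1, 31)),
        ActionItem("Alert on error rate", 1, "sre-team", date(2024, 2, 15)),
        ActionItem("Document rollback steps", 2, "sre-team", date(2024, 1, 15)),
    ],
)
```

Calling `pm.missing_sections()` before sharing the document is one way to check that every section a stakeholder needs has been filled in.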