Job Details
Location:
Sydney, New South Wales, Australia
Posted:
Sep 16, 2023
Job Description
Working at Atlassian
Atlassians can choose where they work, whether in an office, from home, or a combination of the two. That way, Atlassians have more control over supporting their family, personal goals, and other priorities. We can hire people in any country where we have a legal entity. Interviews and onboarding are conducted virtually, as part of being a distributed-first company.
Provided there is sufficient time zone overlap with the team, we're able to hire eligible candidates for this role from any location in Australia and New Zealand. If this sparks your interest, apply today and chat further with our friendly Recruitment team.
The Reliability Process Group (RPG) comprises ten fully remote engineers with backgrounds in Software, Process Management and Data Science. We are passionate about uplifting the reliability of all Atlassian products, and we continue to move the needle by improving the incident management process and tools used by all of Atlassian. You will report to an Australian-based Senior Engineering Manager and work closely with another Australian-based Process Engineer to help guide the team.
Key Responsibilities:
- Operate, own and evolve the centralised Atlassian incident process which is used by all Engineering teams at Atlassian
- Collaborate with teams and senior partners to ensure incident response, resolution, and post-incident review processes are followed
- Conduct root cause analysis and post-mortems for major incidents, working with technical teams and executive partners to lead systemic improvements
- Collaborate with the Incident Management automation team to refine and automate incident/PIR workflows
- Deliver and enhance our training programs on Major Incident Management and Post Incident Reviews
Required Experience:
- At least two years of experience with Incident Management within a cloud environment at scale
- Experience in root cause analysis and post-mortems
- Experience delivering training to large groups
- Experience driving process improvements across areas of an organisation
Preferred Skills:
- Experience in incident wargaming / simulation
- Experience in preparing high quality video training
- Understanding of operating systems, networking, code and databases (being technical always helps in incident resolution)
- Operations background (e.g. NOC, sysadmin)
More about our team
Reliability Process Group is part of our Production Engineering organisation, a specialist team that builds and operates highly scalable and reliable platform services.
Our perks & benefits
Atlassian offers a variety of perks and benefits to support you and your family, and to help you engage with your local community. Our offerings include health coverage, paid volunteer days, wellness resources, and so much more. Visit go.atlassian.com/perksandbenefits to learn more.
About Atlassian
At Atlassian, we're motivated by a common goal: to unleash the potential of every team. Our software products help teams all over the planet and our solutions are designed for all types of work. Team collaboration through our tools makes what may be impossible alone, possible together.
We believe that the unique contributions of all Atlassians create our success. To ensure that our products and culture continue to incorporate everyone's perspectives and experience, we never discriminate based on race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. All your information will be kept confidential according to EEO guidelines.
To provide you the best experience, we can support with accommodations or adjustments at any stage of the recruitment process. Simply inform our Recruitment team during your conversation with them.
To learn more about our culture and hiring process, visit go.atlassian.com/crh.