The Agile Online Summit happens Oct 24th-26th. Get your SUPER EARLY BIRD TICKET, limited availability!

Get the SUPER EARLY BIRD ticket now!

BONUS: Navigating the Path to SRE, A Guide to Adopting Site Reliability Engineering in Your Enterprise, with Vlad Ukis and Philipp Gündisch

Siemens Healthineers engaged Philipp and Vlad due to growing challenges with their platform. As more users started using the platform, the availability requirements increased, and they wanted to reduce downtime. However, their current operations capabilities did not allow them to achieve the uptime and availability needed, and there was also a problem with the time it took to recover from failures. To address these challenges, Siemens Healthineers decided to adopt SRE as a solution to improve their operations and increase reliability. The adoption of SRE was added to the list of big initiatives, and Vlad and Philipp worked through the organization to get buy-in and support for the change.

Overcoming Challenges in Adopting Agile, Scrum, SRE: A Journey in Change Management and Leadership

Philipp and Vlad faced several challenges in adopting SRE at Siemens Healthineers. One of the biggest challenges was getting support from those who were skeptical about the change. They also struggled to find the business metrics to justify the change and to update the code in operation. The transition from code-and-forget to code-and-operate was also a significant challenge. This change required a business transition as well, from on-premise to software subscription/SaaS. The company did not previously have the responsibility to operate an online service, but now they had to take on this responsibility. The transition also affected the way the business worked and required the customers to get ready to consume an online service. The team also realized that they needed to operate the services when they delivered them quickly, which had a sales impact from having continuous delivery.

However, Philipp and Vlad were successful in encouraging the teams to find out how good/available their systems already were, and played the educator by explaining the solution. They encouraged the teams to measure their services and helped them feel the pain so they would be motivated to improve their services. However, they also advised not to throw the teams into cold water and leave them there, but instead to provide guidance and support along the way. They linked the teams to common service quality indicators and encouraged them to measure their services, so they could understand the impact of SRE adoption. The idea was to provide support but also to allow the teams to work independently, to encourage creativity and innovation.

Successful Adoption of SRE: The Importance of Metrics, DevOps, and Change Management Leadership

One of the biggest successes in the adoption of SRE at Siemens Healthineers was the Central Tools Team. This team built the necessary tools for the adoption of SRE, and enabled knowledge to be transferred to other teams through the adoption of these tools.

Philipp and Vlad also worked with the teams to come up with meaningful targets that reflected customer behavior, these helped the teams define their SLA and SLO. SLA, or Service Level Agreement typically outlines key performance indicators (KPIs), such as uptime, response time, and availability, and defines the consequences if these KPIs are not met. While SLO stands for Service Level Objective. It is a target performance level that a service provider aims to achieve, usually expressed in terms of key performance indicators (KPIs) such as availability, latency, or error rate.

The Central Tools Tem also provided a standard way to collect alerts and visibility, with the work being done once and scaled to all the teams through dashboards that were also used by Product Owners, further aiding in the adoption of SRE.

The success of the Central Tools Team was enabled through technology, process, and coaching. The team had the knowledge and expertise needed to build the necessary tools, and the coaching sessions helped transfer that knowledge to the rest of the teams. The centralized solution provided by the Central Tools Team for collecting alerts and visibility, made it easier for the teams to adopt SRE. This shows the importance of technology, process, and coaching in the successful adoption of SRE.

Resources for SRE adoption in your organization

In this episode, we refer to many different resources that can help you adopt SRE in your organization. Here’s the list of resources:

About Vlad Ukis and Philipp Gündisch

Vlad is a leader of R&D and reliability lead at Siemens Healthineers. In this capacity, he drives Continuous Delivery, SRE, and DevRel transformation, helping this large distributed development organization evolve architecture, deployment, testing, operations, and culture to implement these new processes at scale.

You can link with Vlad Ukis on LinkedIn.

Philipp studied Computer Science in Erlangen and worked in the Operations Team at “teamplay” in Siemens Healthineers. He was responsible for building up / implementing the SRE Infrastructure based on tools from Microsoft Azure. Together with Vlad they drove the mindset transformation by working with the development teams. Recently, Philipp went back to University where he is in a Doctorate degree program.

You can link with Philipp Gündisch on LinkedIn.

Get The Booklet!
How to deliver on time and eliminate scope creep By scoping projects around outcomes and impacts, not requirements!
Get the Product Owner Booklet!
Avoid scope creep! And learn to scope projects around impacts and outcomes, not requirements!
Get These Valuable Lessons Today!
Down-to-earth, hard-earned Scrum Masters lessons and the Tips from the Trenches e-book table of contents, delivered by email
Enter e-mail to download a clickable PO Cheat Sheet
This handy Coach Your PO cheat-sheet includes questions to help you define the problem, and links to handy, easy techniques to help you coach your Product Owner
Enter e-mail to download a clickable PO Cheat Sheet
This handy Coach Your PO cheat-sheet includes questions to help you define the problem, and links to handy, easy techniques to help you coach your Product Owner
Enter e-mail to download a checklist to help your PO manage their time
This simple checklist and calendar handout, with a coaching article will help you define the minimum enagement your PO must have with the team
Enter e-mail to download a checklist to help your PO manage their time
This simple checklist and calendar handout, with a coaching article will help you define the minimum enagement your PO must have with the team
Internal Conference
Checklist
Internal Conference
Checklist
Download a detailed How-To to help measure success for your team
Motivate your team with the right metrics, and the right way to visualize and track them. Marcus presents a detailed How-To document based on his experience at The Bungsu Hospital
Download a detailed How-To to help measure success for your team
Read about Visualization and TRANSFORM The way your team works
A moving story of how work at the Bungsu Hospital was transformed by a simple tool that you can use to help your team.
Read about Visualization and TRANSFORM The way your team works