DevOps Journey Part 4 – Site Reliability Engineering

DevOps Journey Part 4 – Site Reliability Engineering

We’ve talked about why stuff breaks in the production environment when it didn’t in development (see “Works for me”), how Continuous Improvement / Continuous Deployment helps in failure detection, and how we can set up a “like-production” staging environment in which to test your features.

At this point in the journey, we act as consultants – or Site Reliability Engineers – in the production environment. Remember the various types of pain along the journey, that we help take away? The pain at this stage is the lack of knowing the big picture when it comes to how all the different features are interacting and contributing to the grand vision of what’s trying to be accomplished in your environment. Acting as your Site Reliability Engineer means that we do regular code level investigation for issues through the entire workflow. In addition, we provide proven best-practice consultation on Cloud native software designs.

See this Wikipedia article for a simple high-level summary of the differences between SRE and DevOps.

Categories: DevOps

By Slava Martin

November 5, 2018

Slava Martin
Author: Slava Martin

The Crafty Penguins team provides Linux, DevOps, and SRE tools and services for leading edge technology companies around the world.

PREVIOUS

DevOps Journey Part 5 – Production Infrastructure

NEXT

DevOps Journey Part 3 – Staging Environment