Site Reliability Engineering at Airbnb:
Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of Airbnb's infrastructure and products. SREs design and implement the tools that automate building reliable and performant systems.
What makes Site Reliability Engineering different at Airbnb?
We emphasize building tools over manual processes. We create, not operate. Things should go from repeatable to automated quickly
We're rooted in open source (http://airbnb.io/) and give as much back to the community as possible with both new and contributions to existing projects
Our job is to focus on building reliable infrastructure and tools for our product teams so that they can focus on solving user problems and new features, not reinventing platforms
SREs don't sit on the other side of the tossing fence -- we're a first class engineering citizen and help lead our infrastructure focus
What are some examples of Site Reliability Engineering work at Airbnb?
Work with product engineering teams on design and implementation choices of large scale distributed systems
Automate as much as humanly possible and always configure as code
Bring ideas to life (i.e. production) to help make the lives of engineers better
Predict our future failures and work proactively to mitigate them