SRE

Information contained in the articles on this site may not be representative of actual use cases. The views expressed in the articles are personal views of the author and are not necessarily those of State Farm Mutual Automobile Insurance Company, its subsidiaries and affiliates (collectively “State Farm”). Nothing in the articles should be construed as an endorsement by State Farm of any non-State Farm product or service.

Watching your tail with latency histograms by Luke Goyer

Using percentiles to observe the tail of your service's latency

Apr 29, 2022

This blog post explores:

A Product Team's Journey

Mar 8, 2022

On a frigid night in late February 2012, I received a call for an emergency production issue. One of the critical batch jobs that processes data for the next day’s business had failed. At 9 PM, there I was, back in the office with 11 other State Farm developers. My then-product manager got us pizzas, coffee, and hot chocolate to cheer up the room, and the team spent the next couple of hours pushing an emergency fix to production. Over the past decade, system failure analysis and maintenance…

A Site Reliability Engineer's guide to using actuators effectively

Jul 26, 2021

Want to learn about actuators and which ones may benefit you from a Site Reliability Engineering (SRE) perspective? This article is for you!

Embracing Chaos by Ranjita Sahu

A Product Team's Journey

Actuators in Action by Billy Malone

A Site Reliability Engineer's guide to using actuators effectively