Operations Assignment: Operations Reliability Agent

We ensure our software is operating such that there is little to no disruption in value delivered


Expectations / Description

The One Number we are accountable for

  • Mean time to awareness
    • How long it takes to recover from a severity-2+ event (app going down, major functionality having issues, major bug released into production)

this definition and outcome need some work

Supporting / leading indicators we pay attention to

  • Mean time to restore (this is actually a lagging indicator... but the better we make ourselves aware, the more urgent things will be fixed appropriately)
  • # of issues we notice and act on Before they are submitted by clients/partners vs issues we act on after they inform us
  • Squad Performance metrics feedback loop health

Links to dashboard that showcase these measures... coming soon

Things you might deliver/do

  • Logging certain events in New Relic so that we can monitor their health
  • Having the data to know if things are going well or off the rails
    • Knowing success rate on builds

Examples

  • Make sure we are handling AirBrakes
    • We know if the integrations goes down
    • EEs know the process and following
    • They are fixed (folks know what to do)
  • not that things are fixed, but we know about it and know the impact
  • ensure nightly processes actually run
  • Making sure the value we have promised to deliver consistently

Requirements

Configuration Health

  • ✅ Has 3 Abilities
  • ✅ Is a part of 1 Position
  • ⚠️ Has been referenced in no observations
  • ℹ️ No one has reacted to this Assignment
  • ℹ️ Fewer than five people (1) have an official rating on this Assignment. To ensure anonymity, analysis will only appear after at least five people have ratings.
  • ⛔️ Last updated: over 3 years ago
  • ℹ️ Last conversed about: over 3 years ago

Examples / Observations

An observation relating to  Operations Reliability Agent  has not been publicly recognized yet.

Official Operations Reliability Agents

Manager Details:
This section is for CareerPlug folks only. Sign your team up to find your Gruuv!

Teams needing an Operations Reliability Agent

This section is for CareerPlug folks only. Sign your team up to find your Gruuv!

Positions that reference being an Operations Reliability Agent

This section is for CareerPlug folks only. Sign your team up to find your Gruuv!

Conversations about Operations Reliability Agent

This section is for CareerPlug folks only. Sign your team up to find your Gruuv!

Embed code

<iframe src="http://ourgruuv.com/our/roles/10169?embed=true&name=operations_reliability_agent&organization=careerplug"></iframe>