We ensure our software operates so that there is little to no disruption in the value we deliver.
Expectations / Description
The One Number we are accountable for
- Mean time to awareness
- How long it takes to recover from a severity-2+ event (app going down, major functionality having issues, major bug released into production)
(Note: this definition and outcome still need some work.)
Supporting / leading indicators we pay attention to
- Mean time to restore (this is actually a lagging indicator, but the sooner we become aware of issues, the more promptly the urgent ones get fixed)
- Number of issues we notice and act on before clients/partners submit them vs. issues we act on only after they inform us
- Squad Performance metrics feedback loop health
Links to dashboards that showcase these measures: coming soon
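To make the measures above concrete, here is a minimal sketch of how they could be computed from a list of incident records. The `Incident` fields and function names are hypothetical, not taken from any existing tooling, and it assumes time to restore is measured from the start of the incident (some teams measure from awareness instead).

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Incident:
    # Hypothetical record shape, for illustration only.
    started_at: datetime             # when the disruption actually began
    aware_at: datetime               # when we first became aware of it
    restored_at: Optional[datetime]  # when service was restored (None if ongoing)
    reported_by_client: bool         # True if a client/partner told us first


def mean_delta(deltas):
    """Average an iterable of timedeltas; return None if it is empty."""
    deltas = list(deltas)
    if not deltas:
        return None
    return sum(deltas, timedelta()) / len(deltas)


def mean_time_to_awareness(incidents):
    """Average gap between an incident starting and us knowing about it."""
    return mean_delta(i.aware_at - i.started_at for i in incidents)


def mean_time_to_restore(incidents):
    """Average gap between an incident starting and service being restored.

    Assumption: measured from incident start; ongoing incidents are skipped.
    """
    return mean_delta(
        i.restored_at - i.started_at
        for i in incidents
        if i.restored_at is not None
    )


def proactive_share(incidents):
    """Fraction of incidents we noticed before a client/partner reported them."""
    if not incidents:
        return None
    proactive = sum(1 for i in incidents if not i.reported_by_client)
    return proactive / len(incidents)
```

Tracking `proactive_share` alongside mean time to awareness shows whether faster awareness is coming from our own monitoring or from clients telling us sooner.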
Things you might deliver/do
- Logging certain events in New Relic so that we can monitor their health
- Having the data to know if things are going well or off the rails
- Knowing success rate on builds
Examples
- Make sure we are handling AirBrakes
- We know if an integration goes down
- EEs know the process and are following it
- They are fixed (folks know what to do)
- Not necessarily that things are fixed, but that we know about them and know the impact
- Ensure nightly processes actually run
- Making sure we consistently deliver the value we have promised
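As one illustration of "ensure nightly processes actually run", the sketch below flags jobs whose last successful run is missing or too old. It is only a sketch: the job names, the `last_success` mapping, and the 26-hour threshold are all assumptions, and in practice the timestamps would come from wherever the jobs record their completion.

```python
from datetime import datetime, timedelta


def stale_jobs(last_success, now, max_age=timedelta(hours=26)):
    """Return the sorted names of jobs with no recent successful run.

    last_success maps a job name to the datetime of its last successful
    run, or None if it has never succeeded. The 26-hour default is an
    assumed threshold that leaves slack for a nightly job running late.
    """
    return sorted(
        name
        for name, finished in last_success.items()
        if finished is None or now - finished > max_age
    )


# Usage with hypothetical job names:
now = datetime(2024, 1, 2, 8, 0)
last_success = {
    "nightly_export": now - timedelta(hours=2),
    "billing_sync": now - timedelta(hours=30),
    "report_build": None,
}
print(stale_jobs(last_success, now))
```

A check like this reports that a job did not run, which is different from reporting that a job ran and failed, so it complements error tracking rather than replacing it.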
Requirements
- Operations Reliability Agents must have a position with a reach of 2.1 or higher
- Operations Reliability Agents must be milestone 1+ in Data analysis / synthesis
- Operations Reliability Agents must be milestone 2+ in Prioritization Strategy (Sense and Respond)
- Operations Reliability Agents must be milestone 2+ in Communication
Configuration Health
- ✅ Has 3 Abilities
- ✅ Is a part of 1 Position
- ⚠️ Has been referenced in no observations
- ℹ️ No one has reacted to this Assignment
- ℹ️ Fewer than five people (1) have an official rating on this Assignment. To ensure anonymity, analysis will only appear after at least five people have ratings.
- ⛔️ Last updated: over 3 years ago
- ℹ️ Last conversed about: over 3 years ago