From initiatives to impact: making workplace wellbeing actually count

James Haggarty
Global Wellbeing Lead

Executive Summary

  • Activity metrics confirm engagement, but they cannot answer the questions your board asks about absence, retention, and productivity.
  • The measurement gap is structural. Most wellbeing programmes track what they deliver rather than the risk they exist to reduce.
  • Sickness absence data records failure after it happens. It misses presenteeism, hidden strain in high performers, and the variation buried inside aggregate figures.
  • Risk becomes actionable only when you can see it by cohort, sorted by role, tenure, shift pattern, or demographic group.
  • A credible framework starts with a risk baseline, defines the outcome to shift, then measures intervention against that baseline rather than against utilisation.

Why wellbeing programmes generate activity but not answers

Most wellbeing programmes produce a healthy stream of activity data and very little that answers the questions a board actually asks. You can show how many people attended a resilience session, downloaded the app, or called the EAP last quarter. None of those numbers tells you whether absence fell, retention improved, or productivity held steady under pressure.

Activity metrics are not the problem. They confirm that people are engaging, they signal how far a programme reaches across the workforce, and they help the people running it manage delivery week to week. A team that cannot see utilisation cannot manage capacity or spot a service nobody uses. The trouble starts when those metrics are the only layer of measurement in place.

A CFO asking what the wellbeing budget returned does not want a participation figure. They want to know whether the spend changed something measurable about workforce health. Activity data records what happened. It cannot tell you whether what happened mattered, because it was never connected to a workforce outcome in the first place.

The problem is structural, not a failure of effort. Most programmes are built to measure delivery, because delivery is what the programme controls. Without a baseline view of where risk sits before any intervention runs, activity metrics have nothing to be measured against. You count the inputs and hope they correlate with the outcome, rather than knowing they do. The gap senior HR leaders feel is real, and it sits upstream of the metrics they currently report.

The difference between output measurement and outcome measurement

Output measurement counts what your programme produces, such as sessions delivered, app downloads, EAP calls logged, and webinar registrations. Outcome measurement tracks whether the indicators you care about move, such as absence rates, voluntary attrition in a given cohort, self-reported strain over two quarters, and the proportion of employees flagged as high-risk this year versus last. The first tells you the programme ran. The second tells you whether anything changed.

The distinction sounds obvious, yet most measurement frameworks collapse it. A team reports that 40 percent of staff downloaded the wellbeing app and treats that figure as evidence of impact. It is evidence of reach. Reach and effect are different claims, and a board knows the difference. When a CFO asks whether the spend reduced absence, an engagement number does not answer the question. It answers a different one.

Outputs are easy to count and arrive quickly, which is why the confusion persists. You can pull utilisation data the week after a campaign, and a rising number feels like accountability. Outcomes move slowly, depend on factors outside the programme, and often require a baseline that was never set. So teams report the metric they have rather than the one that matters, and the gap between activity and impact stays hidden inside a dashboard that looks busy.

Outcome measurement is harder for a structural reason. You cannot measure a change without first defining the state you are trying to change. Until you know where risk sits and how severe it is, every output number floats free, attached to no result it can be tested against.

Why sickness absence data misleads more than it reveals

Sickness absence tells you where your wellbeing strategy has already failed, not where it is failing. By the time an employee is signed off, the risk that produced the absence has been building for weeks or months. Absence sits at the tail end of a curve that started somewhere upstream, and the data records only the moment the person could no longer come to work. You learn that something broke. You learn nothing about what was bending before it broke.

The bigger blind spot is everyone who keeps showing up. Presenteeism, where people work through poor mental or physical health, never appears in an absence figure, yet it carries a measurable productivity cost. High performers are the most likely to mask strain, holding their output steady until they leave or collapse, and an absence report flags neither warning. The risk concentrates precisely where the numbers look cleanest.

Aggregate absence figures hide the variation that would tell you where to act. A 3 percent rate across the organisation can sit on top of a 9 percent rate in one shift pattern, one team, or one age band. Average the cohorts together and the signal disappears into a number that looks manageable. The question is not how much absence you have. The question is who is carrying it, and why.

What cohort-level risk intelligence changes

Aggregate workforce health data hides the patterns that matter. An organisation-wide stress score of 40% tells you very little, because it averages a finance team running at 15% against a frontline shift cohort running at 70%. Risk only becomes something you can act on when you can see which group carries it, and what is driving it for them specifically. Disaggregation by role, tenure, shift pattern, or demographic group turns a flat number into a map.

Disaggregating the data changes what you measure. Once you know that night-shift staff in their first year report the highest fatigue and the lowest psychological safety, you stop measuring whether your resilience webinar was well attended. You start measuring whether fatigue scores in that cohort fall over the next two quarters. The outcome is defined by the risk, not by the programme.

Cohort-level intelligence also changes what you fund. Most wellbeing budgets spread thinly across the whole workforce, which guarantees that the highest-risk groups get the same generic support as the lowest-risk ones. Cohort-level intelligence lets you concentrate spend where the concentration of risk justifies it, and that concentration is what makes an effect measurable. A targeted intervention against a known baseline produces a signal you can read. A universal intervention against an undifferentiated average produces noise.

Sequencing is the discipline that makes this work. You identify the cohort, establish its baseline, then commit resource against a specific indicator you expect to move. When the review comes, you compare the cohort to its own starting point rather than to a participation rate. A CFO can follow that logic, because it mirrors how every other budget line is justified. Spend went to the highest-risk area, and the risk measurably reduced.

What a risk-first measurement framework looks like in practice

A risk-first framework follows a fixed sequence, and the order matters more than any single step. Start by establishing a risk baseline for each cohort, not an organisation-wide average. Define the cohorts that reflect how work actually varies, by role, tenure, shift pattern, or location. The baseline tells you where strain concentrates before you commit a penny to fixing it.

Next, define the outcome you intend to shift, and make it a workforce health indicator rather than a participation figure. A vague aim like "improve wellbeing" cannot be measured. A specific aim like "reduce stress-related absence in the night-shift cohort over twelve months" can. The outcome you name dictates the data you collect, so name it precisely.

Only then do you select or commission the intervention. When you know that a cohort shows elevated stress indicators and rising short-term absence, you can match the response to the driver instead of buying a programme and hoping it lands. The intervention answers a defined problem, which means its job is clear from the start.

Set a measurement cadence next, and fix it before launch rather than after. Quarterly re-measurement against the baseline shows direction. Annual review confirms whether the change held. The cadence keeps you honest, because it commits you to a comparison you cannot quietly drop if the numbers disappoint.

Finally, review against the baseline, never against utilisation. Attendance and app downloads tell you the programme ran. They cannot tell you whether the night-shift cohort's stress indicators fell.

Consider a warehouse team with absence running well above the company average. The risk-first approach maps that team's drivers first, finds elevated fatigue and musculoskeletal strain, sets a target to cut short-term absence in that team specifically, then re-measures at six and twelve months. If absence drops against the baseline, you have evidence. If it does not, you have learned where the real driver sits, which is also a result worth funding against.

Common objections and how to address them

Three objections surface whenever you propose a more rigorous measurement model, and each has a short, defensible answer.

On data privacy, the concern is that diagnosing risk means surveilling individuals. It does not. Risk intelligence works at cohort level, reporting on groups large enough to protect anonymity. You learn that a shift pattern carries elevated stress indicators, not which person reported what. Properly designed, this approach collects less identifiable data than the absence records you already hold.

On the cost of better diagnostics, the honest answer is that you are already spending. The question is whether that spend buys you anything you can defend. A diagnostic that tells you where risk concentrates lets you redirect existing budget toward the cohorts that need it, rather than funding programmes evenly across a population that does not.

On the claim that soft outcomes cannot be quantified, point to the outcomes that already sit in your reporting. Absence days, turnover rates, and productivity measures are all quantified now. The argument is not that wellbeing is unmeasurable. It is that you have been measuring the wrong layer. Define the outcome you want to shift, set a baseline, and the measurement follows.

Building the case for board-level investment

A CFO already understands that unmanaged risk carries a cost. Workforce health risk works the same way. When stress concentrates in a critical team or attrition climbs in a high-tenure cohort, the cost lands whether or not anyone has measured it. The choice is not between spending and not spending, but between paying for prevention you can see and paying for failure you discover late.

Two framing moves shift the conversation onto the board's terms. First, translate cohort risk into the figures the finance function already tracks. A team showing elevated burnout indicators has a quantifiable replacement cost if those people leave, and a measurable productivity drag if they stay and disengage.

Second, present wellbeing investment the way you would any other risk control. You are not asking the board to fund activity. You are asking it to fund a targeted reduction in a defined cost, measured against a baseline you can name. That reframing turns a soft ask into a financial case the board can scrutinise and approve.

Conclusion

Better outcomes follow better questions, and the most useful question is not how many people used the programme. It is where risk is building, in which teams, and what is driving it. Once you can answer that, every pound you spend has a defined target and a baseline to measure against. The measurement problem eases when you start upstream by identifying risk rather than counting delivery.

If you want to pressure-test your current approach, speak with our team about a Workforce Health Risk Assessment.