Staffing to Percentile vs Mean Forecast

From WFM Labs
Staffing to percentile: choosing your risk tolerance on the forecast distribution.

Staffing to Percentile vs. Mean Forecast refers to the decision framework governing how a workforce planner selects a target staffing level when demand is uncertain. Rather than staffing to the single-point (mean) forecast, planners may elect to staff to a higher percentile of the demand distribution — for example, the 75th (P75) or 90th (P90) percentile — accepting higher expected labor cost in exchange for reduced probability of a service-level failure.[1] The framework integrates Probabilistic Forecasting, queueing theory, and cost optimization, and is foundational to risk-aware capacity planning in contact centers and other service operations. The optimal staffing percentile depends on the asymmetry between understaffing costs (service failures, customer dissatisfaction, agent overload) and overstaffing costs (excess labor expense), the shape of the demand distribution, and the organization's explicit or implicit risk tolerance.[2]

The Case Against Staffing to the Mean

Staffing to the mean (P50) of a demand distribution minimizes expected overstaffing cost but produces a systematic service-level risk. Because contact center service performance is a nonlinear function of traffic intensity (as described by Erlang-C and Erlang-A models), understaffing is not symmetric with overstaffing: a demand realization above the mean produces a service-level degradation that is disproportionately larger than the service-level improvement produced by a demand realization equally below the mean.

This asymmetry means that, in expectation, staffing to the mean produces worse average service-level outcomes than staffing to a percentile above the mean. The magnitude of this effect depends on the coefficient of variation of the demand distribution and the slope of the service-level function at the operating point. In high-utilization environments operating near the knee of the Erlang-C curve, the asymmetry is pronounced; in low-utilization environments, it is negligible. See also Queueing Theory Fundamentals and Power of One.

Demand Uncertainty and the Forecast Distribution

Probabilistic Forecasting methods characterize demand not as a single number but as a distribution. Key parameters include:

  • Point estimate (P50): The median or mean of the forecast distribution. This is the output of most traditional forecasting methods.
  • Prediction intervals: The range within which actual demand is expected to fall with a specified probability (e.g., a 90% prediction interval spans from P5 to P95).
  • Coefficient of variation (CV): The ratio of the standard deviation to the mean of the forecast distribution. Higher CV indicates greater forecast uncertainty and, typically, a larger penalty for staffing to the mean.

Sources of demand uncertainty in contact centers include inherent Poisson arrival variability (irreducible), forecast model error (reducible with better methods), and event-driven demand shocks (partially predictable). Brown et al. (2005) document that call center arrival processes exhibit overdispersion relative to a pure Poisson process, implying that forecast uncertainty is larger than what Poisson assumptions suggest.

Staffing Percentile Selection

Cost-Asymmetry Framework

The optimal staffing percentile P* balances the marginal cost of overstaffing against the marginal cost of understaffing. Under a stylized cost model:

P* = C_under / (C_under + C_over)

where C_under is the per-unit cost of being understaffed (cost of one FTE-interval of missing capacity) and C_over is the per-unit cost of being overstaffed (cost of one FTE-interval of excess capacity). This is the newsvendor solution applied to staffing decisions.

In environments where understaffing costs substantially exceed overstaffing costs — for example, where service-level misses trigger regulatory penalties, revenue loss from abandoned contacts is significant, or agent burnout drives high attrition — the optimal percentile shifts toward P75 or higher. In environments where labor cost pressure dominates and service-level consequences are modest, P50 or slightly above may be appropriate.

Practical Percentile Targets

Percentile Typical Use Case Implied Risk Posture
P50 (mean) Cost-minimization, mature low-variance environments Accepts ~50% probability of under-capacity in any interval
P70–P75 Balanced risk/cost, moderate demand variability Accepts ~25–30% probability of under-capacity
P80–P85 Service-critical operations, regulatory SLAs Accepts ~15–20% probability of under-capacity
P90+ Crisis planning, safety-critical environments, peak seasons Accepts <10% probability of under-capacity

The appropriate percentile is not universal; it varies by channel, queue type, and planning horizon. Long-run headcount decisions (annual hiring plans) appropriately use a lower percentile than intraday or weekly tactical decisions, because long-run plans can be revised as forecasts improve.

Interval vs. Aggregate Application

The staffing percentile decision applies at two distinct levels:

Aggregate (Monthly / Quarterly) Planning

At the aggregate planning level, the demand distribution reflects uncertainty about total monthly or quarterly volume. Staffing to a higher percentile at this level means authorizing more headcount than the mean forecast requires, reducing the risk of entering a quarter understaffed. This is the primary application of cost-of-delay analysis (see Cost-of-Delay in Staffing Decisions).

Interval (Half-Hour / Hour) Staffing

At the interval level, demand variability is driven by both forecast model error and Poisson arrival variability within the interval. Interval Level Staffing Requirements calculations using Erlang-C or Erlang-A already incorporate Poisson variability in the service-level formula. The additional question is whether the offered load input to the Erlang formula should itself be inflated to a higher percentile of the forecast distribution.

Whitt (2006) demonstrates analytically that the optimal interval staffing level depends on forecast uncertainty and the cost asymmetry structure, and that simple mean-staffing can be substantially suboptimal when forecast CV exceeds 10–15%. Practical implementations may use a safety buffer — a fixed or percentage-based addition to mean-forecast staffing — as a tractable approximation to full distributional optimization.

Relationship to Safety Staffing

The concept of safety staffing in queueing theory refers to the practice of staffing above the offered load by a margin proportional to the square root of the offered load, a result that emerges from heavy-traffic approximations. Specifically, in a system with offered load R Erlangs and safety parameter β:

N = R + β√R

where N is the number of agents and β > 0 represents the safety margin. This square-root staffing rule provides a staffing level that maintains quality-of-service as scale increases, and corresponds approximately to staffing to a consistent service percentile across different queue sizes. See Erlang-C and Pooling Theory for the scale-efficiency implications of this result.

Dynamic Percentile Strategies

Mature planning organizations may employ dynamic percentile selection that adjusts the target percentile based on:

  • Forecast age: Newer forecasts have higher uncertainty; older forecasts for the same period have been revised and typically have lower uncertainty. A reasonable strategy targets a higher percentile early in the planning cycle and converges toward P50 as forecast uncertainty decreases.
  • Headcount flexibility: Organizations with high workforce flexibility (large contingent labor pools, effective cross-training programs) can afford to staff to a lower base percentile, activating contingent resources when realized demand exceeds the base plan.
  • Queue type: Real-time queues with Abandonment consequences may target higher percentiles than asynchronous queues (email, cases) where work can accumulate without immediate customer abandonment.

Maturity Model Considerations

At L1–L2 maturity, staffing decisions use single-point forecasts without explicit acknowledgment of forecast uncertainty. The concept of staffing to a percentile is absent; planners staff to the number their forecasting tool produces, with informal judgment-based adjustments.

At L3, organizations compute prediction intervals for key demand forecasts and apply a documented staffing buffer — often a fixed percentage above mean forecast — that implicitly reflects a percentile target.

At L4–L5, organizations explicitly compute the cost-asymmetry ratio and derive theoretically justified staffing percentiles by queue type and planning horizon. Staffing decisions are documented with an explicit risk statement: the probability of under-capacity under the chosen staffing plan. See WFM Labs Maturity Model.

Related Concepts

References

  1. Whitt, W. (2006). Staffing a call center with uncertain demand forecasts. Management Science, 52(10), 1by519–1538.
  2. Brown, L., Gans, N., Mandelbaum, A., Sakov, A., Shen, H., Zeltyn, S., & Zhao, L. (2005). Statistical analysis of a telephone call center: A queueing-science perspective. Journal of the American Statistical Association, 100(469), 36–50.