Root cause analysis turns a one-off incident into a permanent fix. Five Whys, fishbone, and FRAM all work for wastewater - here is how to pick and run them.
Root cause analysis turns a one-off incident into a permanent fix. Five Whys, Ishikawa fishbone, and FRAM all work for wastewater — the choice depends on how clean the causal chain is and how far it crosses departmental lines. Done well, RCA closes the loop between an incident and the next FMEA update.
Root cause analysis is a structured walk backward from a known failure to the underlying conditions that allowed it. The principle is that visible symptoms (a tripped pump, a permit exceedance, a flooded basement) are almost never the actual cause. Stop the investigation at the symptom and the same failure recurs within months.
A useful RCA produces three outputs: a causal chain from symptom back to root, a set of corrective actions ordered by where in the chain they intervene, and an update to the existing risk picture — usually one or more rows added or revised in the plant's FMEA worksheet.
RCA is not blame allocation. The fastest way to kill an RCA programme is to use it to discipline an operator. Once the operating crew learns that being honest in the post-incident interview costs them, the analyses get shallow and the same failures keep coming back.
Not every incident justifies a formal RCA. The pragmatic triggers are:
The simplest method, and the right one for most technical failures with a clean causal chain. Ask "why" of the failure, then "why" of the answer, and so on, typically four to six times, until the chain reaches a condition you can actually fix.
A worked example:
The root is not the rag, not the trip, not even the missing part. It is an FMEA classification error that has now been visible for two months. The corrective action is one row change in the FMEA, one min/max update in the parts inventory, and one SOP change for screening cleaning rotation. The pump itself does not need work.
When the causal chain is messy or multi-factor, Five Whys runs out before it reaches anything actionable. The Ishikawa diagram — a fishbone with one major bone per cause category — works better. The standard six categories adapted for water utilities:
The diagram surfaces multiple contributing causes and their interactions. A storm-driven bypass typically has bones in Environment (rainfall intensity), Machine (storage capacity), Method (operator decision rules), and Measurement (rain gauge upstream coverage). The action plan addresses all four; fixing only the Machine bone is the classic incomplete RCA.
Some failures cross departments and reflect organisational drift rather than a specific equipment failure. A pattern of late discharge monitoring reports is rarely a calibration issue; it is usually a chain involving the lab, the planner, the duty manager, and IT. Five Whys produces a fight; the fishbone produces six bones with no clear priority.
FRAM (Functional Resonance Analysis Method) and STAMP (Systems-Theoretic Accident Model and Processes) are designed for this case. Both treat the plant as a network of functions that interact through expected couplings; they look for couplings that are not what the procedures assume. The output is harder to read than a Five Whys, but it produces durable interventions where the simpler methods produce cosmetic ones.
Use FRAM/STAMP for the second or third recurrence of the same systemic issue. They are slow and analyst-intensive; reserve them for problems that have already resisted the simpler methods.
An RCA without follow-through is a meeting. Every analysis should produce three categories of action, each with an owner and a due date entered into the CMMS:
The RCA also feeds back into the FMEA. At minimum, increase the Occurrence score for the failure mode, reduce the Detection score if the existing controls missed it, and recompute the RPN. If the new RPN crosses an action threshold, raise the new work orders the day the RCA closes. This loop is where the long-term CMMS effectiveness numbers actually come from.
The four most common ways an RCA programme drifts off the rails: