Reliability
How to Reduce Unplanned Downtime
Written by SteelTree · Last updated June 17, 2026
Unplanned downtime is the time your equipment is stopped when it was supposed to be running, and it is one of the most expensive problems in manufacturing. Siemens puts the cost to the world's 500 largest companies at around $1.4 trillion a year, roughly 11 percent of revenue. Reducing it comes down to three things: knowing which assets matter most, catching problems before they stop the line, and making sure the right person acts in time. Most of the gains come not from collecting more data, but from acting on the data you already have.
What counts as unplanned downtime
Unplanned downtime is any unexpected stop: a breakdown, a jam, a fault that takes a line out when it was scheduled to run. It is the opposite of planned downtime, which you schedule on purpose for preventive maintenance, changeovers, or cleaning. The distinction matters because the two are reduced in completely different ways. You reduce planned downtime by making scheduled work faster and tighter. You reduce unplanned downtime by stopping failures before they happen. This guide is about the second one.
Why unplanned downtime is so expensive
The lost production is only the visible part, and usually the smaller one. Across manufacturing, the average cost of unplanned downtime runs about $260,000 per hour according to Aberdeen, and for tightly coupled operations like automotive it can exceed $2 million per hour. The reason the number climbs so high is everything that rides along with the stop. Reliability research from Siemens finds the hidden costs, the scrap from a hard stop, the overtime to catch up, the missed and late orders, run two to three times the visible production loss. Emergency parts ordered to get a line running again typically cost 30 to 40 percent more than the same parts bought on plan. On some assets there is a safety or compliance exposure on top of all of it. The full cost of an hour down is almost always a multiple of the output you lost, which is why even small reductions pay back quickly.
What causes unplanned downtime
It is tempting to treat downtime as purely an equipment problem, but the data says otherwise. Equipment failure accounts for roughly 42 percent of unplanned downtime, with human error behind another 23 percent according to ABB, and much of the rest comes down to how fast and how well a plant responds. The recurring causes are a short list.
- Reactive maintenance. Running equipment to failure instead of servicing it on time. The failure then comes at the worst possible moment, every time.
- A growing maintenance backlog. Deferred work that was known and never done. Today's maintenance backlog is tomorrow's breakdown.
- No early warning. The signs of a developing failure, vibration, temperature, noise, were there, but nobody was watching the right signal.
- Slow, unclear response. When something does fail, hours get lost just finding the right technician, with the right skills and the right history, for that machine.
- Missing spares. The fix is known, but the part is not on the shelf, so the line stays down while it ships.
- Recurring failures that were never root-caused. The same asset fails the same way, gets patched, and fails again, because nobody fixed the underlying cause.
If the same asset stops twice in a quarter and nobody can say exactly why, that is not only a maintenance problem. It is a data problem.
How to reduce it
There is no single lever. Reducing unplanned downtime is a stack of practices that compound.
- Start with the assets that matter. Not every machine deserves equal attention. Rank your equipment by asset criticality so your effort, spares, and monitoring go where a failure hurts most.
- Move from reactive to preventive maintenance, and measure compliance. Service critical equipment on a schedule before it fails. Aim for PM compliance above 85 percent, the point where unplanned events start to fall off noticeably; below about 70 percent, breakdowns stay common. Tie schedules to actual run-time or machine cycles rather than the calendar, so a machine is serviced on how hard it has worked, not what day it is.
- Add predictive monitoring where it pays. Condition data, vibration, temperature, current draw, can flag a failure before it happens, but sensors are an investment and do not belong on everything. Focus them on the top 10 to 20 percent of assets that drive the most downtime cost, and pair every alert with a clear response, not just a notification.
- Fix the spare parts problem. Identify your high-failure and long-lead parts and stock them deliberately, and automate reordering so a critical part is on the shelf before the technician needs it.
- Speed up the response. Cut the manual steps between a failure and the right person standing at the machine. The faster the right technician arrives with the asset's history in hand, the shorter the stop.
- Track the metrics that tell you where you stand. MTBF and MTTR tell you how often things fail and how fast you recover, and OEE ties downtime to its effect on output. You cannot reduce what you do not measure.
- Watch leading indicators, not just lagging ones. Downtime itself is a lagging indicator: it tells you what already happened. PM compliance and condition trends are leading indicators that let you act before the stop.
- Root-cause your repeat offenders. Review your top recurring failures every month, rank them by the hours they cost, and fix the cause of the worst few rather than patching the same symptom again.
The metrics to track
If you are serious about reducing unplanned downtime, measure a small set of metrics on a regular cadence and watch the trend more than the snapshot.
| Metric | What it tells you | Where to aim |
|---|---|---|
| Unplanned downtime % | Share of available time lost to unexpected stops | Trend down toward 5 to 10 percent |
| MTBF | How often equipment fails | Rising year over year |
| MTTR | How fast you recover when it does | Falling year over year |
| Repeat failure rate | How often the same failure comes back | Down sharply within two quarters |
| PM compliance | Whether scheduled work actually gets done | 85 percent or higher |
| OEE | Downtime's effect on real output | Rising as stops fall |
A backlog creeping up or PM compliance slipping is an early warning that unplanned stops are about to rise, often weeks before they do.
Where to start
If you are starting from reactive, the order that works is this. Rank your assets by criticality first, so you know where to aim. Put preventive maintenance on the critical ones and get compliance up. Add condition monitoring where a failure is expensive and gives warning. Work the backlog down on the assets that matter. And make sure that when something starts to trend, it turns into an assigned action, not a note in a report. Each step lowers the rate of unplanned stops, and they build on each other.
Common myths
A few beliefs get in the way of actually reducing downtime.
- Myth: predictive maintenance fixes everything. Predictive monitoring only works on top of a solid preventive foundation. Bolting sensors onto a plant that skips its basic PMs is one of the most common reasons these projects fail.
- Myth: you just need a maintenance software platform. Software is a tool. The reduction comes from the workflows people actually adopt and the discipline to act, and above all from acting on the data rather than collecting more of it. A dashboard nobody acts on does not move the number.
- Myth: run-to-failure is always cheaper for non-critical assets. It is only cheaper if the asset truly has no downstream effect and can be fixed instantly. Most plants underestimate how far a small stop ripples across the rest of the floor.
- Myth: reducing downtime means buying more sensors. Most preventable downtime was preventable because the signal already existed somewhere in the data. It just never reached the right person as an action in time.
From tracking downtime to preventing it
Knowing what causes unplanned downtime is one thing. Catching it in time, across every asset, shift after shift, is another. The signals are usually already in your data, scattered across your CMMS, your sensors, and your shift logs. The problem is that no one has time to read across all of it and connect a trend to an action before the line goes down.
This is what SteelTree handles. It connects to the systems already holding your operational data, and where that data is scattered or missing it captures what it needs from the work. It watches for the assets trending toward trouble, explains why, recommends the next action, and routes it to the right person before the failure, not after. And because it captures the reasoning behind each decision, it gets sharper at your plant the longer you run it.
See how SteelTree can transform your operational processes →
Frequently asked questions
What is unplanned downtime?
Unplanned downtime is time equipment is stopped when it was supposed to be running, caused by an unexpected failure or stoppage. It is the opposite of planned downtime, which is scheduled on purpose for maintenance, changeovers, or cleaning.
What is the average unplanned downtime in manufacturing?
It varies by sector. Discrete manufacturing commonly runs in the 10 to 20 percent range of available time, while the best-run operations hold it under 5 percent. The average plant loses on the order of 800 hours a year to unplanned stops.
What causes unplanned downtime?
Equipment failure is the largest single cause, about 42 percent, with human error behind roughly another 23 percent. The rest traces to reactive maintenance, a growing backlog, missing spares, slow response when a failure happens, and recurring failures that were never root-caused.
How do you reduce unplanned downtime?
Focus maintenance on your most critical assets first, move from reactive to preventive maintenance with PM compliance above 85 percent, add predictive monitoring on the top 10 to 20 percent of assets by downtime cost, track MTBF and MTTR, work down the backlog, and make sure the right person acts on a developing issue in time.
How much does unplanned downtime cost?
Across manufacturing the average is about $260,000 per hour, climbing past $2 million for automotive, and the figure has risen roughly 50 percent since 2019. For your plant the real number is a multiple of lost production once scrap, overtime, expedited parts, and missed orders are counted.
What PM compliance should I target?
Above 85 percent is the common benchmark, the level where unplanned events start to drop noticeably. Below about 70 percent, breakdowns stay frequent and disruptive.
Is predictive maintenance worth it for every asset?
No. It is generally justified for the top 10 to 20 percent of assets by downtime cost. For the rest, a well-run preventive maintenance program is more cost-effective.
What is the difference between planned and unplanned downtime?
Planned downtime is scheduled in advance for PMs, changeovers, or cleaning. Unplanned downtime is unexpected, from failures and stoppages. They are reduced in different ways: planned downtime by making scheduled work tighter, unplanned downtime by preventing failures before they happen.
Related resources
Turn operational data into decisions
SteelTree connects to the systems already holding your operational data, surfaces what needs attention, explains why it matters, and recommends the next action.