Picture this: a critical line grinds to a halt in the middle of a major production run. Alarms sound, technicians scramble, and the culprit is found: an aging module in the Programmable Logic Controller (PLC) has failed. The hunt for a replacement begins, only to reveal that the original manufacturer no longer makes or supports the part. Production stops dead. If your facility still runs on legacy control systems, a scenario like this is not a matter of if, but when, and the consequences can be severe.
Unplanned downtime may sound like a routine nuisance, but its price tag is enormous. Industrial manufacturers absorb roughly 800 hours of equipment downtime every year, costing the industry up to $50 billion annually. For the world's largest companies, these unplanned stops consume a staggering 11% of annual revenue, nearly $1.4 trillion in total. The hourly cost climbs quickly: 93% of businesses report that downtime costs them more than $300,000 per hour, and nearly a quarter put their lost production above $5 million per hour. In the fast-moving automotive industry, a single line stoppage runs an estimated $2.3 million per hour.
These figures translate directly into business losses. Roughly 53% of the damage is lost revenue, 47% is lost productivity, and 41% is lasting harm to brand reputation and customer trust. A single average downtime event can erase $2 million in profit.
The hidden cost of a legacy system is that it acts as a multiplier. When an aging component fails, the already steep hourly cost of downtime keeps accruing while you hunt, sometimes worldwide, for a replacement that is no longer manufactured. A repair that should take hours can stretch into days or weeks, and that delay compounds the financial damage, turning a serious incident into a potentially business-ending one. The very existence of a global aftermarket for discontinued parts shows how many businesses are gambling on a find-a-part-when-it-breaks strategy.
Redundancy, then, is not a technical nicety or an IT expense. It is a fundamental strategy for business continuity and financial protection: the deliberate elimination of single points of failure so you never pay the enormous, measurable cost of a shutdown. The goal is to move from emergency response to planned resilience, turning a fragile system into a robust one.
At its core, redundancy is a simple idea: add backup components or systems that can take over when something fails. The aim is to minimize or eliminate downtime and data loss by removing any single point of failure that could halt an entire process. For PLC systems, achieving that resilience starts with one key decision: the choice between hot and cold standby.
A hot standby system is the gold standard for critical operations. The backup is powered on, running, and continuously synchronized with the primary in real time. Data is exchanged over a dedicated high-speed link, often fiber optic, so the standby can take over instantly and automatically if the primary fails. The goal is a "bumpless" transfer: a switchover so fast and seamless that the process itself never notices. For continuous manufacturing, power generation, or any process where even milliseconds of downtime are unacceptable, hot standby is the right choice.
A cold standby system takes the opposite approach. The backup remains powered down or inactive until it is needed; when the primary fails, it must be started up, often manually, before it can take over. The initial cost is lower, but that saving is paid for with significant downtime during every failover. Cold standby is appropriate only for non-critical processes that can tolerate a temporary stop.
A third option, warm standby, sits in between. The backup is powered on but synchronized only periodically rather than continuously. Failover is faster than cold standby, but there is still a brief interruption, a "bump," while the backup catches up to the live process state.
The lower initial cost of cold standby can be deceptive. Those upfront savings are often repaid, with heavy interest, during the first major shutdown that a hot standby system would have prevented. That long stretch of lost production is a "downtime tax" that can easily dwarf the difference in purchase price. A sound analysis must look past the initial cost to the cost of a single failure, and it often shows hot standby to be the better financial choice over the system's life.
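The "downtime tax" argument can be made concrete with a simple lifetime-cost calculation. The figures below are purely illustrative assumptions (hypothetical hardware prices, failure counts, and recovery times), not vendor pricing, but the structure of the comparison is the point: multiply expected failures by recovery time and hourly cost, and the cheaper system rarely stays cheaper.

```python
# Hypothetical total-cost comparison of hot vs. cold standby over a system's
# life. All dollar figures and failure counts are illustrative assumptions.

def lifetime_cost(initial_cost, failures_expected, downtime_hours_per_failure,
                  downtime_cost_per_hour):
    """Hardware cost plus expected downtime losses over the system's life."""
    return initial_cost + (failures_expected
                           * downtime_hours_per_failure
                           * downtime_cost_per_hour)

DOWNTIME_COST_PER_HOUR = 300_000   # conservative end of the survey figures above
FAILURES_OVER_LIFE = 2             # assume two major failures over the lifespan

# Hot standby: higher purchase price, near-instant failover (~6 min assumed).
hot = lifetime_cost(250_000, FAILURES_OVER_LIFE, 0.1, DOWNTIME_COST_PER_HOUR)
# Cold standby: cheaper up front, but assume 8 hours to restore per failure.
cold = lifetime_cost(100_000, FAILURES_OVER_LIFE, 8.0, DOWNTIME_COST_PER_HOUR)

print(f"Hot standby lifetime cost:  ${hot:,.0f}")   # $310,000
print(f"Cold standby lifetime cost: ${cold:,.0f}")  # $4,900,000
```

Even with only two failures over the system's life, the assumed cold-standby recovery time swamps the $150,000 saved at purchase.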
A hot standby system does more than prevent failures; it changes how a plant can operate. It enables online maintenance: the primary can be deliberately taken offline for service, upgrades, or testing while the fully synchronized standby keeps the process running. Maintenance shifts from a disruptive, planned-downtime event to a routine, non-disruptive activity, improving not just uptime but overall operational flexibility.
| Feature | Hot Standby | Cold Standby |
| --- | --- | --- |
| Takeover Speed | Instant, automatic failover | Delayed, often needs manual help |
| Downtime Impact | "Bumpless" - very little to zero process stop | Big process stop and downtime |
| Data Synchronization | Continuous, real-time | None until activated |
| Initial Cost | Higher | Lower |
| Operational Complexity | Higher (ongoing maintenance of active systems) | Lower (backup system is not active) |
| Best For | Critical processes (e.g., continuous manufacturing, power generation) | Non-critical processes where downtime is acceptable |
True operational resilience requires a complete approach. A redundant CPU is useless if a single power supply failure can take down the whole rack. A robust system rests on four interconnected pillars of redundancy, each addressing a critical potential point of failure, and the system as a whole is only as strong as its weakest non-redundant link.
CPU redundancy is the most common form. Two CPUs, a primary and a standby, run simultaneously and share the same I/O system. Their scan clocks are synchronized, and the standby continuously monitors the primary's health, receiving state table updates at the end of each logic scan. The objective is a "bumpless" transfer of control if the primary fails, with no interruption to the running process. That seamless switchover depends on high-speed data synchronization between the CPUs, usually over a dedicated communication link separate from the main control network.
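The supervision logic described above, state snapshots at the end of each scan plus a heartbeat watchdog, can be sketched as follows. This is a minimal model of the behavior only; class and method names are hypothetical, and a real system implements this in vendor-specific redundancy firmware over a dedicated sync link.

```python
# Minimal sketch of hot-standby supervision: the standby mirrors the primary's
# state table at the end of every scan and takes over when heartbeats stop.
# Names (StandbyCPU, on_scan_complete, ...) are illustrative, not a real API.

class StandbyCPU:
    def __init__(self, heartbeat_timeout_scans=3):
        self.state_table = {}     # mirror of the primary's process state
        self.missed = 0           # consecutive scans without a heartbeat
        self.timeout = heartbeat_timeout_scans
        self.active = False       # True once this CPU has taken control

    def on_scan_complete(self, snapshot):
        """Called over the sync link at the end of each primary logic scan."""
        self.state_table = dict(snapshot)
        self.missed = 0

    def on_scan_tick(self, heartbeat_seen):
        """Called once per expected scan period by the standby's watchdog."""
        if heartbeat_seen:
            self.missed = 0
        elif not self.active:
            self.missed += 1
            if self.missed >= self.timeout:
                # Bumpless takeover: resume from the last synchronized state.
                self.active = True

standby = StandbyCPU()
standby.on_scan_complete({"tank_level": 71.4, "pump_run": True})
for _ in range(3):                       # primary goes silent for three scans
    standby.on_scan_tick(heartbeat_seen=False)
print(standby.active, standby.state_table["pump_run"])  # True True
```

The timeout of a few scans, rather than one, is the usual design trade-off: it avoids spurious takeovers on a single lost heartbeat while keeping failover within milliseconds of scan time.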
A power failure is a far more common cause of control system downtime than a CPU failure, yet it is often overlooked. Power supply redundancy connects two or more supplies in parallel to the PLC rack; if one unit fails, the other seamlessly picks up the full electrical load without interruption. This is typically managed with dedicated redundancy modules or diodes, which prevent a failed, short-circuited supply from back-feeding and damaging the rest of the system. The most common configurations are 1+1 (one backup per primary) and N+1 (one backup for a group of N supplies). For maximum resilience, the plan should extend beyond the rack to Uninterruptible Power Supplies (UPS) fed from separate electrical circuits, protecting against wider grid disturbances or outages.
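The sizing rule behind 1+1 and N+1 is easy to state but easy to get wrong in practice: the remaining supplies must carry the full load after any single unit fails. A quick sanity check (with hypothetical wattages) looks like this:

```python
# N+1 sizing check: the rack must survive the loss of any single supply.
# Ratings and loads below are hypothetical example values.

def n_plus_one_ok(supply_rating_w, num_supplies, load_w):
    """True if the remaining supplies cover the load after one unit fails."""
    return (num_supplies - 1) * supply_rating_w >= load_w

print(n_plus_one_ok(150, 2, 140))  # True: one 150 W unit carries the rack alone
print(n_plus_one_ok(150, 2, 200))  # False: a "redundant" pair that isn't
```

The second case is a classic trap: two supplies sharing a 200 W load look redundant on the panel drawing, but neither can carry the rack by itself, so the redundancy is illusory.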
The Input/Output (I/O) modules are the system's connection to the physical world of sensors, valves, motors, and actuators. The failure of a single I/O module can be just as damaging as a CPU failure, shutting down an important part of the process. I/O redundancy means duplicating these modules so that if one fails, a backup immediately takes over monitoring and control of the connected field devices. Approaches range from duplicate I/O cards in the same rack to cards placed in completely separate racks. For highly critical inputs, such as a tank level transmitter, a robust design might use two independent sensors based on different technologies, each wired to a different I/O card, to guard against common-mode failures that would affect both.
As control systems have become more distributed, the communication network has become a critical potential point of failure. Network redundancy provides backup data paths that preserve connectivity if a cable is cut or a network switch fails. The choice of network topology, the physical layout of the network, is a fundamental design decision that reflects an organization's risk tolerance.
Protocols such as the Spanning Tree Protocol (STP) and its faster successor, the Rapid Spanning Tree Protocol (RSTP), manage these backup paths. They automatically block redundant links to prevent the data loops that could crash the network, and re-enable them when a primary link fails.
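What RSTP accomplishes can be illustrated with a toy model. The sketch below is not the actual RSTP algorithm (there are no bridge IDs, BPDUs, or port roles); it only demonstrates the core idea: in a redundant topology such as a switch ring, one link is held in a blocking state to break the loop, and when an active link fails, a recomputation brings the blocked link into service.

```python
# Toy illustration of spanning-tree path management on a three-switch ring.
# NOT the real RSTP algorithm; it only shows the active/blocked link idea.

from collections import deque

def active_links(switches, links, root):
    """Return the loop-free subset of links: a spanning tree rooted at `root`."""
    neighbors = {s: [] for s in switches}
    for a, b in links:
        neighbors[a].append(b)
        neighbors[b].append(a)
    seen, tree, queue = {root}, set(), deque([root])
    while queue:                      # breadth-first search from the root
        node = queue.popleft()
        for nxt in neighbors[node]:
            if nxt not in seen:
                seen.add(nxt)
                tree.add(frozenset((node, nxt)))
                queue.append(nxt)
    return tree

switches = ["SW1", "SW2", "SW3"]
ring = [("SW1", "SW2"), ("SW2", "SW3"), ("SW3", "SW1")]   # ring topology

tree = active_links(switches, ring, root="SW1")
blocked = {frozenset(l) for l in ring} - tree
print(len(tree), len(blocked))        # 2 1  (two active links, one blocked)

# Simulate a cable cut on an active link: after recomputation, the previously
# blocked link is now part of the active tree, so connectivity survives.
cut = next(iter(tree))
remaining = [l for l in ring if frozenset(l) != cut]
print(blocked <= active_links(switches, remaining, root="SW1"))  # True
```

In a real network, RSTP performs this reconvergence automatically, typically within a few seconds or less, which is exactly why the "extra" cable in a ring is worth installing and deliberately blocking.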
Applying these modern redundancy concepts to old equipment is where the real challenge begins. Legacy systems were designed for a different era, long before today's standards for high availability and cybersecurity existed, and retrofitting resilience onto them brings a unique set of problems.
Paradoxically, the systems that need redundancy most, because of their age and rising failure rates, are the hardest to upgrade. As manufacturers discontinue product lines, sourcing the exact-match spares needed to build a redundant configuration becomes a serious obstacle, forcing facilities into a risky and expensive aftermarket of used or refurbished parts that carry reliability risks of their own. At the same time, the engineers and technicians who originally designed and maintained these systems are retiring, taking decades of undocumented knowledge with them. That loss leaves facilities without the people who can safely repair, modify, or extend the old system.
You cannot always plug a modern module into an old backplane. Adding modern redundancy or communication cards to a legacy PLC rack can be physically impossible because of incompatible connectors, power requirements, or communication protocols. The software side is an even bigger challenge: legacy PLC programming tools often run only on outdated, insecure operating systems, and an old CPU's firmware may lack the key features needed to synchronize with a hot standby partner. The control logic itself, after years of patches and changes by different programmers, can become a tangled, poorly documented puzzle that is hazardous to modify.
The scarcity of legacy parts naturally drives up their prices on the surplus market, and ongoing maintenance and support for older platforms costs far more than for modern systems. Beyond the direct costs lie hidden dangers: legacy systems are prime targets for cyber-attacks, carrying many known vulnerabilities and no longer receiving security updates from the manufacturer. Adding new network connections for redundancy can inadvertently expose a previously isolated, vulnerable system to a host of new threats.
Navigating the difficulties of a legacy redundancy project requires a structured, practical approach. A successful upgrade is not just a hardware purchase; it is a careful process of assessment, design, implementation, and rigorous testing.
Before any part is ordered, conduct a thorough analysis of the current system to find its weakest links and understand the business impact of their failure. Start with a complete inventory of control system components: PLC models and firmware versions, I/O modules, power supplies, network switches, and software. Then identify which components are truly critical to operation: whose failure would cause the biggest disruption? Evaluate each critical component against the key legacy indicators: Is it out of vendor support? Are spare parts readily available? Are there known security vulnerabilities? The result is a prioritized risk list that lets you focus resources where they will have the greatest effect. This assessment also becomes a powerful financial planning tool. When you quantify the potential loss from a specific component failure, say, the shutdown of a line producing $500,000 in revenue per day, you are building the ROI case needed to win budget approval. The project stops being an abstract expense and becomes a clear choice between a proactive investment in risk reduction and the acceptance of a measurable risk of enormous loss.
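The prioritization step above lends itself to a simple scoring model. The sketch below is a hypothetical example, the weights, field names, and inventory entries are illustrative choices, not a standard methodology, but it shows how the legacy indicators and the downtime cost can be combined into a single rankable number.

```python
# Hypothetical risk-ranking sketch for the assessment step. Weights, field
# names, and the sample inventory are illustrative assumptions only.

def risk_score(part):
    """Higher score = higher priority for redundancy investment."""
    score = 0
    if part["vendor_support_ended"]:
        score += 3                     # no fixes or firmware coming
    if not part["spares_available"]:
        score += 3                     # a failure means a global parts hunt
    if part["known_vulnerabilities"]:
        score += 2                     # unpatched attack surface
    # Cost factor: one point per $100k/hour of downtime, capped at 5.
    score += min(part["downtime_cost_per_hour"] // 100_000, 5)
    return score

inventory = [
    {"name": "Line 1 CPU",  "vendor_support_ended": True,  "spares_available": False,
     "known_vulnerabilities": True,  "downtime_cost_per_hour": 500_000},
    {"name": "Utility PLC", "vendor_support_ended": False, "spares_available": True,
     "known_vulnerabilities": False, "downtime_cost_per_hour": 20_000},
]

for part in sorted(inventory, key=risk_score, reverse=True):
    print(f"{part['name']}: risk {risk_score(part)}")
# Line 1 CPU: risk 13
# Utility PLC: risk 0
```

Even a crude model like this makes the budget conversation concrete: the scores map directly onto which systems justify hot standby and which can live with a spare on the shelf.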
With the risks clearly understood, the next step is to design a solution that matches the right redundancy configuration to each prioritized need. A one-size-fits-all plan is both inefficient and overpriced. The highest-priority systems will likely justify a full hot standby CPU and power supply configuration, while for less critical I/O, a cold standby approach, such as a pre-programmed spare module kept on the shelf, may be perfectly adequate and far cheaper. The design must cover all four pillars: redundant power from separate sources, the network topology, and an I/O grouping strategy that limits the impact of any single failure. For hot standby systems, it must also specify the high-speed synchronization link between CPUs and the mechanism that keeps them perfectly matched.
For a live plant, a "big bang" upgrade requiring a total shutdown is usually too risky and disruptive; a phased implementation is almost always the better path. That can mean modernizing in stages, installing the redundant power and network components first and saving the CPU swap for later. In some cases, the new redundant system can be built and tested alongside the old one, with both running in parallel until a final, scheduled cutover. The key is to use planned maintenance shutdowns for the largest steps, such as controller replacement or major I/O rewiring, to minimize the impact on production.
A redundancy project is not finished until it has been proven to work. Before a real failure strikes, the system must be rigorously tested by simulating one. This is the "fire drill" that validates the entire investment. In a controlled setting, the team should simulate every plausible failure: physically pulling the primary power supply, disconnecting the primary CPU's network cable, or even forcing a fault in the primary processor's code.
During each test, the key metrics must be measured and verified. How long did the failover take (Recovery Time Objective, or RTO)? Was any process data lost (Recovery Point Objective, or RPO)? Did the process suffer an unacceptable "bump"? A passing test is the final proof that the team not only installed the hardware correctly but deeply understands the process it controls. A failed test is not a setback; it is a valuable lesson that exposes hidden dependencies and faulty assumptions in a safe environment, preventing a far more dangerous and costly failure later. Finally, all failover and recovery procedures must be documented, and maintenance staff trained to recognize a failover event and replace a failed component without triggering another shutdown.
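The bookkeeping for such a drill is simple but worth formalizing: record when the fault was injected, when the standby took control, and when the last successful state sync completed, then derive RTO and RPO and compare them to targets. The sketch below uses hypothetical timestamps and targets standing in for values pulled from a real system's event log.

```python
# Sketch of failover-drill evaluation. Timestamps and targets are hypothetical
# example values; in practice they come from the control system's event log.

def evaluate_failover(fault_time, takeover_time, last_sync_time,
                      rto_target_s, rpo_target_s):
    """Derive RTO/RPO from drill timestamps and check them against targets."""
    rto = takeover_time - fault_time   # how long the process was uncontrolled
    rpo = fault_time - last_sync_time  # how much process state could be lost
    return {"rto_s": rto, "rpo_s": rpo,
            "passed": rto <= rto_target_s and rpo <= rpo_target_s}

# Example drill: fault injected at t=100.00 s, standby active at t=100.04 s,
# last state sync completed at t=99.99 s. Targets: RTO <= 0.1 s, RPO <= 0.05 s.
result = evaluate_failover(100.00, 100.04, 99.99,
                           rto_target_s=0.1, rpo_target_s=0.05)
print(result["passed"])  # True
```

Recording every drill this way builds a trend line: if RTO creeps upward from one annual test to the next, the redundancy scheme is degrading long before it actually fails.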
With legacy control systems, where unplanned downtime carries a staggering price, there is no room for complacency. Real operational resilience is not an accident; it is the product of a deliberate, comprehensive strategy that builds redundancy into the four core parts of the control system: the CPU, the power supplies, the I/O, and the communication network.
Modernizing a legacy system with redundancy is not about chasing the newest technology for its own sake. It is a planned, strategic decision to protect valuable, well-understood manufacturing processes from the vulnerabilities of aging hardware. It is an investment that extends the life of critical assets and transforms a fragile system, always one component failure away from disaster, into a robust and dependable one. A well-executed redundancy project also lays the groundwork for future upgrades, a crucial step toward a more stable, reliable, and profitable industrial operation. Take charge of your operational future instead of letting a broken part decide it.
Beyond strategic redundancy, long-term reliability also depends on the availability of critical replacement parts. This is where Amikon plays a key role. With its extensive inventory of discontinued PLC modules and automation components, Amikon helps manufacturers minimize maintenance costs, restore system performance, and extend the operational life of legacy control systems. By combining parts supply with technical support and fast delivery, Amikon provides the industry’s leading response time, helping facilities stay productive even when original equipment is no longer supported.
Partner with Amikon Today to Keep Your Control Systems Strong, Efficient, and Future-Ready.


Copyright Notice © 2004-2024 amikong.com All rights reserved
Disclaimer: We are not an authorized distributor or representative of the manufacturers of the products on this website. Products may have older date codes or be from an older series than those available directly from the factory or authorized dealers. Because our company is not an authorized distributor of these products, the original manufacturer's warranty does not apply. While many DCS and PLC products will have firmware already installed, our company makes no representation as to whether a DCS or PLC product will or will not have firmware and, if it does, whether the firmware is the revision level that you need for your application. Our company also makes no representations as to your ability or right to download or otherwise obtain firmware for the product from our company, its distributors, or any other source, nor as to your right to install any such firmware on the product. Our company will not obtain or supply firmware on your behalf. It is your obligation to comply with the terms of any End-User License Agreement or similar document related to obtaining or installing firmware.