Close Menu
2025-10-28 10:07:31

A Practical Guide to Redundancy in Legacy PLC Systems

The Ticking Clock in Your Control Cabinet

Picture this: an important line stops working in the middle of a big production run. Alarms go off, workers rush around, and they find the problem: one old part of the Programmable Logic Controller (PLC) has broken. The search for a replacement begins, but the original company no longer makes or supports the part. There is no more production. If you work in a place that still uses old control systems, it's not a matter of if something like that will happen, but when. The results can be very bad.

It may seem like a normal problem to have unplanned downtime, but it costs a lot of money. Every year, industrial manufacturers have to deal with about 800 hours of equipment downtime. This costs the industry up to $50 billion a year. These unplanned stops cost the world's biggest companies a shocking 11% of their annual income. This comes to almost $1.4 trillion. The price per hour can get very high. A shocking 93% of businesses say that downtime costs them more than $300,000 per hour, and almost a quarter of them say that lost production costs them more than $5 million per hour. In the fast-paced car industry, a line stoppage costs an incredible $2.3 million per hour.

These numbers are not just numbers; they mean the business has lost money. About 53% of the damage is lost income. The effect also includes lost productivity (47%) and damage to the brand's reputation and customer trust that lasts for a long time (41%). A single average downtime event can cost the company $2 million in lost profits.

It's not easy to figure out how much money a legacy system could cost you. It makes things bigger. When an old part breaks, the already high hourly cost of downtime goes up even more. It can take a long time to find a replacement for an old part. You might have to search the whole world. This can make a repair that should only take a few hours take days or even weeks. That delay makes the financial damage worse. It makes a big problem into one that could put the business out of business. A "find-a-part-when-it-breaks" plan is a big risk that many businesses take, as shown by the fact that there is a special global market for parts that are no longer made.

So, redundancy isn't just a technical thing or a cost for IT. This is a basic plan for keeping your business running and protecting your money. It's about getting rid of single points of failure carefully so that you don't have to pay the huge and measurable costs of a shutdown. The goal is to go from taking emergency action to planned strength, which will turn a weak system into a strong one.

What Does "Redundancy" Actually Mean for Your Plant Floor?

Basically, redundancy is a simple idea: adding extra, backup parts or systems that can take over if something fails. The main goal is to reduce or completely stop downtime and data loss. This removes any single point of failure that could stop a whole process. For PLC systems, there is an important choice to make to get that strength: the choice between hot and cold standby systems.

The Critical Fork in the Road: Hot vs. Cold Standby

A Hot Standby system is the best choice for very important operations. In this setup, a backup system is powered on, running, and always synced with the main system in real-time. Data is always shared over a special high-speed link, often fiber optic, so the backup is always ready to take over right away and automatically if the main system fails. The goal is a "bumpless" transfer, a smooth switchover so fast that the process itself is never stopped. For continuous manufacturing, power generation, or any process where even a few milliseconds of downtime is not okay, hot standby is the best choice.

A Cold Standby system works the opposite way. Here, the backup system is powered down or not active until it's needed. When the main system fails, a cold standby needs a longer time to start up and often needs someone to manually bring the backup online. It has a lower initial cost, but that lower cost comes with a lot of downtime during a failover. A solution like that is only good for non-critical processes where a temporary stop is okay.

A third option, Warm Standby, is in the middle. The backup system is powered on but is only updated sometimes, not all the time. The failover is faster than a cold standby, but it still has a short stop or "bump" as the backup system's data catches up to the live process state.

The lower initial cost of a cold standby system can be attractive but wrong. Those upfront savings are often paid back, with a lot of interest, during the first big shutdown. A hot standby system would have stopped that long period of lost production. This period is like a "downtime tax" that can easily be much bigger than the initial cost difference. A good analysis must look at more than the initial cost and figure out the cost of a single failure. This often shows the hot standby option to be the better financial choice over the system's life.

Also, a hot standby system does more than just stop failures; it really changes how a plant can operate. It allows for "online maintenance." In online maintenance, the main system can be taken offline on purpose for service, upgrades, or testing while the fully synced standby system keeps running the process. That ability changes maintenance from a disruptive, planned-downtime event into a normal, non-disruptive activity. This improves not just uptime but also overall operational flexibility.

Redundancy Strategies at a Glance: Hot vs. Cold Standby

Feature Hot Standby Cold Standby
Takeover Speed Instant, automatic failover Delayed, often needs manual help
Downtime Impact "Bumpless" - very little to zero process stop Big process stop and downtime
Data Synchronization Continuous, real-time None until turned on
Initial Cost Higher Lower
Operational Complexity Higher (ongoing maintenance of active systems) Lower (backup system is not active)
Best For Critical processes (e.g., continuous manufacturing, power generation) Non-critical processes where downtime is acceptable

The Four Pillars of a Resilient Control System

To get true operational strength, you need a complete approach. A redundant CPU is useless if a single power supply failure can shut down the whole rack. A strong system is built on four connected pillars of redundancy. Each one deals with a critical potential point of failure. The strength of the whole system is decided by its weakest, non-redundant link.

Pillar 1: The Brains - CPU Redundancy

This is the most common type of redundancy. It uses two CPUs—a primary and a standby—that run at the same time and share the same I/O system. The scan clocks of the two processors are synced, and the standby unit always checks the primary's health. It gets state table updates at the end of each logic scan. The main goal here is a "bumpless" transfer of control if the primary CPU fails, with no stop to the ongoing process. A smooth switchover needs high-speed data sync between the CPUs. This is usually done over a special communication link that is separate from the main control network.

Pillar 2: The Powerhouse - Power Supply Redundancy

A power failure is a much more common reason for control system downtime than a CPU failure, but it's often forgotten. Power supply redundancy uses two or more power supplies connected together to the PLC rack. If one unit fails, the other smoothly takes over and handles the full electrical load without stopping. This is usually managed with special redundancy modules or diodes. These stop a failed, short-circuited supply from sending power back and damaging the rest of the system. The most common setups are 1+1 (one backup for one primary) or N+1 (one backup for a group of N primary supplies). For the best strength, this plan should go beyond the rack to include Uninterruptible Power Supplies (UPS). These get power from separate electrical circuits to protect against bigger grid problems or outages.

Pillar 3: The Senses - I/O Redundancy

The Input/Output (I/O) modules are the system's connection to the real world of sensors, valves, motors, and actuators. A failure of a single I/O module can be just as bad as a CPU failure, and it could shut down an important part of the process. I/O redundancy means having copies of these modules. If one fails, a backup immediately takes over the checking and control of the connected field devices. Plans can go from having duplicate I/O cards in the same rack to putting them in completely separate racks. For very important inputs, like a tank level transmitter, a strong design might use two separate sensors based on different technologies. Each one would be wired to a different I/O card to protect against failures that affect both.

Pillar 4: The Nervous System - Network Redundancy

Control systems have become more spread out. Because of this, the communication network has become a critical potential point of failure. Network redundancy gives backup data paths to keep connection if a cable is cut or a network switch fails. The choice of network topology—the physical layout of the network—is a basic design choice that shows an organization's comfort with risk.

  • Ring Topology: Devices are connected in a circle. A single break in the ring does not stop communication, because data can just travel the other way around the loop. It's a cheap solution that offers very fast recovery times. It is often managed with protocols like MRP (Media Redundancy Protocol) or DLR (Device Level Ring).
  • Star Topology: All devices connect to a central switch. This is simple, but a failure of that central switch can take down the whole network. A redundant star topology reduces that risk with duplicate central switches and links.
  • Mesh Topology: This setup provides multiple backup paths between devices, and it offers the highest possible level of fault tolerance. But, it is also the most complex and expensive to set up. It is usually only used for the most critical applications.

Protocols like the Spanning Tree Protocol (STP) and its faster version, Rapid Spanning Tree Protocol (RSTP), are used to manage these backup paths. They automatically turn off backup links to stop data loops that could crash the network. They turn them back on if a primary link fails.

The Legacy Challenge: Why Adding Resiliency Isn't Simple

Putting these modern redundancy ideas on old equipment is where the real challenge starts. Legacy systems were often designed for a different time, long before today's standards for high availability and cybersecurity were created. Trying to add strength to these older platforms brings a unique set of problems.

Obstacle 1: The Scarcity of Parts and Knowledge

The very systems that need redundancy the most because of their age and higher chance of failure are, strangely, the hardest to upgrade. As manufacturers stop making product lines, finding the exact-match spare parts needed to build a redundant system becomes a big problem. Facilities are often forced to use a risky and expensive aftermarket of used or rebuilt parts. These parts are not as reliable as new parts and have their own risks. At the same time, the engineers and technicians who first designed and maintained these systems are retiring. They are taking decades of unwritten knowledge with them. That knowledge loss leaves facilities without the people who can effectively fix, change, or correctly add new parts to the old system.

Obstacle 2: The Integration Puzzle - Hardware and Software Incompatibility

You can't always just plug a modern module into an old backplane. Trying to add modern redundancy or communication cards to a legacy PLC rack can be physically impossible because of different connectors, power needs, or communication protocols. The software is an even bigger challenge. Legacy PLC programming software often runs on old and insecure operating systems. The firmware of an old CPU may not have the key features needed to sync with a partner for a hot standby setup. The control logic itself, after years of fixes and changes from different programmers, can become a messy and poorly documented puzzle that is very hard to change safely.

Obstacle 3: The High Cost and Hidden Risks

The small supply of legacy parts naturally makes their price go up on the surplus market. Also, ongoing maintenance and support costs for older platforms are much higher than for modern systems. Besides the direct costs, there are hidden dangers. Legacy systems are big targets for cyber-attacks because they have many known weaknesses and no longer get security updates from the manufacturer. The act of adding new network connections for redundancy can accidentally expose a previously separate and weak system to many new and dangerous threats.

Your Step-by-Step Plan for a Redundancy Upgrade

Handling the difficulties of a legacy redundancy project needs a structured, practical approach. A successful upgrade is not just about buying new hardware; it's a careful process of checking, designing, implementing, and thorough testing.

Step 1: The System Autopsy - Conduct a Thorough Risk Assessment

Before any part is ordered, a deep analysis of the current system is needed to find the weakest links and understand the business effect of their failure. The process starts with a full list of all control system parts: PLC models and firmware versions, I/O modules, power supplies, network switches, and software. From there, find which of these parts are absolutely needed for operation. Which failure would cause the biggest problem? Each critical part should then be checked against key legacy signs: Is it out of vendor support? Are spare parts easy to find? Are there known security weaknesses? The result of this check is a prioritized list of risks. This lets you focus resources where they will have the biggest effect. That check also becomes a powerful financial planning tool. When you measure the potential losses from a specific part failure—for example, a shutdown of a line that makes $500,000 in income per day—you are building the ROI calculation needed to get budget approval. It changes the project from a proactive investment in risk reduction to a clear choice between that and accepting a measurable risk of a huge loss.

Step 2: Blueprint for Resilience - Designing Your Redundancy Strategy

With a clear understanding of the risks, the next step is to design a solution that matches the right redundancy setup to each prioritized need. A single plan for everything is both not efficient and too costly. For the highest-priority systems, a full hot standby CPU and power supply setup will likely be needed. For less critical I/O, a cold standby approach—like a pre-programmed spare module kept on a shelf—might be a good and cheap solution. The design must be complete and cover all four pillars. It must specify redundant power from separate sources, define the network layout, and plan the I/O grouping strategy to reduce the effect of any single failure. For hot standby systems, the design must also describe the high-speed sync link between CPUs and the way to keep them perfectly matched.

Step 3: The Phased Rollout - Implementation Without Total Shutdown

For a live plant, a "big bang" upgrade that needs a total shutdown is often too risky and disruptive. A phased implementation is almost always the better way. This can mean modernizing the system in stages. You could put in the redundant power and network parts first, and then the final CPU swap at a later time. In some cases, it may be possible to build and test the new redundant system next to the old one. You can run both at the same time before a final, scheduled switch. The key is to use planned maintenance shutdowns to do the biggest parts of the upgrade, like controller replacement or major I/O rewiring. This reduces the effect on production.

Step 4: The Fire Drill - Rigorous Failover Testing

A redundancy project is not finished until it has been proven to work. The system must be carefully tested by faking real-world failures before one actually happens. This is the "fire drill" that proves the whole investment. In a controlled setting, the team should fake every possible failure. They can physically unplug the primary power supply, disconnect the primary CPU's network cable, or even force a fault in the primary processor's code.

During each test, key numbers must be measured and checked. How long did the failover take (Recovery Time Objective, or RTO)? Was any process data lost (Recovery Point Objective, or RPO)? Did the process have an unacceptable "bump"? A successful test is the final proof that the team not only installed the hardware correctly but that they deeply understand the process they are controlling. A failed test is not a problem. It's an important learning chance that shows hidden connections and wrong ideas in a safe place. This stops a much more dangerous and costly failure later. Finally, all failover and recovery steps must be written down. Maintenance staff must be trained on how to identify a failover event and properly replace a failed part without causing another shutdown.

From Fragile to Fortified

There is no time to be lazy, especially when using old legacy control systems, because unplanned downtime can cost a lot of money. It's not an accident that you have real operational strength. It's the result of a planned, full strategy that adds redundancy to the four main parts of the control system: the CPU, power supplies, I/O, and communication network.

Getting the newest technology just because it's new isn't what modernizing a legacy system with redundancy is all about. It is a planned, strategic decision to keep valuable and well-known manufacturing processes safe from the flaws that come with old hardware. It is an investment that helps important tools last longer. It also turns a weak system that is always one part failure away from disaster into a strong and dependable one. A good redundancy project lays the groundwork for future upgrades. It is a very important step toward making an industrial operation that is more stable, dependable, and profitable. You need to take charge of your operational future instead of letting a broken part decide it.

Beyond strategic redundancy, long-term reliability also depends on the availability of critical replacement parts. This is where Amikon plays a key role. With its extensive inventory of discontinued PLC modules and automation components, Amikon helps manufacturers minimize maintenance costs, restore system performance, and extend the operational life of legacy control systems. By combining parts supply with technical support and fast delivery, Amikon provides the industry’s leading response time, helping facilities stay productive even when original equipment is no longer supported.

Partner with Amikon Today to Keep Your Control Systems Strong, Efficient, and Future-Ready.

Keep your system in play!

Select
ABB
Accutrac
Acopian
AC Tech
Action Instruments
Adam
Adaptec
Advance
Advanced Input Devices
Advanced Micro Controls
AEG
AIS
Alcatel
Allen-Bradley
Allied Telesis
3M
Alstom
AMCI
Antex Electronics
Apparatebau Hundsbach
Array Electronic
Asea
ASTEC
Automation Direct
Aydin Controls
B&R
Balluff
Banner Engineering
Barco Sedo
Bartec
BECK
Beier
Beijer Electronics
Bently Nevada
Berthel
Bestobell Mobrey
Bierrebi
Biviator
Black Box
Block
Bofors Electronik
Bosch
Braun
Bürkert
BURLE
Canary
Carroll Touch
CEAG
3COM
Comat
Conrac
Controlon
Cooper Bussmann
Cooper Crouse-Hinds
Copes Vulcan
Crompton
Crouzet
Control Techniques
CTI-Control Technology Inc
Custom Servo Motors
Cutler-Hammer
Danfoss
Daniel Woodhead
DEC - Digital Equipment Corp
Delta Computer Systems
Delta Electronics
Devol
DGD Gardner Denver
DIA Electronic
DIGI
Digital
Digitronics
Durag
Dynapar
EATON
EBELT
Eberle
Echelon
E. Dold & Söhne - DOLD
EES Elelkra Elektronik
EIL
eka Technik
Elecktro-Automatik
Electronics Development Corp – EDC
Eletec Elektronic
Elliot Automation
Elographics
Emerson
e-motion
Endress Hauser
Entrelec Schiele
EPIC Data
ERMA
ERO Electronic
EtherCom
ESD
ESS Störcontroller
ETSI - Electronic Technology Systems
Eurotherm
Fanuc
Farnell
FEAS
Festo
Finder Varitec
Fischer Porter
Forney Engineering
FOTEK
Fuji Electric
Galil Motion Control
General Electric
Gildemeister
Gordos
Grapha Electronic
Grayhill
Grenzebach Electronics
Harting
Hawa
Hedin Tex
HEIDENHAIN
Helmholz
Herren Electronics
Hex Valve – Richards
HIMA
Hirschmann
Hitachi
Hitex
HK Systems
Honeywell
Horner - FACTS
Hüller Hille
iba
IBHsoftec
IBM
idec
IDS
IFM Electronic
INAT
INIVEN
Intel
Invensys
IPF Electronic
IRT SA
ISSC
ITT North Power Systems
Jameco ReliaPro
JAQUET
Jetter AG
JH Technology
Kent
Kent Industrial
KEPCO
Kettner
Kieback & Peter
Kingston Technology
Klockner Moeller
Kniel
Köster Systemtechnik
Koyo
Krauss Maffei
Kuhnke
Lambda
Landis Gyr
Lauer
L&N - Leeds & Northrup
Lenze
Leukhardt Systems
LG GoldSec
Liebherr
Littlefuse
Lumberg
Lutze
Magnecraft
Mannesmann
Matric Ltd
Matsushita
MDB Systems
Mean Well
Measurement Systems
Measurex
MEDAR
Micro Innovation AG
Micron Control Transformers
Mitsubishi
Molex
Moog
MSC Tuttlingen
MTL Insturments Group
MTS
Murr Elektronik
Myers Power Products
NAIS
Nandi Powertronics
NEC
Netstal
Neumann
Niobrara R&D
Nobel Elektronik
Omega Engineering
Omron
Opto 22
Orbitran Systems
PANALARM
Penril Datability Networks
Pepperl + Fuchs
Pester
Philips
Phoenix Contact
Pilz
Plasma
Plüth Energietechnik
Potter & Brumfield
Ramsey Engineering
Red Lion
Reis Robotics
Reliance Electric
Rexroth
Rinck Electronic
RIS - Rochester
RMP
Robust Data Comm
Ronan
RWT
SAE Elektronik
SAIA
SATT Control
Sauter
Schad SinTec
Schaffner
Shawmut - Gould/Ferraz
Schiele
Schildknecht
Schiller Electric
Schleicher
Schleuniger AG
Schlicht + Küchenmeister
Schlumberger
Schneider Electric
Schrack Technik
SCM PC-Card
Selectron
Sensycon
SEW
Sigma Information Systems
Sixnet
SOHARD
Sorcus
Spectrum Controls
Sprecher + Schuh
SPS Technologies
Square D
Stahl
Standard Microsystems
STI - Scientific Technologies, Inc.
Stromberg
Struthers-Dunn
SUTRON Electronic
SYNATEC Electronic
Syslogic
SysMik
Taylor
Tecnint HTE
Telemecanique
Tillquest
Timonta
Toshiba
Transition Networks
TR Electronic
Uhlmann
Unicomp
UniOP
United Sciences
VAHLE
Van Dorn
Vibro-Meter
VIPA
Visolux
Wachendorff Advantech
Wago
Walcher
Weber
Weidmuller
Wenglor
Westronics
Wieland
Wöhrle
Wolf
Woodward
Würth Elektronik
Yokogawa
Zebra Technologies
Ziehl-Abegg
Zollner
Xycom
Epro
bachmann
Saftronics
Siemens
KEB
Opti Mate
Arista
Sanki
Daiei Kogyosha
Brooks CTI-Cryogenics
MKS
Matrix
Motortronics
Metso Auttomation
ProSoft
Nikki Denso
K-TEK
Motorola VME
Force Computers Inc
Berger Lahr
ICS Triplex
Sharp PLC
YASKAWA
SCA Schucker
Grossenbacher
Hach
Meltal
Bremer
Molex Woodhead
Alfa Laval
Siemens Robicon
Perkins
Proface
Supcon
Carlo Gavazzi
DEA
SST
Hollysys
SOLIDSTATE CONTROLS
ETEK
OPTEK
KUKA
WHEDCO
indramat
Miscellaneous Manufacturers
TEKTRONIX
Rorze
DEIF
SIPOS
TICS TRIPLEX
SHINKAWA
ANYBUS
HVA
GERMAN POWER
KONTRON
ENTEK
TEL
SYSTEM
KOLLMORGEN
LAZER
PRECISION DIGITAL
LUBRIQUIPINC
NOKIA
SIEI-Gefran
MSA AUER MUT
KEBA
ANRITSU
DALSA
Load Sharer
SICK
Brad
SCHENCK
STAIGER MOHILO
ENTERASYS
USB-LG
TRS
BIOQUELL
SCHMERSAL
CORECO
KEYENCE
BIZERBA
BAUERBAUER
CONTROL
PACIFIC SCIENTIFIC
APPLIED MATERIALS
NMB
NI
Weishaupt
Weinview
CISCO
PARKER
Lenovo
KONECRANES
TURBUL
HMS
HOFFMAN
HUTTINGER
TDK-Lambda
RESOLVER
Knick
ATLAS
GAMX
TDK
CAMERON
NSK
Tamagawa
GIDDINGS & LEWIS
BENDER
SABO
WOODHEAD
FRICK YORK
SHENLER
BALDOR
Lam Research
NTN BEARING
ETA
WEST INSTRUMENTS
TDK-Lambda
SMC
Fireye
DAHUA
TESCH
ACROSSER
FLUKE
Sanyo Denki
Bruel & Kjaer
EPSON
HIOKI
Mettler Toledo
RAYTEK
EPCOS
DFI
SEMIKRON
Huawei
INDUSTRONIC
ASI-HVE
BARTEC POLARIS
AMAT
GD Bologna
Precise Automation
RADISYS
ZEISS 
Reveal Imaging
Saiernico
ASEM
ASEM
Advantech
ANSALDO
ELpro
MARCONI
EBMPAPST
ROTORK
KONGSBERG
SOCAPEL
TAIYO
SUN
York
KURODA
ADLINK
Notifier
HBM
Infineon
LNIC
Saipwell
JIANGYIN ZHONGHE
W.E.ST. Elektronik
EXPO
DEEP SEA ELECTRONICS
BECKHOFF
BOMBARDIER TRANSPORTATION
Drager
ZENTRO ELEKTRONIK
ATOS
TRSystemtechnik
JDS Uniphase
ADEPT
REO
Get Parts Quote
Newsroom

Related articles Browse All