1. Description of the problem and scope of application
This guide is intended for diagnosing and troubleshooting communication failures of programmable logic controllers (PLCs) in industrial automation field networks. Typical symptoms include: complete or intermittent loss of communication with one or more network devices, I/O errors, slow or unstable control system operation, unplanned shutdowns of production processes, and communication error messages on operator panels (HMIs) or in PLC syslogs. The manual covers diagnostics of the most common industrial Ethernet-based protocols such as PROFINET, EtherNet/IP, as well as the serial protocol Modbus RTU/TCP.
Applicable equipment: PLCs from various manufacturers (eg, Siemens, Rockwell Automation, Schneider Electric), distributed I/O systems, industrial network switches, frequency converters, servo drives, sensors, actuators with field network support, as well as passive network components (cables, connectors, terminators).
Classification of severity of malfunctions:
- Critical: Complete loss of communication with the PLC or key devices, resulting in an emergency shutdown of the production line or significant financial losses. Requires immediate intervention.
- Significant: Intermittent communication failures causing unstable hardware, reduced performance, or frequent but short-term shutdowns. Requires urgent diagnosis.
- Minor: Sporadic communication errors that do not directly affect the production process, but may indicate initial problems or network degradation. Scheduled diagnostics is recommended.
2. Safety measures
CAUTION: SAFETY!
- Lockout and Tagout (LOTO): Before performing any work that requires tampering with electrical or mechanical parts of the equipment, you MUST apply lockout and tagout procedures (DSTU EN 1037, ISO 14118). Make sure there is no voltage using proven measuring tools.
- Electrical safety: Work with electrical equipment must be performed only by qualified personnel in accordance with the requirements of NPAOP 40.1-1.21-98. Always assume electrical circuits are live until proven otherwise.
- Personal protective equipment (PPE): Always use appropriate PPE (gloves, safety glasses, safety shoes, protective clothing) according to the workplace risk assessment (DSTU EN 340, DSTU EN 388, DSTU EN 166).
- Residual Energy: After the power is turned off, certain components (such as capacitors in power supplies, pneumatic or hydraulic accumulators) can store dangerous energy. Wait for complete discharge or reset of energy before starting work.
- Hot surfaces: Some components (eg PLC modules, power supplies) may have hot surfaces that can cause burns. Be careful.
3. Necessary diagnostic tools
The following set of tools is required for effective diagnosis of PLC communication failures:
| Tool | Specification/Model | Measuring range/Function | Purpose |
|---|---|---|---|
| Industrial network analyzer | Fluke LinkRunner, WireShark (with appropriate adapter), PLC manufacturer tools (eg Siemens PRONETA, Rockwell BOOTP/DHCP Server) | Traffic analysis, measurement of jitter, delay, packet loss, network topology | Detection of collisions, damaged packets, network congestion, incorrect IP addresses, device malfunctions. |
| Cable tester for Ethernet | Fluke CableIQ, IDEAL Networks LanTEK III, Class D/E/EA testers | Checking the integrity, wiring map, cable length, presence of breaks, short circuits, cross-talks, resistance. For optics: attenuation measurement, OTDR. | Identification of physical damage to copper or fiber optic cables, verification of compliance with standards (for example, ISO/IEC 11801). |
| Digital multimeter | FLUKE 17X/87V, Kyoritsu 1012/1021R (with integrity and resistance test function) | Voltage (AC/DC) up to 1000 V, resistance up to 50 MΩ, integrity check (beep), diode test. | Checking the presence of power on devices, measuring the resistance of bus terminators (for example, Modbus RTU), diagnosing breaks. |
| Thermographic camera | FLIR E-Series, Testo 87X (with a sensitivity of 0.05 °C) | Temperature range from -20°C to +350°C, detection of temperature anomalies | Detection of overheating of network components, PLC modules, power supply units, which may indicate a malfunction. |
| Laptop with PLC software | Windows 10/11, TIA Portal (Siemens), Studio 5000 (Rockwell), Unity Pro/EcoStruxure (Schneider) | Access to PLC programs, status monitoring, diagnostic utilities, firmware updates. | Checking program logic, module status, network settings, system logs. |
| Oscilloscope (preferably with isolated channels) | Tektronix TBS1000, Rohde & Schwarz RTB2000 (with 100 MHz bandwidth) | Visualization of electrical signals, analysis of noise, interference, integrity of data packets. | Detailed analysis of the physical level of communication, especially for the detection of high-frequency interference. |
4. Initial evaluation checklist
Before starting a detailed diagnosis, perform the following steps to collect initial information and visual assessment:
| Item | action | Record/Description |
|---|---|---|
| 1. Visual overview | Check all cables, connectors, status indicators (LEDs) on PLCs, switches, I/O devices. | Is there visible damage to the cables? Are the connectors tight? What is the status of the Link/Activity/Error indicators? |
| 2. Terms of use | Record the ambient temperature, humidity level, presence of vibrations, aggressive substances. | Do the conditions match the hardware specifications (eg EN 61131-2)? |
| 3. Recent Changes | Find out if there have been recent network configuration changes, hardware replacements, software updates, or mechanical work nearby. | When was the last time the system ran smoothly? What was changed? |
| 4. Event/crash logs | View syslogs of PLCs, HMIs, industrial switches. | What error messages are there? Date and time of occurrence? Repetition rate? |
| 5. Power status | Check the power indicators on all devices involved in the communication. | Are all devices turned on and receiving stable power? |
| 6. Network topology | Check the current network diagram (if available) or draw one. | Determine which devices are on the same network segment. |
| 7. Direct communication check (Ping) | If possible, try ping to the problematic device from your laptop or PLC. |
Is there an answer? What is the response time? Is there packet loss? |
5. Systematic diagnostic algorithm
The following algorithm will help to consistently identify the source of the problem:
- Symptom: No communication with one or more devices on the network.
- Checking the status indicators:
- If the power (PWR) or status (STATUS/RUN) indicators of the device are off or flashing red:
- Check the power source of the device (voltage, fuses, connections).
- If the power is OK but the status indicators are faulty: Probable cause: device malfunction. Go to Device Fault Diagnosis.
- If the device's power and status indicators are normal, but the communication indicators (LINK/ACT/COMM) are off or flashing red:
- Go to step 1b.
- If the power (PWR) or status (STATUS/RUN) indicators of the device are off or flashing red:
- Physical level check (cables and connectors):
- Visually inspect the cable connecting the problem device:
- Are there visible damages, bends, pinching?
- Are the connectors firmly inserted into the ports? Try reconnecting.
- Use a cable tester to check the integrity and routing of the cable:
- If the test fails (open, short, cross-wire): Probable cause: damaged cable or connector. Go to Troubleshooting the Cabling System.
- If the test is successful: Go to 1c.
- Check network settings and configuration:
- Connect the laptop to the same network (or directly to the device) and try executing the command
ping <device IP address>:- If there is no response or packet loss: Go to 1c.ii.
- If the answer is yes, but there is still no communication with the PLC: Probable cause: incorrect configuration of the PLC or device. Go to the "Network Configuration Diagnostics" section.
- Use PLC software (TIA Portal, Studio 5000, etc.) or a dedicated network analyzer:
- Check the IP address, subnet mask, gateway of the problem device and make sure they match the project.
- For PROFINET/EtherNet/IP: Check the device names and make sure they are unique and match the PLC configuration.
- Check the speed and duplex settings (auto-adjust or fixed).
- If the settings are incorrect: Probable cause: incorrect network configuration. Go to "Troubleshooting network configuration".
- If the settings are correct: Go to step 1d.
- Connect the laptop to the same network (or directly to the device) and try executing the command
- Node isolation and interference effects:
- If the problem persists, try to temporarily isolate the problem device by connecting it directly to the PLC or to a minimum network test segment:
- If communication is restored: Probable cause: network infrastructure problem (switch, other device, interference) or IP address conflict. Go to section "Isolation and diagnostics of network infrastructure".
- If the connection is not restored: Probable cause: malfunction of the device or its network interface. Go to the section "Diagnostics of the malfunction of the device".
- Use a thermal imaging camera to check for overheating network components or power supplies related to communication.
- If electromagnetic interference (EMF) is suspected: Use an oscilloscope or a specialized EMF analyzer to evaluate the noise level.
- If the problem persists, try to temporarily isolate the problem device by connecting it directly to the PLC or to a minimum network test segment:
- Checking the status indicators:
- Symptom: Intermittent communication failures, packet loss, high jitter.
- Check section 1.a, 1.b.
- Use network analyzer to monitor traffic:
- Are there collisions, looping packets, broadcast storms?
- Are there an abnormally high number of erroneous packets?
- Probable cause: network congestion, faulty switch, EMF, duplicate IP/names. Go to Network Infrastructure Isolation and Diagnostics or Interference Impact Diagnostics.
- Check the stability of the power supply for all network devices (voltage sags are possible).
6. Malfunction-cause matrix
The table below summarizes typical symptoms, likely causes, and methods of diagnosing them:
| Symptom | Probable causes (by probability) | Diagnostic test | Expected result (if the cause is confirmed) |
|---|---|---|---|
| Complete loss of communication with one device | 1. Damaged cable/connector 2. No power on device 3. Invalid IP address/name 4. Malfunction of the network interface of the device 5. IP address/name conflict |
Visual inspection, cable tester, multimeter, ping, PLC software, network analyzer | Cable tester: open/short circuit. Multimeter: 0 V. Ping: Timeout. Analyzer: No traffic from the device, duplicate IPs. |
| Intermittent crashes/loss of packets with one device | 1. Cable damage (loose contact) 2. Electromagnetic interference (EMF) 3. Network overload (broadcast storms) 4. Unstable power supply of the device 5. Faulty switch/device network port |
Cable tester (retest), network analyzer (monitoring), oscilloscope, thermal imaging camera, switch event log | Cable tester: sporadic errors. Analyzer: high percentage of erroneous packets, collisions. Oscilloscope: noise on the signal. |
| Loss of communication with an entire network segment (multiple devices) | 1. Malfunction of the industrial switch 2. Main cable break 3. Congestion/broadcast storm on switch 4. Switch/backbone power problem 5. Grounding or EMFs affecting the entire segment |
Visual inspection of the switch/cable, ping all devices on the segment, network analyzer, check the power of the switch | Switch: All port lights are off/red. Ping: Timeout for the entire segment. Analyzer: No traffic or blocking at all. |
| Slow or unstable network operation, high jitter | 1. Network overload (too much traffic) 2. Incorrect speed/duplex setting 3. Network loops (absence of STP/RSTP) 4. EMF 5. Outdated/defective network equipment |
Network analyzer (measurement of delay, jitter, bandwidth usage), checking switch settings, indicators on switches | Analyzer: high bandwidth utilization (>70%), jitter >100 µs. Switch: Loop/Error indicators are active. |
| CRC errors, Frames with errors | 1. Damaged cable 2. EMF 3. Faulty device/switch network port 4. Incorrect speed/duplex setting |
Network analyzer, cable tester, switch/device event log | Analyzer/Log: significant number of CRC errors, fragmented frames. |
7. Root cause analysis for each malfunction
Understanding the root cause is critical to preventing repeated failures.
7.1. Damage to the cable system
- Explanation: Cables are the physical backbone of a network. They can be damaged mechanically (bends, pinches, cuts), under the influence of aggressive environments (chemicals, oils), high temperatures, vibration or rodents. Internal wire breaks, short circuits, or crosstalks can occur due to poor installation or insulation degradation.
- How to confirm: Cable tester (breaks, short circuit, incorrect wiring, high return loss), visual inspection, physical movement of the cable (may restore communication temporarily).
- Damage, if not corrected: Sporadic or continuous loss of communication, resulting in equipment shutdowns, data errors, controllability, and, as a result, significant production losses.
7.2. Incorrect network configuration
- Explanation: Each device in an industrial network must have a unique IP address (for Ethernet-based protocols), correct subnet mask, gateway, and for PROFINET/EtherNet/IP a unique device name. Mistakes in these settings (for example, duplicate IP addresses, wrong device name) lead to conflicts and the inability to establish communication.
- How to confirm: Network analyzer (detects duplicate IP addresses, name conflicts), PLC software (reads/writes device configuration),
pingcommands. - Damage if not fixed: Defective devices, incorrect management, unpredictable network behavior, inability to integrate new hardware.
7.3. Problems with powering devices
- Explanation: Unstable, low or absent supply voltage leads to incorrect operation of network interfaces of devices or to their complete shutdown. Triggering of protections, interruptions in power supply circuits, malfunction of power supply units, overvoltage.
- How to confirm: Multimeter (measurement of the supply voltage at the device input, comparison with the nominal), visual inspection of fuses, power indicators. Standard: 24 V DC ±10% for industrial systems.
- Damage if not repaired: Malfunctions of powered devices, failure of other components due to unstable voltage, production stoppages.
7.4. Electromagnetic interference (EMF)
- Explanation: High-frequency noises caused by power cables, inverters, welding equipment, radio transmitters, electric motors can travel on signal cables and distort data transmission. Poor cable shielding, lack of or incorrect grounding can increase this effect.
- How to confirm: Oscilloscope (visualization of noise on signal lines), network analyzer (increase in number of CRC errors, decrease in throughput), ground check (multimeter).
- Damage if not fixed: Intermittent and unpredictable communication failures that are difficult to diagnose, data errors, network slowdowns that can lead to false positives and crashes.
7.5. Network equipment/interface failure
- Explanation: Malfunction of the industrial switch (burnt port, faulty board), PLC network adapter or field device network interface. This can be due to overheating, short circuit, overvoltage or natural wear of components.
- How to confirm: Node isolation (direct connection), switch/device replacement with a known good one, thermal imaging camera (detection of overheating), switch event log (notification of port failures).
- Damage, if not repaired: Total or partial failure of the network, resulting in production stoppage.
8. Step-by-step troubleshooting procedures
For each identified root cause, perform the following corrective actions:
8.1. Troubleshooting the cable system
CAUTION: SAFETY! Apply LOTO before working with cables that may be live.
- Damage identification: Use a cable tester to pinpoint the location and type of damage (break, short circuit, crosstalk).
- Cable Replacement: If the damage is significant, replace the entire length of cable with a new one that meets industry standards (eg CAT5e/CAT6A for Ethernet, shielded, with copper cores). Use cables with the appropriate protection class (IP) and resistance to external influences.
- Replacing connectors: If only the connector (eg RJ45) is damaged, carefully cut it and install a new one using a special tool (crimper). Make sure the wiring is correct (T568A or T568B).
- Checking the shielding and grounding: Ensure that the cable shielding is correctly connected to grounding on both sides (for shielded cables) or on one side (for some configurations). Check ground resistance with a multimeter (should be <4 ohms).
- Verification: After replacement or repair, retest the cable with a cable tester. Make sure all parameters are standard (eg ISO/IEC 11801). Restore the power and check the communication with the device.
8.2. Troubleshooting network configuration
CAUTION: Be careful when changing network settings, it may affect the entire system.
- Determining the correct settings: Refer to the project documentation or to the current configuration of other similar devices.
- Change IP address/device name: Using PLC software (eg TIA Portal, Studio 5000) or specialized utilities (eg Siemens Primary Setup Tool, Rockwell BOOTP/DHCP Server), set the correct IP address, subnet mask, gateway and device name. Make sure these parameters are unique.
- Checking speed/duplex settings: If fixed settings are set, make sure they match the corresponding switch port settings. It is recommended to use auto-reconcile if possible.
- Rebooting the device: After changing the settings, as a rule, you need to reboot the device to apply them.
- Verification:
pingto the device. Check the communication through the PLC software. Verify that the device appears in the network topology without errors.
8.3. Restore power to devices
CAUTION: SAFETY! Apply LOTO. Before taking voltage measurements, make sure the multimeter is set to the correct range.
- Voltage measurement: Using a multimeter, measure the supply voltage directly at the terminals of the problematic device. Make sure it is within acceptable limits (eg 24 V DC ±10%).
- Check fuses: Inspect and check the fuses protecting the device's power circuit for integrity. Replace blown fuses with new ones of similar rating and type (EN 60127).
- Power Supply Diagnostics: If voltage is low or absent, check the output voltage of the power supply supplying power to the device. If necessary, replace the faulty power supply unit.
- Checking the power cables: Check the power cables for breaks, insulation damage, loose contacts.
- Verification: After restoring stable power, check the device's power indicators. Reconnect to the PLC.
8.4. Reducing the influence of electromagnetic interference
CAUTION: SAFETY! Grounding work must be performed by qualified personnel.
- Ground Check: Make sure the control cabinets and all components are properly grounded according to EN 60204-1. Check the integrity of the ground loop with a multimeter (resistance should be minimal).
- Cable separation: Separate signal cables and power cables. They should be laid in separate channels or at a sufficient distance (minimum 20 cm).
- Use of shielded cables: Use only shielded industrial Ethernet cables (eg PROFINET Type B/C, EtherNet/IP ODVA Industrial Ethernet) with proper shield connection.
- Direction filters: Apply ferrite rings or EMF filters to the power cables of devices that generate interference (such as frequency converters).
- Verification: Network traffic monitoring using an analyzer to reduce the number of CRC errors and improve connection stability.
8.5. Replacement of faulty network equipment/interface
CAUTION: SAFETY! Apply LOTO. Before replacing equipment, make sure you have a suitable replacement.
- Fault identification: Confirm the fault by isolating the assembly or replacing with a known-good component.
- Switch replacement: Replace a faulty industrial Ethernet switch with a new one with similar characteristics (number of ports, speed, protocol support).
- Replacing the PLC module: If the PLC network module (eg Siemens CP module) is faulty, replace it according to the manufacturer's instructions.
- Field device replacement: If the network interface of a field device (for example, an I/O module) is faulty, replace the entire device.
- Configuration: After replacing the new equipment, it is necessary to restore its network settings and configuration according to the project documentation.
- Verification: Check communication with all devices connected to the replaced hardware. Network stability monitoring.
9. Preventive measures
Prevention is more effective than elimination of consequences.
| The root cause | Prevention strategy | Monitoring method | Recommended interval |
|---|---|---|---|
| Damage to the cable system | Use of industrial cables (EN 50173, ISO/IEC 11801), protection against mechanical damage, correct laying, correct bending radius. | Visual inspection of cables, planned tests of cables (cable tester). | Monthly (visual), yearly (testing). |
| Incorrect network configuration | Maintaining up-to-date network documentation (IP addresses, device names), standardization of configurations, access control for changing settings. | Audit of network settings, inventory of IP addresses. | Quarterly or after any changes. |
| Problems with powering devices | Use of high-quality industrial power supply units (UkrSEPRO certified), uninterruptible power supply system (UPS), regular checking of voltage and current. | Measurement of power supply voltage, temperature monitoring of power supply units (thermographic camera). | Monthly. |
| Electromagnetic interference (EMF) | Correct grounding (EN 50310), shielding of cables, separation of power and signal cables, use of EMF filters. | Monitoring the number of CRC errors in network traffic, periodically checking the ground loop. | Quarterly (grounding check), constantly (network monitoring). |
| Network equipment/interface failure | Planned replacement of critical components, temperature monitoring in cabinets, use of industrial-class equipment with appropriate certificates (CE, UkrSEPRO). | Switch/module temperature monitoring, switch/PLC event log, thermographic camera. | Annually (planned replacement), constantly (monitoring). |
10. Spare parts and components
Availability of up-to-date spare parts is critical for quick recovery.
| Description of the part | Specification | When to replace | Category UNITEC |
|---|---|---|---|
| Industrial Ethernet cable | CAT5e / CAT6A, shielded (SF/UTP or S/FTP), for industrial use (e.g. PUR jacket), length according to the network card. | If physical damage is detected or the cable test fails. | Network components |
| RJ45 connector for industrial Ethernet | Industrial connector (IP20/IP67), with the possibility of quick installation without tools or under crimping, metal housing for shielding. | In case of damage to the connector or incorrect operation after crimping. | Network components |
| Industrial Ethernet switch | Number of ports (4/8/16), speed (100 Mbit/s or 1 Gbit/s), Unmanaged/Managed, protocol support (IGMP Snooping, RSTP), DIN rail mounting. | When a malfunction is detected (burnt port, lack of switching), or according to the equipment aging plan. | Network equipment |
| PLC network module | Depends on the PLC model (e.g. Siemens CP module, Rockwell Ethernet/IP module). | In the event of a complete malfunction or the inability to establish communication, confirmed by diagnostics. | PLC modules |
| Power supply unit 24 V DC | Industrial power supply, output voltage 24 V DC, current up to 5/10/20 A, DIN rail mounting, overload/short circuit protection. | In case of unstable output voltage, overheating, or failure. | Electrical components |
| Modbus RTU terminator | Resistance 120 Ohms ±5%, 1/4 W. | In case of damage or loss. Required at the ends of the RS-485 bus. | Network components |
To order high-quality spare parts that meet industrial standards and have the necessary certificates (CE, UkrSEPRO), visit our electronic catalog UNITEC.
11. References
- DSTU EN 61784 (ISO 15745) – Industrial communication networks.
- ISO/IEC 11801 – Information technologies. Structured cable system.
- EN 60204-1 – Machine safety. Electrical equipment of machines. Part 1: General requirements.
- EN 61131-2 – Programmable controllers. Part 2: Equipment requirements and operational tests.
- NPAOP 40.1-1.21-98 - Rules for the safe operation of consumer electrical installations.
- ОЕМ посібники з програмування та діагностики для Siemens (PROFINET System Description), Rockwell Automation (EtherNet/IP CIP Networks), Schneider Electric (Modbus TCP/IP Communication).