1. Problem description & scope of application
This guide covers critical communication faults in industrial control systems based on programmable logic controllers (PLCs) and using fieldbus systems such as Profinet, EtherNet/IP and Modbus RTU/TCP. The aim is to systematically diagnose and eliminate errors that can lead to data loss, malfunctions of field devices or complete system downtimes. Affected asset types include manufacturing cells, assembly lines, process plants and infrastructure systems.
The disorders are classified as follows:
- Critical: Total communication failure, immediate system shutdown, high safety relevance.
- Major: Sporadic communication errors, cyclic failures, impaired product quality, increased downtime.
- Minor: Communication delays, alerts without immediate impact on production, reduced system performance.
2. Safety precautions
IMPORTANT: Strict safety measures must be observed before beginning any diagnostic work that requires intervention in electrical or mechanical systems. Failure to do so may result in serious injury or death.
- LOTO (Lockout/Tagout): When working on the power supply or moving parts of the system, the energy supply must be completely interrupted in accordance with DIN EN 1037 (Safety of Machines - Disconnecting the Energy Supply) and secured against being switched on again.
- Personal protective equipment (PPE): Always wear suitable PPE: safety shoes (DIN EN ISO 20345), protective gloves (DIN EN 388), safety glasses (DIN EN 166). When working near live parts, insulating gloves and tools in accordance with VDE 0680 are required.
- Stored energy: Before disconnecting lines or opening housings, ensure that there is no stored energy (compressed air, hydraulic pressure, residual voltage in capacitors). Open vent valves, release pressure and discharge capacitors.
- Hot surfaces: Components can reach high temperatures during operation. Wait for it to cool down before touching it or use suitable heat protection.
- Explosion protection (ATEX): When working in potentially explosive areas, only intrinsically safe tools and measuring devices in accordance with ATEX Directive 2014/34/EU should be used.
3. Required diagnostic tools
The correct selection and use of diagnostic tools is crucial for efficient troubleshooting.
| Tool / device | Specification/Model (Example) | Measuring range / function | Purpose |
|---|---|---|---|
| multimeter | Fluke 87V, Metrawatt MetraHit 27I | V, A, Ω, continuity, frequency | Voltage measurement, resistance testing (cables, terminating resistors), continuity testing |
| Industrial Ethernet field test device | Softing IT Networks NetXpert XG2, Fluke Networks LinkRunner G2 | Cable test (length, shield, attenuation, NEXT, return loss), network analysis (ping, traceroute, VLAN, QoS, LLDP/CDP) | Physical layer verification, protocol analysis for Profinet/EtherNet/IP |
| Oscilloscope (handheld) | Hantek DSO-2D15, Fluke 120B series | 50 MHz - 200 MHz bandwidth, 2 channels | Signal analysis on data lines (levels, edges, jitter, EMI), fault location |
| Cable tester (passive) | Beha-Amprobe CT-20, Fluke Networks CableIQ | Continuity, short circuit, swapping, split pair | Basic inspection of copper cables (RJ45, M12) |
| Network Tap (Passive/Active) | Garland Technology, ProfiTAP | Real-time traffic tapping | Non-intrusive protocol analysis with Wireshark |
| Crimping pliers/stripping tool | Weidmüller PZ 6/5, Phoenix Contact CRIMPFOX | RJ45, M12, ferrules | Cable assembly and repair |
| Laptop with diagnostic software | Siemens TIA Portal, Rockwell Studio 5000, Modbus Poll, Wireshark | Device configuration, network analysis, protocol decoding | Parameterization, troubleshooting at application level |
| Thermography camera | FLIR E-Series, Testo 872 | Temperature range -20°C to 650°C, min. 160x120 px resolution | Identification of overheated components (cables, connectors, modules) |
4. Initial assessment – checklist
Prior to detailed diagnosis, systematic pre-screening is essential to save time and capture important contextual information.
| Checkpoint | Action/Observation | Recording/Status |
|---|---|---|
| Visual inspection | Visually inspect all cables, connectors, modules and end devices for damage, loose connections or corrosion. | OK / Damaged (Details: position, type of damage) |
| Check status LEDs | Read and interpret status LEDs on PLCs, communication modules, switches and field devices (e.g. Link/Act, Error, SF, BF). | Normal / Flashing Red / Solid Red / Off (LED pattern description) |
| Alarm and diagnostic buffer | Read out the alarm and diagnostic buffers of the PLC and communication modules. Note the date and time the error first occurred. | Error messages present (list of messages, timestamp) / No messages |
| Environmental conditions | Check temperature, humidity, dust, vibrations in the control cabinet and where the device is used. | Normal / Abnormal (Details: Value, Deviation from Specification) |
| Latest changes | Have there been any recent changes to the hardware (cable, device replacement), software (PLC program, firmware) or network parameterization? | Yes (Details: What, When, Who) / No |
| Check documentation | Check the network plan, electronic plan, address lists and configuration files to ensure they are up to date and match the current status. | Current & Matching / Outdated / Incorrect |
5. Systematic diagnostic flowchart
This flowchart guides you step-by-step through diagnosing fieldbus communication errors.
- Communication error detected (e.g. SF/BF LED on PLC, device not accessible).
- Initial check of the physical layer:
- Visual inspection: Check all cables and plugs for obvious damage, tightness of the plugs and correct strain relief.
- Status LEDs: Check on communication modules, switches and field devices.
- IF all LEDs normal: THEN continue to 1.2 (logical layer).
- IF Link LED off or error LED active on a device: THEN go to 1.1.1 (cable and connection test).
- Cable and connection test:
- Cable connection: Reconnect the affected cable at both ends.
- Multimeter (Modbus RS485): Measure resistance across the data lines (A/B) to find short circuits or interruptions. Also measuring the terminating resistors (120 Ω).
- Cable tester (Profinet/EtherNet/IP): Use the field test device to check the affected cable for:
- Continuity, short circuit, swapping, split pair.
- Cable length (max. 100m for Cu-Ethernet, 200m for FO multimode, 20km for FO singlemode).
- Shield continuity (continuity from connector housing to connector housing).
- IF Cable test error-free: THEN go to 1.1.2 (environmental factors & EMI).
- IF Cable test error detected: THEN go to 7.1 (cable break/fault) or 7.2 (missing/defective shielding).
- Environmental Factors & EMI (Electromagnetic Interference):
- Environment: Extreme temperatures (>50°C or <0°C), high humidity (>90% rH), high vibration check.
- EMI Sources: Nearby frequency converters, large motors, welders, inductive loads.
- Grounding/potential equalization: Visual inspection of the grounding connections and the potential equalization in accordance with VDE 0100-410 (earthing).
- IF Abnormalities: THEN go to 7.2 (missing/defective shielding) or 7.3 (inadequate grounding/potential equalization).
- IF no abnormalities: THEN continue to 1.2 (logical layer).
- Initial check of the physical layer:
- Checking the logical layer / network configuration:
- IP configuration (Profinet/EtherNet/IP/Modbus TCP):
- Connection via laptop: Direct connection of a laptop to the affected device (or via a switch in the same subnet).
- Ping test: Ping device via its IP address (
ping <IP address>). - IF Ping successful: THEN continue to 1.2.2 (device configuration).
- IF Ping failing: THEN
- IP address/subnet mask: Check the device and the PLC configuration for correctness. Must be in the same subnet.
- Gateway: If communication takes place via router, check gateway address.
- Duplicated IP addresses: Make sure that no other station in the network uses the same IP address (e.g. check using ARP cache or with a field test device).
- THEN go to 7.4 (Incorrect IP configuration) or 7.5 (Duplicated IP address).
- Device configuration (Profinet/EtherNet/IP/Modbus):
- Software: Use diagnostic software (TIA Portal, Studio 5000, Modbus Poll) to access the device.
- Device name (Profinet): Must exactly match that in the PLC configuration (GSDML).
- Modbus address (Modbus RTU/TCP): Check slave address on the device and compare it with the PLC programming.
- Process data mapping: Check whether the data areas to be transferred (inputs/outputs) are correctly configured in the PLC and on the field device.
- IF Configuration error found: THEN go to 7.6 (Incorrect device/module configuration).
- IF Configuration correct: THEN continue to 1.3 (PLC program & firmware).
- IP configuration (Profinet/EtherNet/IP/Modbus TCP):
- PLC program & firmware:
- PLC program:
- Logic: Check whether the communication blocks in the PLC program are called and parameterized correctly (e.g. PN_IO_SEND/RECV, MSG instances).
- Error codes: Search for and interpret specific error codes of the communication blocks in the PLC program.
- IF programming error: THEN go to 7.7 (error in the PLC program).
- Firmware versions:
- Alignment: Align firmware versions of PLCs, communication modules and field devices with the manufacturer's specifications and the project specifications.
- Compatibility: Ensure that all firmware versions are compatible with each other.
- IF Incompatibility or outdated firmware: THEN go to 7.8 (Outdated/Incompatible Firmware).
- PLC program:
- Node isolation (if errors are sporadic or difficult to localize):
- Topology change: Separate individual devices or entire system parts from the fieldbus one after the other.
- Segment-wise testing: Test individual segments of the network in isolation using switches and field test devices.
- IF Error disappears after separation of a node/segment: THEN the error lies in this isolated area. Focus diagnostics on that segment or device.
6. Error-cause matrix
This matrix helps identify likely causes and conduct targeted testing based on the symptoms observed.
| Symptom | Probable causes (ranked by probability) | Diagnostic test | Expected result with confirmed cause |
|---|---|---|---|
| SF/BF LED on PLC lights up / communication dead | 1. Broken cable / bad connection 2. Defective field device / communication module 3. Incorrect IP address / device name 4. Short circuit on data line (Modbus RS485) |
Cable test with field test device, device exchange (cross exchange), ping test, IP configuration check, multimeter (RS485) | Cable test error (open/short circuit), new device works, ping failed, wrong IP/name detected, resistance < 120 Ω (RS485) |
| Sporadic communication errors/data corruption | 1. Electromagnetic Interference (EMI) 2. Cable shielding damage / lack of grounding 3. Loose contact in the plug 4. Overloaded fieldbus (too many devices, too high data rate) 5. Defective switch port (Profinet/EtherNet/IP) |
Oscilloscope (signal analysis), thermography camera, visual inspection of cables/plugs, network analysis (frame drops, CRC errors), port replacement on the switch | Signals noisy, increased temperature on cable/connector, optical contact error, increased frame errors, other switch port works |
| Device not accessible after replacement/commissioning | 1. Incorrect IP address / subnet mask / device name 2. Incorrect Modbus address / baud rate 3. Incorrect GSDML / EDS file loaded 4. Device defective (DOA) |
IP configuration check, Modbus configuration check, PLC configuration check, device replacement | Wrong parameters discovered, wrong GSDML/EDS reference, new device works |
| Communication slow / high cycle time | 1. Overloaded fieldbus 2. Incorrect network topology (e.g. loops) 3. Too many unmanaged switches in series 4. Software error in PLC program |
Network analysis (latency, jitter), topology check, managed switch deployment, PLC program code analysis | High packet delays, loop detection, improvement after switch replacement, logic errors in the PLC program |
7. Root cause analysis for each error
A deep understanding of the causes is crucial for sustainable problem solutions.
- Cable break / cable error:
- Why it happens: Mechanical stress (bending, pulling), vibrations, incorrect installation (bending radius too narrow, crushing), aging of the insulating material.
- How to confirm: A cable tester (e.g. Softing NetXpert) identifies exactly the location and type of fault (open circuit, short circuit). A multimeter shows infinite resistance on an open circuit or very low resistance on a short circuit.
- Damage if not rectified: Complete communication failure, intermittent faults, data corruption, increased wear and tear on other components due to error messages and restarts.
- Missing/defective shielding or poor grounding:
- Why it happens: Damage to the shield (sheath), shield not connected to plugs, faulty equipotential bonding, loop formation due to multiple grounding points. This leads to susceptibility to electromagnetic interference (EMI).
- How to confirm: Visual inspection of shielding and ground connections. Oscilloscope shows high frequency noise on the data lines. A field test device checks the shield for continuity.
- Damage caused by non-remediation: Sporadic communication failures, bit errors, unexplained device malfunctions, problems that are difficult to locate, increased maintenance requirements.
- Incorrect IP configuration / device name / Modbus address:
- Why it happens: Manual entry errors, using a template with incorrect data, insufficient documentation, duplicate addresses.
- How to confirm: Comparison of the addresses actually set in the device with the network plan and the PLC configuration. A ping test shows no response if the IP is incorrect. An ARP scan can reveal duplicate IPs.
- Damage if not rectified: Device not accessible, communication conflicts, system downtime, inefficient troubleshooting.
- Defective field device / communication module:
- Why it happens: Aging of the electronics, overvoltage, overheating, mechanical damage, firmware errors.
- How to confirm: Cross swap with a known working device. Check diagnostic LEDs on the device. In the event of sporadic errors, a thermography camera can identify overheated components on the module.
- Damage if not rectified: Permanent system downtime or unstable operation, consequential damage to coupled systems if incorrect data is transmitted.
- Error in the PLC program:
- Why it happens: Logic errors when implementing communication blocks, incorrect parameter transfer, missing error handling routines, race conditions.
- How to confirm: Online observation of the PLC program (status variables, block calls, error codes). Simulation of communication scenarios.
- Damage if not rectified: Unpredictable system behavior, data inconsistencies, errors that are difficult to diagnose, high complaint costs.
8. Step-by-step troubleshooting procedures
- Cable replacement and assembly:
- SAFETY NOTE: Before disconnecting cables, switch off the system in accordance with LOTO and secure it against being switched on again.
- Identify the affected cable and carefully dismantle it.
- Lay a new cable (at least CAT5e for Profinet/EtherNet/IP, shielded) in accordance with the DIN EN 61918 standard. Pay attention to the bending radius (min. 4x cable diameter).
- Assemble connectors (e.g. M12 D-coded, RJ45 Industrial) professionally using crimping pliers and stripping tools. Place the shield 360° on the housing.
- Check for continuity, short circuit and shield connection with a cable tester. All values must be within the tolerance range.
- Connect cable firmly and ensure strain relief.
- Correcting IP addresses / device names / Modbus addresses:
- SAFETY NOTE: Changes to network parameters can cause communication disruptions. Verify that the correct value is present.
- Access the network configuration via the diagnostic software (e.g. TIA Portal, Studio 5000) or the device's web interface.
- Set the IP address, subnet mask and, if necessary, gateway according to the network plan. For Profinet, match the device name exactly with the GSDML configuration. For Modbus, check and set the slave ID and baud rate.
- Save changes and restart the device if necessary.
- Verify availability and correct communication using a ping test and diagnostic software.
- Fixing EMI / grounding problems:
- SAFETY NOTE: Work on grounding and potential equalization systems should only be carried out by qualified personnel in accordance with VDE 0100.
- Check all ground connections of devices and control cabinets for correct and tight fit (visual inspection, resistance measurement with multimeter: <1 ohm to PE).
- Connect the shielding of the communication cable correctly at both ends (or on one side in the event of potential differences).
- Identify possible EMI sources (frequency converters, large contactors) and maintain distances from communication cables according to the manufacturer's specifications (min. 20 cm). If necessary, install filters or additional shielding.
- Check the potential equalization rail for correct wiring to avoid ground loops.
- Use an oscilloscope to verify the noise behavior on the data lines after the measures.
- Replacement of defective devices / modules:
- SAFETY NOTE: Before replacing, disconnect the power supply in accordance with LOTO.
- Identify defective device/module. Document and disconnect all cables and connections.
- Install a new, identical spare part (same manufacturer, same item number/revision status).
- Restore cabling according to documentation.
- Configure device (IP address, device name, etc.).
- Restore power supply and carry out functional test.
- Correction in the PLC program:
- SAFETY NOTE: Changes in the PLC program may only be carried out by authorized personnel and must be tested in advance. Create a backup of the existing program.
- Open the PLC program and identify the faulty communication blocks or logic sections.
- Make corrections according to specification or best practice. Implement error handling routines (e.g. for communication timeout).
- Load the program (only after release and with test verification) and observe the function online.
9. Preventive measures
Prevention is the most effective way to minimize communication disruptions and maximize asset availability.
| Cause | Prevention strategy | Monitoring method | Recommended interval |
|---|---|---|---|
| Broken cable / bad connection | Regular visual inspection of cable routes, correct routing with sufficient bending radii and strain relief. Use of industrial-grade cables (PUR/PVC) and connectors (IP67/IP69K). | Visual inspection, sporadic cable testing with field test equipment. | Annually / Every maintenance / When there are changes in the cable path. |
| EMI / Lack of grounding | Professional grounding and potential equalization after VDE 0100. separation of communication and power cables. Use of shielded cables and ferrule connectors. | Measuring potential equalization (multimeter), oscilloscope for noise analysis. | Every 3 years / For expansions / For new EMI sources. |
| Incorrect IP configuration / device name | Standardized naming conventions and IP address management. Central storage and maintenance of network documentation. | Regular comparison of current status vs. documentation, network scan for duplicate IPs. | Every six months / with every start-up/change. |
| Aging / defective devices | Regular maintenance, check environmental conditions (temperature, ventilation). Preventive replacement of critical components (note service life). | Thermography recordings, observation of diagnostic messages, spare parts management. | According to manufacturer specifications / Based on wear curve. |
| Error in the PLC program/firmware | Use of version management for PLC programs. Thorough testing phases for software changes. Regular firmware updates (after compatibility check). | Audit of program changes, test protocols, firmware version control. | With every software change / Annually for firmware updates. |
10. Spare Parts & Components
The availability of spare parts is crucial for a quick restoration of operational readiness. UNITEC-D offers a wide range of high-quality components for your fieldbus systems.
| Part description | Specification (example) | When to replace | UNITEC category |
|---|---|---|---|
| Profinet cable | M12 D-coded, 4-pin, CAT5e, PUR, shielded | In case of cable breakage, shield damage, age-related brittleness | Industrial communication |
| EtherNet/IP patch cable | RJ45, CAT6A, S/FTP, shielded, 1m/2m/5m | In case of mechanical damage, poor plug connection | Industrial communication |
| Modbus RTU cable | RS485, 2x2x0.34mm², shielded, PUR/PVC | In the event of cable breakage, corrosion, insulation damage | Industrial communication |
| Fieldbus connector | M12, RJ45 Industrial, push-pull, quick connection | In case of contact problems, corrosion, housing damage | Connectors |
| Industrial Ethernet Switch | Managed, 8-port, Gigabit, DIN rail mount, IP30 | If individual ports fail, overall defect, incompatibility | Network components |
| Modbus RS485 / Ethernet converter | RS485 to Ethernet/IP or Profinet | If the conversion function fails | Gateways & Converters |
| PLC communication module | e.g. Siemens CM 1243-5 (Profinet), Rockwell 1734-AENT (EtherNet/IP) | In the event of an internal error, communication failure in the module | PLC modules |
You can find a comprehensive selection of high-quality spare parts in our UNITEC-D e-catalog. We would be happy to advise you on selecting the right components.
11. References
- DIN EN 61918: Industrial communication networks - installation of communication networks in industrial plants.
- VDE 0100-410: Setting up low-voltage systems - Part 4-41: Protective measures - Protection against electric shock.
- Profibus and Profinet International (PI): Guidelines and specifications for Profinet.
- ODVA (Open DeviceNet Vendor Association): Specifications for EtherNet/IP.
- Modbus-IDA: Modbus specifications.
- Manufacturer documentation of the respective PLC and field devices.
- UNITEC-D maintenance manuals for specific system components.