Factory AI Logo
Back

Robots Maintenance: How Do You Transition from "Fixing Breakdowns" to Asset Performance Management?

Feb 13, 2026

robots maintenance
Hero image for Robots Maintenance: How Do You Transition from "Fixing Breakdowns" to Asset Performance Management?

In the high-stakes environment of modern manufacturing, the industrial robot is the heartbeat of production. Whether it’s a 6-axis articulated arm welding automotive frames or a high-speed delta robot picking and packing pharmaceuticals, these machines are marvels of precision. However, a common misconception persists among facility managers: that following the OEM’s manual for lubrication and battery changes constitutes a complete maintenance strategy.

It does not.

If you are reading this, you are likely facing a specific problem. Perhaps you are noticing a gradual drift in repeatability, causing quality control issues. Maybe you recently suffered a catastrophic joint failure that halted the line for 12 hours because a specific harmonic drive wasn't in stock. Or, you might be a reliability engineer looking to integrate your robotic fleet into a broader Asset Performance Management (APM) strategy.

The core question isn't just "how do I maintain a robot?" It is: How do I ensure zero unplanned downtime and maximum lifecycle value from robotic assets in a 24/7 production environment?

The answer lies in moving beyond simple Preventive Maintenance (PM) checklists and adopting a hybrid approach that combines rigorous mechanical discipline with data-driven predictive analytics.


The Core Question: Why Isn't the OEM Manual Enough?

Most maintenance managers start with the OEM manual. Fanuc, ABB, KUKA, and Yaskawa all provide excellent documentation. They tell you to grease the J1 axis at 1,000 hours and replace the controller backup battery annually.

So, why do robots still fail unexpectedly?

The OEM manual is written for "standard operating conditions." It assumes a clean environment, a standard duty cycle, and moderate payloads. It rarely accounts for:

  • Abrasive dust or conductive particulate matter (common in foundries or machining).
  • Micro-stops and high-torque reversals that accelerate gear wear faster than continuous rotation.
  • Thermal fluctuations in the plant that affect lubricant viscosity.

To solve the problem of unplanned downtime, you must view robots maintenance not as a series of tasks, but as a continuous monitoring of asset health. This is the shift from maintenance to Asset Management.

The "Bathtub Curve" in Robotics

Robots follow a classic reliability curve.

  1. Infant Mortality: Failures in the first 1,000 hours, usually due to installation errors, improper grounding, or software logic bugs.
  2. Useful Life: A long period of stability where failures are random but rare.
  3. Wear-Out Phase: The period where mechanical components (gears, belts, harnesses) begin to degrade exponentially.

Your goal is to extend that "Useful Life" phase and predict the "Wear-Out" phase before it impacts product quality.


What Actually Breaks? The Anatomy of Robotic Failure

To maintain a robot effectively, you must understand its weak points. You aren't just maintaining a "robot"; you are maintaining four distinct subsystems, each with its own failure mode.

1. The Mechanical Unit (The Arm)

The most common mechanical failure isn't the motor; it's the transmission. Most industrial robots use Cycloidal gears or Harmonic Drives.

  • The Failure Mode: These gears rely on zero-backlash precision. Over time, the grease breaks down or becomes contaminated with metal shavings. This leads to "play" in the joint.
  • The Symptom: You won't see the robot stop. You will see oval holes instead of round ones, or welding seams that drift 1mm off center.

2. The Cable Harness (The Nervous System)

This is the number one cause of intermittent downtime. As the robot twists and turns, the internal copper conductors fatigue.

  • The Failure Mode: Internal wire breakage inside the insulation.
  • The Symptom: Intermittent communication errors, "pulse coder" alarms that vanish upon reset, or sudden stops during specific high-flex maneuvers.

3. The Controller (The Brain)

The controller cabinet is often neglected until it dies.

  • The Failure Mode: Overheating due to clogged fan filters is the primary killer of servo amplifiers and main CPU boards.
  • The Symptom: Thermal alarms, random shutdowns, or corrupted memory.

4. The End-Effector (The Tool)

The robot is just a positioning device; the end-effector does the work.

  • The Failure Mode: Pneumatic leaks, gripper wear, or sensor misalignment.
  • The Symptom: Dropped parts or "part not present" alarms.

The Practical Preventive Maintenance (PM) Schedule

While we want to move toward predictive strategies, a rigorous PM schedule is the foundation. If you skip the basics, no amount of AI will save you. Here is a breakdown of what a robust schedule looks like for a high-duty facility.

Daily Checks (Operator Level)

These should be integrated into your PM procedures and performed at the start of every shift.

  • Visual Inspection: Check for oil leaks on the floor or robot arm.
  • Noise Check: Listen for grinding, clicking, or whining during operation.
  • Cable Dress: Ensure the external cable dress isn't rubbing against guarding or the part.
  • Pressure: Verify pneumatic pressure is within range (usually 5-7 bar).

Monthly / 500-Hour Checks

  • Controller Vents: Inspect and clean cooling fans and filters. Tip: Do not use compressed air to blow dust INTO the cabinet. Vacuum it out.
  • E-Stop Test: Physically test the Emergency Stop buttons and gate interlocks.
  • Backups: Perform a full image backup of the robot controller.

Quarterly / 2,000-Hour Checks

  • Cable Harness Inspection: Inspect the internal harness (if accessible) and external dress packs for wear, cuts, or flattening.
  • Brake Test: Verify that the robot holds position when servo power is cut. If an axis drops (gravity fall), the brake is worn or the gap needs adjustment.

Annual / 5,000-Hour Checks

  • Lubrication: This is the big one. Sampling grease for iron content (ferrography) is better than blind changing. If changing, ensure you use the exact spec (e.g., MolyWhite RE00). Mixing greases causes saponification (turning to soap), destroying the gear.
  • Battery Replacement: Replace the CPU battery and the APC (Absolute Pulse Coder) batteries. Crucial: Do this with power ON to avoid losing mastering data.
  • Mastering Check: Verify the robot's zero position.

Moving Beyond the Schedule: Predictive Maintenance (PdM)

You have the checklist. Now, how do you stop failures before the schedule says so? This is where Industry 4.0 and AI predictive maintenance come into play.

Vibration Analysis on Robotics

Vibration analysis is standard for pumps and motors, but it is difficult on robots because the speed and load constantly change. However, modern sensors can be gated to listen only during specific movements.

  • The Strategy: Install wireless vibration sensors on the J1, J2, and J3 axis reducers.
  • The Threshold: Establish a baseline vibration signature during a standard cycle. When vibration amplitude increases at specific frequencies (gear mesh frequencies), you know the reducer is failing months before it seizes.

Torque and Current Monitoring

The robot controller already knows how much torque it is applying.

  • The Strategy: Use software to monitor the disturbance torque (the difference between commanded torque and actual torque).
  • The Insight: If J4 requires 15% more current to perform the same move today than it did three months ago, you have a mechanical bind, a lubrication failure, or a failing brake.
  • Implementation: This data can often be pulled directly via OPC-UA into manufacturing AI software without adding external sensors.

For more on how this applies to the motors driving the joints, see our guide on predictive maintenance for motors.


The Hidden Killers: Calibration, Mastering, and Backlash

One of the most frequent follow-up questions we receive is: "My robot is running, but my quality is dropping. Why?"

The answer is usually a loss of repeatability or accuracy.

Repeatability vs. Accuracy

  • Repeatability: The ability to return to the exact same spot every time.
  • Accuracy: The ability to go to a specific coordinate in space (X, Y, Z).

Robots are highly repeatable but notoriously inaccurate. Once you teach a point, they hit it. But if the mechanical unit wears, repeatability drifts.

Backlash Testing

Backlash is the "slop" in the gear train.

  • How to Test: Place a dial indicator on a fixed surface and touch the robot faceplate. Push the robot arm gently by hand (with servos on) in the positive and negative direction.
  • The Limit: If you see movement exceeding the OEM spec (often < 1 arcmin), your gears are worn. This cannot be fixed with software; the gear must be replaced.

Zeroing and Mastering

"Mastering" is the process of defining the physical zero degree angle for each axis.

  • The Danger: If a robot loses mastering (dead battery, encoder failure), it forgets where it is. Re-mastering by eye (lining up witness marks) is rarely accurate enough for precision applications.
  • The Solution: Use fixture-based mastering or laser tracker calibration to restore the robot to factory specs.

For a deep dive on calibration standards, the National Institute of Standards and Technology (NIST) offers extensive resources on performance metrics.


Electrical and Controller Maintenance: The Brain of the Operation

While mechanical failures are visible, electrical failures are often sudden and total.

The Teach Pendant

The Teach Pendant (TP) is the most abused piece of hardware in the cell. It gets dropped, covered in oil, and stepped on.

  • Maintenance: Inspect the TP cable regularly. A short in the TP cable can blow the fuse on the main power supply, killing the whole robot. Use screen protectors on touch screens.

Thermal Management

Heat is the enemy of electronics.

  • The Issue: Industrial cabinets are often placed in hot corners of the plant. If the internal temperature exceeds 55°C (131°F), the life of capacitors on the drive boards is halved.
  • The Fix: Check the cabinet fans. If the environment is oily, standard filters will clog in days. Consider active cooling units (AC) for the cabinet rather than just fans.

Data Hygiene

Treat the robot controller like a server.

  • Backups: Automated daily backups are essential. If a controller motherboard fries, having a backup on a USB stick from six months ago is useless if points have been touched up since then.
  • Image Files: Ensure you have a "Ghost" image of the hard drive, not just a file backup.

Safety and Compliance in Robot Maintenance

You cannot discuss maintenance without discussing safety. In 2026, safety standards are more integrated with maintenance workflows than ever.

Lockout/Tagout (LOTO) for Gravity Loads

Standard LOTO cuts electrical power. However, robots have potential energy in the form of gravity.

  • The Risk: If you change a motor on J2 (the lower arm) without blocking the arm, it will collapse when the brake is released, potentially crushing the technician.
  • The Procedure: Always block the arm mechanically or use a crane to support the load before removing any motor or brake.

ISO 10218 and RIA 15.06

Compliance requires that safety functions be tested.

  • The Test: Periodically test the "Deadman Switch" on the teach pendant. It should stop the robot when released and when gripped too tight (panic grip).
  • Speed Monitoring: Verify that "T1" mode actually limits the tool center point (TCP) speed to 250mm/s.

For authoritative guidelines on safety standards, refer to Robotic Industries Association (RIA) or relevant ISO documentation.


Implementing a Data-Driven Maintenance Strategy

How do you pull this all together? If you have 50 robots, you cannot manage this on a spreadsheet. You need a centralized system.

1. Centralize Data in a CMMS

Every robot should be an asset in your CMMS software.

  • Track Costs: Log every hour of labor and every tube of grease. This helps you calculate the Total Cost of Ownership (TCO).
  • Automate Work Orders: Trigger PMs based on runtime hours (pulled from the controller), not calendar days. A robot running one shift needs different maintenance than one running three shifts.

2. Mobile Accessibility

Technicians shouldn't have to run back to the office to print a manual.

  • The Solution: Use a mobile CMMS that allows them to scan a QR code on the robot leg and instantly see the grease spec, the last fault code, and the mastering history.

3. Spare Parts Strategy

Robot parts are expensive and have long lead times.

  • The Strategy: You don't need a spare motor for every axis. Analyze your fleet. If you have 20 Fanuc R-2000iC robots, you only need one set of spare motors and one spare servo amp.
  • Consignment: Negotiate with your supplier for consignment stock on critical, high-value items like RV reducers.

4. The "Refurbish vs. Replace" Decision

Eventually, maintenance is no longer viable.

  • The Benchmark: When the cost of a repair (e.g., replacing J1, J2, and J3 reducers + labor) exceeds 50% of the cost of a new unit, or when the robot is older than 15 years and parts are obsolete, it is time to replace.

Conclusion: The Zero-Downtime Mindset

Robots maintenance in 2026 is no longer about greasy hands and paper checklists. It is a sophisticated discipline involving data analysis, electrical troubleshooting, and strategic asset management.

By transitioning from a reactive "fix it when it breaks" mentality to a proactive, data-driven approach, you do more than just keep the robots moving. You extend the life of your capital equipment, ensure product quality, and protect the safety of your workforce.

Don't wait for the next joint failure to audit your maintenance strategy. Start by digitizing your current checklists, integrating predictive sensors on critical axes, and empowering your team with the right software tools.

Ready to modernize your robotics maintenance strategy? Explore how our preventive maintenance software can automate your schedules and integrate directly with your robotic controllers.

Tim Cheung

Tim Cheung

Tim Cheung is the CTO and Co-Founder of Factory AI, a startup dedicated to helping manufacturers leverage the power of predictive maintenance. With a passion for customer success and a deep understanding of the industrial sector, Tim is focused on delivering transparent and high-integrity solutions that drive real business outcomes. He is a strong advocate for continuous improvement and believes in the power of data-driven decision-making to optimize operations and prevent costly downtime.