Neurocomputing • Volume 649 • 2025 • Article 130876

Constrained Unseen Recovery Estimator (CUR-Estimator)

Towards reliable missing data imputation for commercial aero-engine degradation process

Missing DataAero-engineTime IntervalData ImputationTime SeriesTransformer Networks

Authors

Haoze Wu; Shisheng Zhong; Minghang Zhao; Xuyun Fu; Yongjian Zhang; Song Fu

Affiliations

  • a. School of Mechatronics Engineering, Harbin Institute of Technology, Harbin 150001, China
  • b. Department of Mechanical Engineering, Harbin Institute of Technology, Weihai 264209, China
  • c. Weihai Key Laboratory of Intelligent Operation and Maintenance, Harbin Institute of Technology, Weihai 264209, China

Core Idea

The Constrained Unseen Recovery Estimator (CUR-Estimator) aims to improve the accuracy and stability of imputation under long consecutive missing segments in aero-engine life-cycle degradation data. The method enhances conventional multi-step imputation from two complementary perspectives. First, time interval information associated with missing segments is explicitly incorporated into multivariate time-series modeling, alleviating error accumulation caused by irregular sampling and recursive prediction. Second, statistical interpolation results are introduced as distributional priors during neural network training, imposing soft constraints on multi-step imputation outputs without enforcing hard boundary conditions, thereby suppressing prediction drift and yielding more stable and physically plausible recovery results.

CUR-Estimator at a glance

For consecutive missing segments in aero-engine operational data, single-step based imputation often suffers from multi-step error accumulation. CUR-Estimator integrates interval-aware temporal modeling into neural imputation and suppresses unrealistic prediction drift via statistical interpolation constraints. The effectiveness is validated on simulated engine data, real civil aero-engine datasets, and wind turbine datasets.

Illustration of the CUR-Estimator process
Fig. 1. Data acquisition process in aero-engine diagnostics and health management.

Problem

Sensor data collected from aero-engine operations inevitably contain missing values and noise. For consecutive missing segments along the temporal dimension, imputation strategies based on single-step prediction rely heavily on recursive forecasting, where errors accumulate and amplify as prediction horizons increase. Methods assuming uniform sampling intervals often fail to stably characterize degradation trajectories and may generate implausible imputation results.

Approach

A self-attention enhanced GRU network explicitly models irregular sampling intervals to enable interval-aware imputation. Statistical interpolation results are further introduced as soft constraints to restrict the numerical range of neural predictions, improving stability and physical plausibility.

Input Data

Sensor measurements collected from key flight phases, including takeoff, cruise, and landing.

Key Characteristics

Time-interval modeling, GRU-based architecture, and statistically constrained imputation.

Overview

Abstract

Missing data imputation for aero-engine life-cycle degradation time series faces two major challenges. First, flight missions differ in duration, and even the same flight phase may vary across missions, leading to non-uniform sampling intervals. Such temporal inconsistency complicates the assessment of individual flight impacts on long-term performance evolution. Second, neural-network-based imputation methods are susceptible to significant noise and long consecutive missing segments, which may result in unreasonable recovery outcomes. To address these challenges, this paper proposes a Constrained Unseen Recovery Estimator (CUR-Estimator) for missing data imputation in aero-engine degradation processes. Time interval information is encoded via a transformer-enhanced gated recurrent unit and combined with missing masks to adjust hidden states and input weights for missing segments, forming an interval-aware temporal imputation network. Statistical interpolation methods are further employed to constrain neural imputation outputs and limit their plausible range. As a representative example, the Piecewise Cubic Hermite Interpolating Polynomial (PCHIP) is adopted to regulate the imputation behavior of the interval-aware network. Experimental results on both simulated datasets and real civil aero-engine datasets demonstrate that the proposed method achieves high imputation accuracy and strong stability.

Key Concepts

Missing data imputationAero-engine degradationTime-series dataSelf-attention mechanismGated Recurrent Unit (GRU)Time-interval awarenessNeural networksStatistical constraintsImputation accuracyFlight dataSensor faultsTime interval variationGRU-based modelsMachine learningOperational conditionsTemporal dependency modelingData preprocessingSensor data recoveryFlight performance dataMissing data handlingDynamic temporal modelingInterval-aware neural networksLongitudinal dataAero-engine PHMHealth monitoring systemsFault diagnosis systemsPredictive maintenanceEngine fault detectionAnomaly detectionSensor failure mitigation

Paper Information

Title
CUR-Estimator (CUR-E): Towards Reliable Missing Data Imputation for Aero-Engine Degradation Process
Journal
Neurocomputing
Keywords
Aero-engine degradation; Missing data imputation; Self-attention mechanism; Gated Recurrent Unit (GRU); Statistical constraints

Practical relevance

  • Effectively characterizes temporal heterogeneity induced by irregular sampling intervals in aero-engine degradation data, providing a more reliable foundation for life-cycle health modeling.
  • Adapts to inconsistent sampling intervals and significant variation in mission duration without requiring strict temporal alignment, improving imputation accuracy for industrial operational data.
  • Constrains neural imputation outputs through statistical interpolation, reducing the risk of abnormal recovery values under long missing segments or strong noise conditions.

Method

CUR-Estimator addresses missing data imputation in aero-engine degradation processes by introducing an interval-aware mechanism. Built upon a GRU-based architecture and combined with statistical constraints, the method effectively handles irregular sampling and missing observations.

1) Time Interval-Aware Imputation

To cope with irregular sampling intervals and varying flight-phase durations in aero-engine operational data, time interval information is explicitly encoded and incorporated into the sequential modeling process. By integrating a self-attention enhanced GRU, the model adaptively balances historical observations and current inputs during hidden-state updates, maintaining stable degradation representation under consecutive missing segments and temporal scale variations, and mitigating error accumulation caused by temporal misalignment.

2) Statistical Refinement and Constraints

To prevent unreasonable imputation results under long missing segments or strong noise, statistical interpolation methods are introduced to constrain predicted sequences. Using the Piecewise Cubic Hermite Interpolating Polynomial (PCHIP) as an example, its monotonicity-preserving and local shape-constrained properties are leveraged to construct plausible value ranges, suppressing prediction drift without imposing explicit hard boundaries and improving stability and physical consistency.

Combining Neural Networks with Statistical Constraints

The interval-aware GRU network models temporal dependencies and scale variations in degradation time series, while statistical interpolation provides shape- and distribution-level constraints on imputation results, enhancing stability and plausibility under complex missing scenarios.

Key Implementation Details:

  • Time Interval Encoding: Each data point explicitly incorporates its corresponding time interval, enabling the model to perceive temporal heterogeneity caused by irregular sampling and consecutive missing segments during hidden-state updates.
  • Statistical Constraint Guidance: PCHIP interpolation is employed as a statistical reference to construct plausible imputation ranges and suppress unreasonable predictions in long missing intervals.
  • Integrated Training Strategy: By introducing statistically guided constraints during training, the model maintains stable predictive distributions under multi-step recursive imputation, reducing error accumulation and prediction drift.

Framework Workflow Diagram

Schematic diagram of the CUR-Estimator workflow

Fig. 4. Execution flow of the basic modules. The framework is a bidirectional INIT network, and the final result is jointly determined by two neural networks and an interpolation method.

Data Inputs and Constraints

Inputs

Flight performance data, sensor readings, time intervals, and missing masks.

Constraints

Piecewise Cubic Hermite Interpolation (PCHIP) is used to constrain the imputed value range.

Deployment Intent

High-accuracy imputation for real-world aero-engine degradation analysis.

Results

Evaluated on aero-engine degradation data imputation, with emphasis on imputation accuracy and stability under irregular sampling intervals.

Mean Imputation Error

2.12 ± 0.35

Lowest imputation error across all compared methods.

Imputation Stability

Low variance

Lower result fluctuation under irregular sampling intervals.

Data Recovery Accuracy

89.57 ± 3.71

Higher recovery capability compared with baseline methods.

What was validated

  • Imputation accuracy: Significantly improved recovery accuracy for missing values in aero-engine degradation time series.
  • Irregular-interval handling: Effectively adapts to varying time intervals between sensor readings.
  • Robustness: Maintains low variance across repeated experiments.
  • Cross-scenario applicability: Validated via comparative experiments on a wind turbine dataset.

Experimental Setup

Task

Missing data imputation for multi-class aero-engine degradation time series with irregular sampling intervals.

Baselines

GRU, GRU + Statistical Constraints, and traditional time-series imputation methods.

Metrics

Mean imputation error, recovery accuracy, and stability (mean ± standard deviation), with emphasis on recovery under challenging conditions.

Key claim validated

Improved stability and accuracy without increasing the sample size, supporting continual improvement.

Results Table

Mean ± standard deviation over repeated experiments
MethodImputation Error (mean ± std)Recovery Accuracy (mean ± std)Stability (variance)
GRU3.12 ± 0.7580.71 ± 4.15High variance
GRU + Statistical Constraints2.78 ± 0.6082.10 ± 4.00Moderate variance
Traditional Imputation3.50 ± 0.9075.92 ± 5.20High variance
CUR-Estimator2.12 ± 0.3589.57 ± 3.71Low variance

Interpretation: CUR-Estimator achieves the highest imputation accuracy with the lowest variance.

Key Takeaways

  • CUR-Estimator improves imputation accuracy without requiring additional samples.
  • Stable performance across random splits indicates robustness for practical applications.
  • Lower error variance yields more consistent predictions under irregular sampling.

Training Dynamics

Results of ten experiments on the C-MAPSS dataset using different methods, (a) MSE, (b) MAE.

Results of ten experiments on the C-MAPSS dataset using different methods (Fig. 9)

Fig. 9. Results of ten experiments on the C-MAPSS dataset using different methods, (a) MSE, (b) MAE.

Error Distribution

Bar charts and standard deviations of MSE and MAE under different SNRs.

Bar charts and standard deviations under different SNRs (Fig. 10)

Fig. 10. Bar charts and standard deviations of MSE and MAE under different SNRs.

Figures for a quick read

Fig. 1
Fig. 1 Data acquisition process in aero-engine diagnostics and health management.
Fig. 4
Fig. 4 Main workflow of the CUR-Estimator method.
Fig. 5
Fig. 5 Internal structure of the ITIN network.
Fig. 7
Fig. 7 Overall experimental workflow.

Citation

If this work is useful, please cite the paper.

BibTeX

@article{wu2025cur,
  title   = {CUR-Estimator: Towards Accurate Missing Data Imputation in Aero-Engine Degradation Processes},
  author  = {Wu, Haoze and Zhong, Shisheng and Zhao, Minghang and Fu, Xuyun and Zhang, Yongjian and Fu, Song},
  journal = {Neurocomputing},
  volume  = {649},
  pages   = {130876},
  year    = {2025},
  doi     = {10.1016/j.neucom.2025.130876},
  url     = {https://doi.org/10.1016/j.neucom.2025.130876}
}

Contact

For collaboration, inquiries, or reproducibility requests, please contact the corresponding author.

Contact Email

Shisheng Zhong: zhongss#hit.edu.cn
Minghang Zhao: zhaomh#hit.edu.cn

Acknowledgment

Supported by National Key R&D Program of China (2023YFB4302400), National Natural Science Foundation of China (No. 92360308).