In traditional manufacturing, quality testing comes at the end. The part is produced, inspected - and then it becomes clear whether it fits or not. If not: rejects, rework, complaint. The reaction to a quality problem is always retrospective: something has gone wrong, now we analyze why.
Predictive quality reverses this logic. Instead of checking whether a part is good after production, it is predicted during production whether a part will be good - based on process parameters, machine data and historical samples. Quality inspection shifts from final inspection to process control.
That sounds like a dream of the future. But it is not. The underlying technologies - machine learning, process data integration, real-time monitoring - are mature. The limiting factor is not the technology, but the database: if you want to introduce predictive quality, you don't need AI specialists, you first need complete, linked process data.
This article explains what predictive quality actually means, what data you need, how typical ML workflows work and where the method is already delivering measurable results in practice.
THE MOST IMPORTANT FACTS IN BRIEFPredictive quality forecasts the quality of a part during production - not afterwards. It is based on process parameters, machine data and historical quality results, which are condensed into a probability of error using ML models. The critical success factor is not the ML model, but the database. Predictive quality requires complete, time-synchronized data: Process parameters per serial number, quality characteristic per serial number, time stamp machine-set. Without this link, there is no basis for training. Typical use cases are inline defect prediction, adaptive inspection planning and early warning of process drift. The greatest effects arise in the case of high inspection costs (destructive testing, laboratory analyses) or high scrap costs (expensive components, late defect detection). Getting started is easier than expected: an initial proof of concept with one characteristic, 3-6 months of historical data and a random forest model delivers initial results in 2-4 weeks - without a data science team. |
IN A NUTSHELLPredictive quality is not a replacement for quality inspections - it is an advance of them. The aim is not to abolish inspections, but to detect errors before they occur. The key question with predictive quality is: Which process parameter constellation is most likely to lead to a quality problem? The answer is in your data - if it is linked. ML models are only as good as their training data. A perfect algorithm on bad data delivers bad predictions. Data quality is the bottleneck, not model complexity. |
What predictive quality actually means - and what it doesn't mean
Predictive quality is a data-driven approach that predicts the likely quality of a product before or during production - based on process parameters, machine conditions and historical patterns.
| Traditional quality inspection | Predictive quality | |
|---|---|---|
| Point in time | After production (final inspection) | During production (inline prediction) |
| Question | Is this part good? | Will this part be good? |
| Data basis | Inspection result of the individual part | Process parameters + historical patterns |
| Reaction time | Reactive: Defect has already occurred | Proactive: Intervention before defect occurs |
| Inspection scope | 100 % or random sample - independent of risk | Risk-based: more testing with high error probability |
The three core elements of predictive quality
1. process parameters as quality indicators
Every quality characteristic - dimensional accuracy, strength, surface quality - is influenced by process parameters: temperature, pressure, speed, feed rate, cycle time, tool condition. Predictive Quality identifies which parameter combinations have historically led to quality problems.
2. machine learning for pattern recognition
ML models recognize complex, non-linear relationships between process parameters and quality results - relationships that are not visible to humans. A model learns from historical data: "If temperature > X and pressure < Y and tool age > Z, then probability of error = 73%."
3. inline integration for real-time forecasting
The trained model is integrated into the production line. At each production step, the current process parameters are recorded, transferred to the model and a probability of error is calculated - in real time, while the part is still in the line.
What predictive quality is not
- No substitute for quality testing: Predictive quality predicts probabilities, not certainties. Inspections remain necessary - but they can be used in a more targeted manner.
- Not a black box: good predictive quality systems show why a part is classified as risky (feature importance). This is crucial for process improvement.
- No guarantee of accuracy: A model with 95% accuracy means that 5% of predictions are incorrect. Predictive quality reduces errors, but does not eliminate them.
| 72 % | Reduction of undetected errors through inline prediction |
| Ø 34 % | Reduction in inspection effort through risk-based inspection planning |
| 2,3× | ROI in the first year with successful implementation |
| 68 % | Predictive quality projects fail due to data quality, not ML |
Differentiation: Predictive quality vs. SPC vs. inline testing
Predictive quality is not the only approach to quality assurance during production. The differentiation from other methods clarifies when which approach makes sense.
| SPC (Statistical Process Control) | Inline testing | Predictive quality | |
|---|---|---|---|
| Basic principle | Statistical monitoring of process parameters on control charts | 100% inspection of every part during production | ML-based forecast of the probability of defects |
| Data basis | Process parameter samples | Inspection result of each part | Process parameters + historical quality results |
| Detection logic | Rule-based: Intervention limits, trend rules | Measurement: Actual value vs. tolerance | Pattern-based: Parameter constellations |
| What is recognized? | Process drift, outliers | Defective parts (after creation) | Risk of error (before occurrence) |
| Typical use | Process monitoring, early warning | Quality assurance, reject sorting | Adaptive inspection planning, process control |
When to use which approach?
SPC is useful when:
- Processes are to be kept stable
- Reaction to drift is sufficient
- Few, clearly defined process parameters are monitored
Inline testing is useful if:
- 100 % proof of quality is required
- Testing is possible quickly and cost-effectively
- defects can be reliably measured
Predictive quality makes sense when:
- Many process parameters interact
- Testing is expensive or destructive
- Defects only become visible at a late stage
- Data is already available but unused
The approaches are not mutually exclusive - they complement each other. Predictive quality can build on SPC data and control inline testing based on risk.
What data predictive quality needs
The most common reason for failed predictive quality projects is not an incorrect ML model, but missing or unlinked data. Before talking about algorithms, the database must be checked.
The three data categories for predictive quality
| Data category | Examples | Source | Critical requirement |
|---|---|---|---|
| Process parameters | Temperature, pressure, speed, feed rate, cycle time, tool age | PLC, MES, sensors | Available per serial number or cycle |
| Quality data | Test result (OK/NOT OK), measured value, error code, Cpk | QMS, test bench, laboratory | Linked per serial number |
| Context data | Shift, operator, batch, tool ID, environmental conditions | MES, ERP, manual | Consistently recorded, can be referenced |
The critical link: serial number as key
Predictive quality learns from the question: "Which process parameters produced which quality result?"
This question can only be answered if the process parameters and quality result can be assigned to the same unit. This requires a continuous key - typically the serial number or a combination of batch + cycle number.
Problem in practice: In many companies, process parameters are stored in the MES, inspection results in the QMS and batch information in the ERP - without a common key. The data exists, but it cannot be linked.
Data quality checklist for predictive quality
| dimension | Minimum requirement | Check question |
|---|---|---|
| Link | Process parameter + quality characteristic per unit | Can I say for each serial number: these parameters → this result? |
| Completeness | < 5 % missing values for key parameters | How many data points have NULL values for temperature, pressure, etc.? |
| Time stamp | Machine set, ISO 8601, synchronized | Are process timestamp and inspection timestamp consistent? |
| Sample size | At least 500-1,000 units, of which 50-100 are defects | Do we have enough positive examples AND enough negative examples? |
| Labeling | Clear quality classification (O.K./N.K. or measured value) | Is the quality result clearly defined - no ambiguous categories? |
| Depth of history | At least 3-6 months, better 12 months | Does the data set cover seasonal fluctuations, batch changes, tool changes? |
Typical data sources and their suitability
| Data source | Typical content | Suitability for predictive quality |
|---|---|---|
| PLC / control system | Cycle data, target/actual values, machine states | ✓ High - raw data with high resolution |
| MES | Production order, process parameters, time stamp, serial number | ✓ High - structured and linked |
| QMS | Test results, error coding, Cpk values | ✓ High - labels for model training |
| ERP | Batch, supplier, material, order | ○ Medium - context data, rarely real-time |
| Excel / manual | Various, often unstructured | ✗ Low - data quality mostly insufficient |
| Sensor technology (IIoT) | Vibration, temperature, current, acoustics | ✓ High - if integrated into database |
The ML workflow: from raw data to predictive model
Machine learning sounds complex - but the basic workflow is structured and comprehensible. The following six phases describe how a functioning predictive quality model is created from raw data.
PHASE 1 | DURATION: 1-2 weeks
Problem definition and target value
What exactly should the model predict?
Activities:
- Define target variable: Probability of error, quality class, specific characteristic (e.g. dimensional accuracy)?
- Binary classification (OK/NOK) or regression (predict measured value)?
- Business case: What is the value of a correct prediction? What is the cost of a false negative (unrecognized error)?
- Narrow down the scope: A product, a line, a quality feature as a pilot
Output: Documented problem definition with target value, scope and success criteria
PHASE 2 | DURATION: 2-4 weeks
Data collection and integration
What data do we need - and how do we get it together?
Activities:
- Identify data sources: MES, QMS, PLC, ERP, sensors
- Data extraction: SQL queries, API connection, CSV export
- Data linking: Establish common key (serial number)
- Create data set: One row = one unit, columns = features + target variable
Output: Integrated raw data set with process parameters and quality results per unit
PHASE 3 | DURATION: 2-3 weeks
Data cleansing and feature engineering
How do we make the data modelable?
Activities:
- Dealing with missing values: Remove, impute or as a separate category?
- Identify outliers: Plausibility check, ±3σ filter
- Feature engineering: Create derived features (e.g. temperature gradient, tool age category)
- Normalization: Scaling of features for model stability
- Train/test split: 70-80% training, 20-30% test - chronological, not random
Output: Cleaned, feature-engineered data set, split into training and test
PHASE 4 | DURATION: 1-2 weeks
Model selection and training
Which model fits - and how good is it?
Activities:
- Select model candidates: Logistic regression, random forest, gradient boosting, XGBoost
- Train baseline model: Simple model as reference
- Hyperparameter tuning: Grid Search or Random Search for optimal configuration
- Cross-validation: K-Fold (k=5 or 10) for robustness testing
Typical model selection according to data situation:
| Data situation | Recommended model | Justification |
|---|---|---|
| < 1,000 data points, few features | Logistic regression | Robust with little data |
| 1,000-10,000 data points, medium complexity | Random forest | Good balance of accuracy/interpretability |
| > 10,000 data points, many features | XGBoost, LightGBM | Highest accuracy with large data sets |
| Sequential data (time series) | LSTM, Transformer | Takes into account temporal dependencies |
Output: Trained model with documented hyperparameters
PHASE 5 | DURATION: 1-2 weeks
Model evaluation and interpretation
How good is the model - and why does it make which decisions?
Activities:
- Performance metrics on test data: Accuracy, Precision, Recall, F1-Score, AUC-ROC
- Analyze Confusion Matrix: How many false positives, how many false negatives?
- Threshold value optimization: At what probability is a part classified as a "risk"?
- Feature Importance: Which process parameters have the greatest influence?
- SHAP analysis: Why was this part classified as high-risk?
Critical metrics for predictive quality:
| Metric | Meaning | Target value (orientation) |
|---|---|---|
| Recall (Sensitivity) | Percentage of actual errors that were detected | > 85 % |
| Precision | Percentage of risk warnings that were actually errors | > 70 % |
| F1 score | Harmonic mean of precision and recall | > 0,75 |
| AUC-ROC | Total model discriminatory power | > 0,85 |
Output: Evaluated model with documented performance and interpretable results
PHASE 6 | DURATION: 2-4 weeks
Deployment and monitoring
How does the model get into production - and does it stay good?
Activities:
- Model deployment: integration into MES, edge device or cloud service
- Real-time inference: prediction for every production cycle
- Monitoring: Continuous monitoring of model performance (drift detection)
- Feedback loop: New test results → model retraining
- Threshold adjustment: Based on practical feedback
Output: Productive model with monitoring and retraining process
5 typical use cases in production
Predictive quality is not an end in itself - the benefits arise from specific use cases. The following five use cases show where predictive quality has the greatest leverage in practice.
USE CASE 01
Inline fault prediction with real-time warning
"This part has a 78% probability of dimensional deviation - check immediately."
Situation: Injection molding line, 12 process parameters per cycle. Dimensional deviation is only detected in the final inspection - 40 cycles later.
Predictive quality approach:
- Model trained on historical data: Process parameters → dimensional deviation yes/no
- Real-time prediction for each cycle
- For error probability > 60 %: Warning to operator + part marked for immediate inspection
Result:
| Before | After |
|---|---|
| 0.8 % scrap | 0.2 % rejects |
| Error detected after Ø 18 min. | Error detected after Ø 12 sec. |
USE CASE 02
Adaptive test planning (risk-based)
"Parts with low probability of error: random sample. Parts with high probability: 100%."
Situation: Assembly line, non-destructive final inspection possible, but time-consuming (45 sec./part). Current: 100 % inspection.
Predictive quality approach:
- Model predicts defect probability per part
- Inspection strategy: < 20 % → random sample (10 %). 20-60 % → normal inspection. > 60 % → extended inspection
- Inspection capacity is focused on risk parts
Result:
| Before | After |
|---|---|
| 100 % testing | 38 % Inspection |
| 0 Error slipped through | 0 Defects slipped through |
| Inspection costs 100 % | Inspection costs 41 % |
USE CASE 03
Replacement for destructive testing
"Instead of destroying the part, we predict the strength from the process parameters."
Situation: Weld seam strength test, destructive. Current: Sample 1 of 50, uncertainty for the other 49.
Predictive quality approach:
- Model trained on: Welding parameters (current, speed, gas) → Tensile strength (from destructive test)
- Regression model predicts strength for each part
- Destructive testing only for model validation and for predicted risk
Result:
| Before | After |
|---|---|
| 2 % destructively tested | 0.5 % destructively tested |
| 49 of 50 parts without verification | Each part with predicted strength |
| Ø 3 complaints/month | Ø 0.4 complaints/month |
USE CASE 04
Early warning of process drift
"The process is drifting - in 200 cycles Cpk will fall below 1.33."
Situation: Milling process, Cpk value drops slowly with tool wear. Current: Reaction only at Cpk < 1.33.
Predictive quality approach:
- Time series model on Cpk curve + tool age + process parameters
- Forecast: When will Cpk < 1.33 be reached?
- Early warning 2-4 hours before expected undershoot
Result:
| Before | After |
|---|---|
| Reaction with Cpk < 1.33 | Warning 3 hours before Cpk < 1.33 |
| Ø 120 parts in the limit range | Ø 8 parts in the limit range |
| Unplanned tool changes | Planned tool changes |
USE CASE 05
Root cause analysis through feature importance
"The three most important drivers for rejects are Coolant temperature, tool age, shift change break."
Situation: scrap rate fluctuates between 0.3 % and 1.2 % - with no recognizable pattern. 14 process parameters recorded.
Predictive quality approach:
- Classification model for all parameters → rejects yes/no
- Feature Importance and SHAP analysis: Which parameters drive scrap?
- Identification of the top 3 drivers for targeted measures
Result:
| Insight | Measure |
|---|---|
| Coolant temperature > 28°C correlates with 3× reject rate | Coolant temperature sensor + warning at > 26°C |
| Tool age > 8,000 cycles: significant increase | Tool change interval reduced to 7,500 |
| First 30 minutes after shift change: increased scrap rate | Start-up procedure with reference part introduced |
Predictive quality is not a technology project - it is a quality improvement project that uses technology. The value is not created by the model, but by the measures that follow from the findings.
- Amadeus Lederle, CTE, CSP Intelligence GmbH
Prerequisites: When is your company ready?
Predictive quality does not require a perfect database - but it does require a sufficient one. The following checklist shows whether your company meets the minimum requirements.
Readiness check for predictive quality
| Dimension | Minimum requirement | ✓ Ready | Not yet ✗ Not yet |
|---|---|---|---|
| Process parameters digital | At least 5 relevant parameters per unit recorded digitally | Parameters are saved automatically for each cycle | Parameters only recorded on paper or not at all |
| Quality results digitally | Test results (O.K./N.K. or measured values) in the system | QMS or MES with structured test reports | Inspection results in paper form or Excel islands |
| Linking | Common key (serial number) for parameters + quality | Serial number used throughout | No clear assignment possible |
| Depth of history | At least 3 months of data, better 6-12 months | Data has been stored for > 6 months | Data regularly deleted or available for < 3 months |
| Proportion of errors | At least 50 documented error cases in the data set | Rejects/NIOs are systematically recorded and categorized | Too few error cases or not categorized |
| Data quality | < 10 % missing values in key fields | Mandatory fields technically enforced, time stamp set by machine | High proportion of missing values, manual input without validation |
| Resources | At least 1 person with process knowledge + data affinity (20% FTE for 3 months) | Quality/process engineer with Excel/SQL skills available | No dedicated resource, only "on the side" |
If requirements are not met
If one or more dimensions show "Not yet", this is no reason to write off Predictive Quality - it shows where investments need to be made first.
| Gap | First step |
|---|---|
| Process parameters not digital | Set up PLC data export, check MES connection |
| No link | Introduce serial number as a mandatory field in MES and QMS |
| Too little historical data | Start today with systematic recording - the basis will be there in 3-6 months |
| Too few error cases | That's good! Extend scope if necessary (several lines, longer period) |
| Insufficient data quality | Define mandatory fields, check timestamp synchronization |
The 6-step implementation path for
predictive quality does not have to start as a major project.
The following implementation path is designed for a pragmatic proof of concept - 8-12 weeks, one quality feature, one product.
STEP 1 | DURATION: 1 week
Define pilot scope
Activities:
- Select one product, one line, one quality feature
- Criteria for pilot: enough data, measurable problem, dedicated process owner
- Define target value:
- Defect classification or measurement prediction?
- Define success criteria:
- What model performance justifies follow-up?
Output:
Documented pilot scope with target value and success criteria
STEP 2 | DURATION: 2 weeks
Extract and link data
Activities:
- Export process parameters from MES/PLC (6-12 months)
- Export quality results from QMS
- Establish link via serial number
- Validate data set:
- Completeness, consistency, plausibility
Output:
Linked analysis data set
STEP 3 | DURATION: 2 weeks
Exploratory data analysis
Activities:
- Descriptive statistics: Mean values, distributions, correlations
- Visualization: Time series, box plots, scatter plots
- Identify patterns:
- Are there recognizable correlations between parameters and quality?
- Prioritize feature candidates
Output:
Analysis report with identified patterns and feature candidates
STEP 4 | DURATION: 2 weeks
Train and evaluate model
Activities:
- Train/test split (temporal, not random)
- Baseline model: Logistic regression or random forest
- Performance metrics: Recall, Precision, F1, AUC-ROC
- Analyze feature importance
Output:
Trained model with documented performance
STEP 5 | DURATION: 2 weeks
Validation on the store floor
Activities:
- Apply model to new data (not used in training)
- Compare predictions with actual results
- Involve process experts:
- Are the identified drivers plausible?
- Analyze false positives and false negatives
Output:
Validated model with store floor feedback
STEP 6 | DURATION: 2 weeks
Decision and next steps
Activities:
- Evaluate results against success criteria
- Calculate business case:
- What savings are realistic with rollout
- ? Go/No-Go for productization
- Roadmap for rollout or further pilot
Output: Decision + roadmap
| 8-12 weeks | Typical proof of concept duration |
| 1 FTE × 30-40 % | Typical resource requirements (process engineer + data affinity) |
| Ø 67 % | PoCs leading to productization |
| < € 10,000 | Typical PoC costs (without license costs |
)
Limits and typical errors
Predictive quality is not a panacea. The method has clear limitations - and typical mistakes during implementation can be avoided if you are aware of them.
Limits of predictive quality
| Limit | Explanation | Consequence |
|---|---|---|
| Only known failure modes | ML models recognize patterns that occur in the training data - new, unknown failure modes are not predicted | Predictive quality supplements but does not replace systematic FMEA and process control |
| Correlation ≠ causality | A model finds statistical correlations, not physical causes | Feature Importance shows correlation - process understanding is necessary to confirm causality |
| Data quality as a bottleneck | Garbage in = garbage out. Incomplete or incorrectly linked data provides incorrect models | Data quality is a prerequisite, not a by-product |
| Model drift | Processes change (new material, tool change, seasonal effects) - the model is left behind | Continuous monitoring and regular retraining are mandatory |
| False positives/negatives | No model is perfect - there are always false alarms and overlooked errors | Threshold values must be tailored to the use case (more alarms or fewer?) |
Typical errors during implementation
| Error | Why it happens | How to avoid it |
|---|---|---|
| Scope too broad | "We want to predict all defects on all lines" | Start with one feature, one line, one product |
| Data integration underestimated | "The data is there" - but not linked | Allow 50% of the PoC time for data integration |
| Black box model without interpretation | Model says "error", but nobody understands why | Feature Importance and SHAP analysis right from the start |
| No store floor involvement | Data scientists build model, process experts are not consulted | Involve process engineers from day 1 |
| One-off training | Model is trained, then never touched again | Define monitoring + retraining cycle |
| Overengineering | "We need deep learning and real-time streaming" | Random forest on clean data beats LSTM on bad data |
PRACTICAL TIP
The data basis for predictive quality
Predictive quality rarely fails because of the algorithm - but because of a missing or unlinked database. IPM solves precisely this problem: all process parameters, test results and traceability data are stored in an integrated database, with the serial number as the continuous key.
For predictive quality, this means
- Linking out-of-the-box: process parameters + quality result + context data per serial number - no subsequent integration necessary
- Machine-set time stamp: Synchronized, ISO 8601, no manual input errors
- Mandatory fields technically enforced: No missing values for key parameters
- Unlimited history depth: All data is saved, not deleted after 90 days
- ML export-capable: CSV, Parquet or direct database query for Python/R
The result: Data integration - typically 50% of the effort of a predictive quality PoC - is eliminated. Instead of weeks for data collection: start analyzing immediately.
Frequently asked questions
Do we need a data science team for predictive quality?
Not to get started. An initial proof of concept can be carried out by a process engineer with basic Python knowledge - or with low-code ML platforms (DataRobot, Azure AutoML, H2O). The core competence is not ML, but process understanding: Which parameters are relevant? Which data is reliable? What do the results mean? External expertise can be useful for productization and scaling - but the PoC should be driven internally.
How many data points do I need as a minimum?
As a rule of thumb: at least 500-1,000 data points in total, including at least 50-100 error cases (positive class). This becomes difficult with highly unbalanced data (0.1 % rejects) - techniques such as oversampling (SMOTE) or adapted loss functions can help here. More important than the absolute number is the variability in the data: Does the data cover different batches, tools, shifts, seasons?
What is the difference between classification and regression?
- Classification: Model predicts a category (e.g. OK/NOT OK, defect type A/B/C)
- Regression: Model predicts a continuous value (e.g. strength in N/mm², dimensional deviation in mm)
Classification is more common for predictive quality - but regression can be more valuable if the specific measured value is important (e.g. replacement for destructive testing).
How do we explain the model decision?
Modern ML models are no longer black boxes. Two approaches:
- Feature Importance: Shows which parameters are the strongest drivers overall (e.g. "Temperature explains 34% of the variance")
- SHAP values: Show for each individual prediction which parameters have acted in which direction (e.g. "For this part: temperature was 3°C too high → +18 % probability of error")
Interpretability is crucial for acceptance on the store floor and for deriving improvement measures.
How often does the model need to be retrained?
That depends on the process stability. Typical intervals:
- Stable process, few changes: Every 3-6 months
- Frequent batch changes, tool changes: Monthly
- Continuous drift: Online learning or weekly retraining
Monitoring is more important than a fixed rhythm: if the model performance drops significantly on new data, retraining is necessary - regardless of the calendar.
What does the introduction of predictive quality cost?
The costs vary greatly depending on the scope and data situation. Orientation:
| Phase | Typical costs |
|---|---|
| PoC (one feature, 8-12 weeks) | 5,000-20,000 € (internal, without tools) |
| Productization (one use case) | 20,000-80,000 € (incl. integration) |
| Scaling (several lines/products) | 50,000-200,000 €/year (incl. platform) |
The ROI results from: reduced rejects, lower inspection effort, avoided complaints, earlier defect detection. Typical: 2-3× ROI in the first year of successful implementation.
Can we also use predictive quality for predictive maintenance?
Yes - the basic logic is identical. The difference:
- Predictive quality: Predicts product quality (rejects, dimensional deviations)
- Predictive maintenance: Predicts machine condition (failure, maintenance requirements)
Both use process parameters and historical patterns. The same data is often relevant - and a common data model can serve both use cases.
15 years of experience in industrial software architecture and system integration. Amadeus has supported numerous legacy migration projects in the manufacturing industry across Germany, Austria, and Switzerland—from the initial assessment to the controlled decommissioning of the last legacy system.
