What is predictive quality? Quality prediction in production

Written by Amadeus Lederle | 1.6.2026

In traditional manufacturing, quality testing comes at the end. The part is produced, inspected - and then it becomes clear whether it fits or not. If not: rejects, rework, complaint. The reaction to a quality problem is always retrospective: something has gone wrong, now we analyze why.

Predictive quality reverses this logic. Instead of checking whether a part is good after production, it is predicted during production whether a part will be good - based on process parameters, machine data and historical samples. Quality inspection shifts from final inspection to process control.

That sounds like a dream of the future. But it is not. The underlying technologies - machine learning, process data integration, real-time monitoring - are mature. The limiting factor is not the technology, but the database: if you want to introduce predictive quality, you don't need AI specialists, you first need complete, linked process data.

This article explains what predictive quality actually means, what data you need, how typical ML workflows work and where the method is already delivering measurable results in practice.

THE MOST IMPORTANT FACTS IN BRIEF

Predictive quality forecasts the quality of a part during production - not afterwards. It is based on process parameters, machine data and historical quality results, which are condensed into a probability of error using ML models.

The critical success factor is not the ML model, but the database. Predictive quality requires complete, time-synchronized data: Process parameters per serial number, quality characteristic per serial number, time stamp machine-set. Without this link, there is no basis for training.

Typical use cases are inline defect prediction, adaptive inspection planning and early warning of process drift. The greatest effects arise in the case of high inspection costs (destructive testing, laboratory analyses) or high scrap costs (expensive components, late defect detection).

Getting started is easier than expected: an initial proof of concept with one characteristic, 3-6 months of historical data and a random forest model delivers initial results in 2-4 weeks - without a data science team.

IN A NUTSHELL

Predictive quality is not a replacement for quality inspections - it is an advance of them. The aim is not to abolish inspections, but to detect errors before they occur.

The key question with predictive quality is: Which process parameter constellation is most likely to lead to a quality problem? The answer is in your data - if it is linked.

ML models are only as good as their training data. A perfect algorithm on bad data delivers bad predictions. Data quality is the bottleneck, not model complexity.

CONTENT OF THIS ARTICLE

What predictive quality actually means
Differentiation: Predictive quality vs. SPC vs. inline testing
What data predictive quality needs
The ML workflow: from raw data to predictive model
5 typical use cases in production
Prerequisites: When will your company be ready?
The 6-step implementation path
Limitations and typical errors
Predictive quality-capable database
Frequently asked questions

What predictive quality actually means - and what it doesn't mean

Predictive quality is a data-driven approach that predicts the likely quality of a product before or during production - based on process parameters, machine conditions and historical patterns.

	Traditional quality inspection	Predictive quality
Point in time	After production (final inspection)	During production (inline prediction)
Question	Is this part good?	Will this part be good?
Data basis	Inspection result of the individual part	Process parameters + historical patterns
Reaction time	Reactive: Defect has already occurred	Proactive: Intervention before defect occurs
Inspection scope	100 % or random sample - independent of risk	Risk-based: more testing with high error probability

The three core elements of predictive quality

1. process parameters as quality indicators

Every quality characteristic - dimensional accuracy, strength, surface quality - is influenced by process parameters: temperature, pressure, speed, feed rate, cycle time, tool condition. Predictive Quality identifies which parameter combinations have historically led to quality problems.

2. machine learning for pattern recognition

ML models recognize complex, non-linear relationships between process parameters and quality results - relationships that are not visible to humans. A model learns from historical data: "If temperature > X and pressure < Y and tool age > Z, then probability of error = 73%."

3. inline integration for real-time forecasting

The trained model is integrated into the production line. At each production step, the current process parameters are recorded, transferred to the model and a probability of error is calculated - in real time, while the part is still in the line.

What predictive quality is not

No substitute for quality testing: Predictive quality predicts probabilities, not certainties. Inspections remain necessary - but they can be used in a more targeted manner.
Not a black box: good predictive quality systems show why a part is classified as risky (feature importance). This is crucial for process improvement.
No guarantee of accuracy: A model with 95% accuracy means that 5% of predictions are incorrect. Predictive quality reduces errors, but does not eliminate them.

72 %	Reduction of undetected errors through inline prediction
Ø 34 %	Reduction in inspection effort through risk-based inspection planning
2,3×	ROI in the first year with successful implementation
68 %	Predictive quality projects fail due to data quality, not ML

Differentiation: Predictive quality vs. SPC vs. inline testing

Predictive quality is not the only approach to quality assurance during production. The differentiation from other methods clarifies when which approach makes sense.

	SPC (Statistical Process Control)	Inline testing	Predictive quality
Basic principle	Statistical monitoring of process parameters on control charts	100% inspection of every part during production	ML-based forecast of the probability of defects
Data basis	Process parameter samples	Inspection result of each part	Process parameters + historical quality results
Detection logic	Rule-based: Intervention limits, trend rules	Measurement: Actual value vs. tolerance	Pattern-based: Parameter constellations
What is recognized?	Process drift, outliers	Defective parts (after creation)	Risk of error (before occurrence)
Typical use	Process monitoring, early warning	Quality assurance, reject sorting	Adaptive inspection planning, process control

When to use which approach?

SPC is useful when:

Processes are to be kept stable
Reaction to drift is sufficient
Few, clearly defined process parameters are monitored

Inline testing is useful if:

100 % proof of quality is required
Testing is possible quickly and cost-effectively
defects can be reliably measured

Predictive quality makes sense when:

Many process parameters interact
Testing is expensive or destructive
Defects only become visible at a late stage
Data is already available but unused

The approaches are not mutually exclusive - they complement each other. Predictive quality can build on SPC data and control inline testing based on risk.

What data predictive quality needs

The most common reason for failed predictive quality projects is not an incorrect ML model, but missing or unlinked data. Before talking about algorithms, the database must be checked.

The three data categories for predictive quality

Data category	Examples	Source	Critical requirement
Process parameters	Temperature, pressure, speed, feed rate, cycle time, tool age	PLC, MES, sensors	Available per serial number or cycle
Quality data	Test result (OK/NOT OK), measured value, error code, Cpk	QMS, test bench, laboratory	Linked per serial number
Context data	Shift, operator, batch, tool ID, environmental conditions	MES, ERP, manual	Consistently recorded, can be referenced

The critical link: serial number as key

Predictive quality learns from the question: "Which process parameters produced which quality result?"

This question can only be answered if the process parameters and quality result can be assigned to the same unit. This requires a continuous key - typically the serial number or a combination of batch + cycle number.

Problem in practice: In many companies, process parameters are stored in the MES, inspection results in the QMS and batch information in the ERP - without a common key. The data exists, but it cannot be linked.

Data quality checklist for predictive quality

dimension	Minimum requirement	Check question
Link	Process parameter + quality characteristic per unit	Can I say for each serial number: these parameters → this result?
Completeness	< 5 % missing values for key parameters	How many data points have NULL values for temperature, pressure, etc.?
Time stamp	Machine set, ISO 8601, synchronized	Are process timestamp and inspection timestamp consistent?
Sample size	At least 500-1,000 units, of which 50-100 are defects	Do we have enough positive examples AND enough negative examples?
Labeling	Clear quality classification (O.K./N.K. or measured value)	Is the quality result clearly defined - no ambiguous categories?
Depth of history	At least 3-6 months, better 12 months	Does the data set cover seasonal fluctuations, batch changes, tool changes?

Typical data sources and their suitability

Data source	Typical content	Suitability for predictive quality
PLC / control system	Cycle data, target/actual values, machine states	✓ High - raw data with high resolution
MES	Production order, process parameters, time stamp, serial number	✓ High - structured and linked
QMS	Test results, error coding, Cpk values	✓ High - labels for model training
ERP	Batch, supplier, material, order	○ Medium - context data, rarely real-time
Excel / manual	Various, often unstructured	✗ Low - data quality mostly insufficient
Sensor technology (IIoT)	Vibration, temperature, current, acoustics	✓ High - if integrated into database

The ML workflow: from raw data to predictive model

Machine learning sounds complex - but the basic workflow is structured and comprehensible. The following six phases describe how a functioning predictive quality model is created from raw data.

PHASE 1 | DURATION: 1-2 weeks

Problem definition and target value

What exactly should the model predict?

Activities:

Define target variable: Probability of error, quality class, specific characteristic (e.g. dimensional accuracy)?
Binary classification (OK/NOK) or regression (predict measured value)?
Business case: What is the value of a correct prediction? What is the cost of a false negative (unrecognized error)?
Narrow down the scope: A product, a line, a quality feature as a pilot

Output: Documented problem definition with target value, scope and success criteria

PHASE 2 | DURATION: 2-4 weeks

Data collection and integration

What data do we need - and how do we get it together?

Activities:

Identify data sources: MES, QMS, PLC, ERP, sensors
Data extraction: SQL queries, API connection, CSV export
Data linking: Establish common key (serial number)
Create data set: One row = one unit, columns = features + target variable

Output: Integrated raw data set with process parameters and quality results per unit

PHASE 3 | DURATION: 2-3 weeks

Data cleansing and feature engineering

How do we make the data modelable?

Activities:

Dealing with missing values: Remove, impute or as a separate category?
Identify outliers: Plausibility check, ±3σ filter
Feature engineering: Create derived features (e.g. temperature gradient, tool age category)
Normalization: Scaling of features for model stability
Train/test split: 70-80% training, 20-30% test - chronological, not random

Output: Cleaned, feature-engineered data set, split into training and test

PHASE 4 | DURATION: 1-2 weeks

Model selection and training

Which model fits - and how good is it?

Activities:

Select model candidates: Logistic regression, random forest, gradient boosting, XGBoost
Train baseline model: Simple model as reference
Hyperparameter tuning: Grid Search or Random Search for optimal configuration
Cross-validation: K-Fold (k=5 or 10) for robustness testing

Typical model selection according to data situation:

Data situation	Recommended model	Justification
< 1,000 data points, few features	Logistic regression	Robust with little data
1,000-10,000 data points, medium complexity	Random forest	Good balance of accuracy/interpretability
> 10,000 data points, many features	XGBoost, LightGBM	Highest accuracy with large data sets
Sequential data (time series)	LSTM, Transformer	Takes into account temporal dependencies

Output: Trained model with documented hyperparameters

PHASE 5 | DURATION: 1-2 weeks

Model evaluation and interpretation

How good is the model - and why does it make which decisions?

Activities:

Performance metrics on test data: Accuracy, Precision, Recall, F1-Score, AUC-ROC
Analyze Confusion Matrix: How many false positives, how many false negatives?
Threshold value optimization: At what probability is a part classified as a "risk"?
Feature Importance: Which process parameters have the greatest influence?
SHAP analysis: Why was this part classified as high-risk?

Critical metrics for predictive quality:

Metric	Meaning	Target value (orientation)
Recall (Sensitivity)	Percentage of actual errors that were detected	> 85 %
Precision	Percentage of risk warnings that were actually errors	> 70 %
F1 score	Harmonic mean of precision and recall	> 0,75
AUC-ROC	Total model discriminatory power	> 0,85

Output: Evaluated model with documented performance and interpretable results

PHASE 6 | DURATION: 2-4 weeks

Deployment and monitoring

How does the model get into production - and does it stay good?

Activities:

Model deployment: integration into MES, edge device or cloud service
Real-time inference: prediction for every production cycle
Monitoring: Continuous monitoring of model performance (drift detection)
Feedback loop: New test results → model retraining
Threshold adjustment: Based on practical feedback

Output: Productive model with monitoring and retraining process

5 typical use cases in production

Predictive quality is not an end in itself - the benefits arise from specific use cases. The following five use cases show where predictive quality has the greatest leverage in practice.

USE CASE 01

Inline fault prediction with real-time warning

"This part has a 78% probability of dimensional deviation - check immediately."

Situation: Injection molding line, 12 process parameters per cycle. Dimensional deviation is only detected in the final inspection - 40 cycles later.

Predictive quality approach:

Model trained on historical data: Process parameters → dimensional deviation yes/no
Real-time prediction for each cycle
For error probability > 60 %: Warning to operator + part marked for immediate inspection

Result:

Before	After
0.8 % scrap	0.2 % rejects
Error detected after Ø 18 min.	Error detected after Ø 12 sec.

USE CASE 02

Adaptive test planning (risk-based)

"Parts with low probability of error: random sample. Parts with high probability: 100%."

Situation: Assembly line, non-destructive final inspection possible, but time-consuming (45 sec./part). Current: 100 % inspection.

Predictive quality approach:

Model predicts defect probability per part
Inspection strategy: < 20 % → random sample (10 %). 20-60 % → normal inspection. > 60 % → extended inspection
Inspection capacity is focused on risk parts

Result:

Before	After
100 % testing	38 % Inspection
0 Error slipped through	0 Defects slipped through
Inspection costs 100 %	Inspection costs 41 %

USE CASE 03

Replacement for destructive testing

"Instead of destroying the part, we predict the strength from the process parameters."

Situation: Weld seam strength test, destructive. Current: Sample 1 of 50, uncertainty for the other 49.

Predictive quality approach:

Model trained on: Welding parameters (current, speed, gas) → Tensile strength (from destructive test)
Regression model predicts strength for each part
Destructive testing only for model validation and for predicted risk

Result:

Before	After
2 % destructively tested	0.5 % destructively tested
49 of 50 parts without verification	Each part with predicted strength
Ø 3 complaints/month	Ø 0.4 complaints/month

USE CASE 04

Early warning of process drift

"The process is drifting - in 200 cycles Cpk will fall below 1.33."

Situation: Milling process, Cpk value drops slowly with tool wear. Current: Reaction only at Cpk < 1.33.

Predictive quality approach:

Time series model on Cpk curve + tool age + process parameters
Forecast: When will Cpk < 1.33 be reached?
Early warning 2-4 hours before expected undershoot

Result:

Before	After
Reaction with Cpk < 1.33	Warning 3 hours before Cpk < 1.33
Ø 120 parts in the limit range	Ø 8 parts in the limit range
Unplanned tool changes	Planned tool changes

USE CASE 05

Root cause analysis through feature importance

"The three most important drivers for rejects are Coolant temperature, tool age, shift change break."

Situation: scrap rate fluctuates between 0.3 % and 1.2 % - with no recognizable pattern. 14 process parameters recorded.

Predictive quality approach:

Classification model for all parameters → rejects yes/no
Feature Importance and SHAP analysis: Which parameters drive scrap?
Identification of the top 3 drivers for targeted measures

Result:

Insight	Measure
Coolant temperature > 28°C correlates with 3× reject rate	Coolant temperature sensor + warning at > 26°C
Tool age > 8,000 cycles: significant increase	Tool change interval reduced to 7,500
First 30 minutes after shift change: increased scrap rate	Start-up procedure with reference part introduced

Predictive quality is not a technology project - it is a quality improvement project that uses technology. The value is not created by the model, but by the measures that follow from the findings.

- Amadeus Lederle, CTE, CSP Intelligence GmbH

Prerequisites: When is your company ready?

Predictive quality does not require a perfect database - but it does require a sufficient one. The following checklist shows whether your company meets the minimum requirements.

Readiness check for predictive quality

Dimension	Minimum requirement	✓ Ready	Not yet ✗ Not yet
Process parameters digital	At least 5 relevant parameters per unit recorded digitally	Parameters are saved automatically for each cycle	Parameters only recorded on paper or not at all
Quality results digitally	Test results (O.K./N.K. or measured values) in the system	QMS or MES with structured test reports	Inspection results in paper form or Excel islands
Linking	Common key (serial number) for parameters + quality	Serial number used throughout	No clear assignment possible
Depth of history	At least 3 months of data, better 6-12 months	Data has been stored for > 6 months	Data regularly deleted or available for < 3 months
Proportion of errors	At least 50 documented error cases in the data set	Rejects/NIOs are systematically recorded and categorized	Too few error cases or not categorized
Data quality	< 10 % missing values in key fields	Mandatory fields technically enforced, time stamp set by machine	High proportion of missing values, manual input without validation
Resources	At least 1 person with process knowledge + data affinity (20% FTE for 3 months)	Quality/process engineer with Excel/SQL skills available	No dedicated resource, only "on the side"

If requirements are not met

If one or more dimensions show "Not yet", this is no reason to write off Predictive Quality - it shows where investments need to be made first.

Gap	First step
Process parameters not digital	Set up PLC data export, check MES connection
No link	Introduce serial number as a mandatory field in MES and QMS
Too little historical data	Start today with systematic recording - the basis will be there in 3-6 months
Too few error cases	That's good! Extend scope if necessary (several lines, longer period)
Insufficient data quality	Define mandatory fields, check timestamp synchronization

The 6-step implementation path for

predictive quality does not have to start as a major project.

The following implementation path is designed for a pragmatic proof of concept - 8-12 weeks, one quality feature, one product.

STEP 1 | DURATION: 1 week

Define pilot scope

Activities:

Select one product, one line, one quality feature
Criteria for pilot: enough data, measurable problem, dedicated process owner
Define target value:
Defect classification or measurement prediction?
Define success criteria:

What model performance justifies follow-up?

Output:

Documented pilot scope with target value and success criteria

STEP 2 | DURATION: 2 weeks

Extract and link data

Activities:

Export process parameters from MES/PLC (6-12 months)
Export quality results from QMS
Establish link via serial number
Validate data set:

Completeness, consistency, plausibility

Output:

Linked analysis data set

STEP 3 | DURATION: 2 weeks

Exploratory data analysis

Activities:

Descriptive statistics: Mean values, distributions, correlations
Visualization: Time series, box plots, scatter plots
Identify patterns:

Are there recognizable correlations between parameters and quality?
Prioritize feature candidates

Output:

Analysis report with identified patterns and feature candidates

STEP 4 | DURATION: 2 weeks

Train and evaluate model

Activities:

Train/test split (temporal, not random)
Baseline model: Logistic regression or random forest
Performance metrics: Recall, Precision, F1, AUC-ROC
Analyze feature importance

Output:

Trained model with documented performance

STEP 5 | DURATION: 2 weeks

Validation on the store floor

Activities:

Apply model to new data (not used in training)
Compare predictions with actual results
Involve process experts:

Are the identified drivers plausible?
Analyze false positives and false negatives

Output:

Validated model with store floor feedback

STEP 6 | DURATION: 2 weeks

Decision and next steps

Activities:

Evaluate results against success criteria
Calculate business case:

What savings are realistic with rollout
? Go/No-Go for productization
Roadmap for rollout or further pilot

Output: Decision + roadmap

8-12 weeks	Typical proof of concept duration
1 FTE × 30-40 %	Typical resource requirements (process engineer + data affinity)
Ø 67 %	PoCs leading to productization
< € 10,000	Typical PoC costs (without license costs

)

Limits and typical errors

Predictive quality is not a panacea. The method has clear limitations - and typical mistakes during implementation can be avoided if you are aware of them.

Limits of predictive quality

Limit	Explanation	Consequence
Only known failure modes	ML models recognize patterns that occur in the training data - new, unknown failure modes are not predicted	Predictive quality supplements but does not replace systematic FMEA and process control
Correlation ≠ causality	A model finds statistical correlations, not physical causes	Feature Importance shows correlation - process understanding is necessary to confirm causality
Data quality as a bottleneck	Garbage in = garbage out. Incomplete or incorrectly linked data provides incorrect models	Data quality is a prerequisite, not a by-product
Model drift	Processes change (new material, tool change, seasonal effects) - the model is left behind	Continuous monitoring and regular retraining are mandatory
False positives/negatives	No model is perfect - there are always false alarms and overlooked errors	Threshold values must be tailored to the use case (more alarms or fewer?)

Typical errors during implementation

Error	Why it happens	How to avoid it
Scope too broad	"We want to predict all defects on all lines"	Start with one feature, one line, one product
Data integration underestimated	"The data is there" - but not linked	Allow 50% of the PoC time for data integration
Black box model without interpretation	Model says "error", but nobody understands why	Feature Importance and SHAP analysis right from the start
No store floor involvement	Data scientists build model, process experts are not consulted	Involve process engineers from day 1
One-off training	Model is trained, then never touched again	Define monitoring + retraining cycle
Overengineering	"We need deep learning and real-time streaming"	Random forest on clean data beats LSTM on bad data

PRACTICAL TIP

The data basis for predictive quality

Predictive quality rarely fails because of the algorithm - but because of a missing or unlinked database. IPM solves precisely this problem: all process parameters, test results and traceability data are stored in an integrated database, with the serial number as the continuous key.

For predictive quality, this means

Linking out-of-the-box: process parameters + quality result + context data per serial number - no subsequent integration necessary
Machine-set time stamp: Synchronized, ISO 8601, no manual input errors
Mandatory fields technically enforced: No missing values for key parameters
Unlimited history depth: All data is saved, not deleted after 90 days
ML export-capable: CSV, Parquet or direct database query for Python/R

The result: Data integration - typically 50% of the effort of a predictive quality PoC - is eliminated. Instead of weeks for data collection: start analyzing immediately.

→ Arrange a demo

Frequently asked questions

Do we need a data science team for predictive quality?

Not to get started. An initial proof of concept can be carried out by a process engineer with basic Python knowledge - or with low-code ML platforms (DataRobot, Azure AutoML, H2O). The core competence is not ML, but process understanding: Which parameters are relevant? Which data is reliable? What do the results mean? External expertise can be useful for productization and scaling - but the PoC should be driven internally.

How many data points do I need as a minimum?

As a rule of thumb: at least 500-1,000 data points in total, including at least 50-100 error cases (positive class). This becomes difficult with highly unbalanced data (0.1 % rejects) - techniques such as oversampling (SMOTE) or adapted loss functions can help here. More important than the absolute number is the variability in the data: Does the data cover different batches, tools, shifts, seasons?

What is the difference between classification and regression?

Classification: Model predicts a category (e.g. OK/NOT OK, defect type A/B/C)
Regression: Model predicts a continuous value (e.g. strength in N/mm², dimensional deviation in mm)

Classification is more common for predictive quality - but regression can be more valuable if the specific measured value is important (e.g. replacement for destructive testing).

How do we explain the model decision?

Modern ML models are no longer black boxes. Two approaches:

Feature Importance: Shows which parameters are the strongest drivers overall (e.g. "Temperature explains 34% of the variance")
SHAP values: Show for each individual prediction which parameters have acted in which direction (e.g. "For this part: temperature was 3°C too high → +18 % probability of error")

Interpretability is crucial for acceptance on the store floor and for deriving improvement measures.

How often does the model need to be retrained?

That depends on the process stability. Typical intervals:

Stable process, few changes: Every 3-6 months
Frequent batch changes, tool changes: Monthly
Continuous drift: Online learning or weekly retraining

Monitoring is more important than a fixed rhythm: if the model performance drops significantly on new data, retraining is necessary - regardless of the calendar.

What does the introduction of predictive quality cost?

The costs vary greatly depending on the scope and data situation. Orientation:

Phase	Typical costs
PoC (one feature, 8-12 weeks)	5,000-20,000 € (internal, without tools)
Productization (one use case)	20,000-80,000 € (incl. integration)
Scaling (several lines/products)	50,000-200,000 €/year (incl. platform)

The ROI results from: reduced rejects, lower inspection effort, avoided complaints, earlier defect detection. Typical: 2-3× ROI in the first year of successful implementation.

Can we also use predictive quality for predictive maintenance?

Yes - the basic logic is identical. The difference:

Predictive quality: Predicts product quality (rejects, dimensional deviations)
Predictive maintenance: Predicts machine condition (failure, maintenance requirements)

Both use process parameters and historical patterns. The same data is often relevant - and a common data model can serve both use cases.

View full post