Create and edit the models¶
Insights Hub Quality Prediction Train Model Video
This chapter describes the steps to create a prediction model,
To create a Predictive model, follow these steps:
-
From the left navigation, click "Models". Prediction Models overview page is displayed.
-
Click
to add a new prediction model.
-
In "Select model information", perform the following actions:
- Enter the "Model name" and the "Description of model version".
- Select and configure the asset and its corresponding aspect with the process data used for model training and click “Next”.
-
In "Select input features", perform the following actions:
- Select the time frame to obtain the IoT timeseries process data for training.
- Click
and select the identifier from the identifier selection window.
- Check the box to use timestamp as an identifier.
- Click
and select the features from the feature selection window.
- Enable the toggle to include the additional features "Time Based", "Count" and "Gap count".
- Click "Apply".
- The data chart is displayed according to the selected input features on the right-side panel as shown in the graphic and click “Next”.
- Select the time frame to obtain the IoT timeseries process data for training.
-
In "Filtering features", perform the following actions:
- Click
and add the process variables(features) from the IoT timeseries model from the feature selection window and Click "Select feature".
- Enter the filter values to select data corresponding to the productive phases of the process (for example select relevant values of the process status or machine programs).
- Check the box and enter the outlier value to remove outlier values during manufacturing phase.
- Click "Apply" and click "Next".
- Click
-
In "Aggregation", perform the following actions:
- Check the "Apply aggregation" check box to enable aggregation if multiple data points are collected during the production phase.
- Select one or more combination of calculation methods (Minimum, Mean, Maximum and Standard Deviation) to formulate the aggregated variables of the manufacturing phase for each part number.
- Select "Remove transients" check box and enter the value 1, if the first and the last data point of the production phase have to be excluded from the calculations.
Info
- "Remove transients" is selected if there are any artificial outliers in the ramp up and ramp down transient phases due to the delays in the data transfer from the physical machine.
- "Aggregation" is not selected if the process data is already aggregated or only single datapoint data is collected during the manufacturing phase.
- Check the "Apply aggregation" check box to enable aggregation if multiple data points are collected during the production phase.
-
In "Select quality data", perform the following actions:
- Select the Quality Data file in csv format to train the model from Integrated Data Lake and review the file structure.
- In "File schema configuration", select the feature type for the features (quality variables).
- Click "Apply" and click “Next”.
Info
The quality data file should already be uploaded to the selected folder in the Integrated Data Lake by the user. The file should contain defined measurement parameter (prediction target) representing product quality and required variables to be linked to the process parameters for building a process model. For information on feature and feature types, refer to Prerequisites.
- Select the Quality Data file in csv format to train the model from Integrated Data Lake and review the file structure.
-
In “Select quality variables”, perform the following actions:
- Select the process parameters for the quality variables. For information on quality variables, refer to Prerequisites.
- Click "Apply" and click “Next”.
-
In “Configure model”, select the model type and parameters in the user dialog window as shown below. The following are the regression model types available to be selected to train the model:
- Linear Regression
- Decision Tree
- Random Forest
- Gradient Boosting
- XGBoost
- Neural Network
If “Optimized Parameters” is not selected, the user can enter the values of ML model parameters. Default model type and optimized parameters are recommended.
Click “View Training results” to start a model training. The model status changes to “In Progress” and the user is navigated to the "Training Board".