6.3 Using Machine Learning Models for Scoring and Prediction
The predictive model is an algorithm that you apply to streaming data to predict outcomes. In a pipeline, you use a predictive model in a scoring stage to do probability scoring.
In GoldenGate Stream Analytics, a predictive model is an ONNX file, that you upload and store in the system:
-
GGSA supports ONNX models with single-dimensional outputs of size 1. The output should be of
int
,float
,double
orBoolean
datatype.
6.3.1 Importing a Predictive Model
To import a predictive model:
- On the Catalog page, click Create New Item, and select Predictive Model from the drop-down list.
- On the Type Properties screen, enter the following details and click Next :
- Name
- Description
- Tags
- Predictive Model Type
- On the Predictive Model Details, enter the following details and click Save:
- For Predictive Model URL, upload your ONNX file.
- In the Model Version field, enter the version of this artifact. For example,
1.0
. - (Optional) In the Version Description, enter a meaningful description for your model.
- In the Algorithm field, accept the default. The algorithm is derived from the model you have uploaded.
- (Optional) In the Tool drop-down list, select the tool with which you created your model.
6.3.2 Adding a Scoring Stage
To add a scoring stage:
- Open the required pipeline in Pipeline Editor.
- Right-click the stage after which you want to add a scoring stage, click Add a Stage, and then select Scoring.
- Enter a meaningful name and suitable description for the scoring stage and click Save.
- In the stage editor, enter the following details:
- Model name: Select the predictive model that you want to use in the scoring stage
- Model Version: Select the version of the predictive model
- Mapping: Select the corresponding model fields that appropriately map to the stage fields
You can add multiple scoring stages based on your use case.