Models

AI Models are based on model configurations and trained on datasets. They are used by the AI Server to detect anomalies in live data.

To access and create model configs, navigate to AI > Model.

createModelConfigList
AI: Model config list

Creating model configs

To create a model config, click the blue plus button. This opens a form:

createModelConfig
AI: Create model config
  • Name: Name of the model config.
  • Dataset config: Dataset config to access a dataset.
  • Dataset: Dataset of the provided dataset config.
  • AI Trainer: Microservice to use as AI Trainer.
  • Architecture: One of two available architectures: LSTM Autoencoder or GRU Autoencoder.
**Architecture**: To learn more about the LSTM and GRU Autoencoder, consult **TensorFlow**.

Click Create to continue.

Configuring model configs

To access the config of a model, click an entry in the list. A model config consists of several options:

General

createModelConfigGeneral
Model config: General
  • Name: Name of the model config.
  • Description: Description of the model config.
  • Dataset: The dataset on which the model is trained. Click to select another dataset available in the selected dataset config.

Architecture

modelConfigArchitecture
Model config: Architecture
  • Architecture: Selected during creation. Can be changed here along with the latent space size factor.
  • Latent space size factor: Multiplied with the number of features to determine the size of the latent space.
  • Model depth: Number of layers of the encoder and decoder.
  • Model width: Multiplied with the number of features to determine the base number of neurons in an encoder and decoder.
  • Model slope: Factor between 0 and 1 that determines how steep the encoder and decoder layers approach the latent space.
  • Activation: Select between ReLU or Sigmoid .
  • Use dropout: Only used during training. When enabled, a layer is inserted that randomly deactivates connections between neurons to prevent overfitting .

Training

Configuring training options
Model config: Training
  • Epochs: Number of epochs to train.
  • Loss: Function to evaluate model performance.
  • Optimizer: Optimization algorithm: SGD (Stochastic Gradient Descent) or Adam .
  • Retrain on data update: Toggle to retrain the model when a new dataset is extracted.
  • AI Trainer: Currently selected AI Trainer.
**Loss**: DataForge supports several functions to determine the loss. For details, consult **TensorFlow**.

Preprocessing

Options to adjust the hyper-parameters of the model.

modelConfigPreprocessing
Model config: Preprocessing
  • Scaler: Currently only the MIN_MAX algorithm is supported.
  • Use outlier clipping: Toggle to exclude extreme values.
  • Outlier threshold: Threshold for clipping values. Smaller values mean stronger clipping and should not fall below 1. Recommended value: 3.
  • Dampening mode: One of three modes: Use training boundaries (No Dampening), Extended boundaries (Dampening), Always Clip (Infinite Dampening).
  • Dampening factor: Available only if dampening mode is set to Extended boundaries (Dampening).
  • Window size: Number of time steps considered for one model step.
  • Time step: Interval (in seconds) between rotations of the sliding window. Must be less or equal to the smallest interval of an item in the dataset to prevent loss of data.
  • Batch size: Determines the number of batches based on batch size, time step, and dataset duration.

Storage options

Storage options organize models by retention period. Define how long a model is kept in days and/or hours. If older than the defined time, it is deleted.

configStorage
Configure dataset: Storage options
The default setting `Days: 0, Hours: 0` keeps datasets indefinitely.

Models

Generate and view models in this section.

models
Model config: Models

Create model

Once configuration is complete, click Train now to create a model. The model appears in the list and can be inspected by clicking it.

Model details

The status and name of the model are shown at the top of the page. A model can have two states: Training completed and Training incomplete.

As long as the model is being trained, only limited information is visible. Once the model has the status Training completed, additional information is displayed.

Model configuration

This section shows all configurations used to train the model. Essentially, a summary of the dataset and model config at training time.

Model tests

Once a model is fully trained, it can be tested immediately on previously extracted datasets to evaluate the loss.

modelsTest
Model details: Model test

Click Create new test to open a dialog:

modelsCreateTest
Model details: Create test
  • Name: Name of the model test.
  • Description: Description of the model test.
  • Dataset: Dataset for the model test.

Click Create to finish the process.

After the test is created, DataForge applies it to the specified dataset. Results can be viewed in the test details, accessible by clicking the test.

Model test details

This page shows the status, name of the test, and three sections: General, Evaluation loss, and Job log. The status can be Test completed or Currently training.

General

Displays the dataset used for testing.

modelTestGeneral
Model test details: General
Training loss

Overview of how accurately the model processes data. For the supported loss functions, smaller values indicate higher accuracy.

modelTestEvaLoss
Model test details: Evaluation loss

The loss function can be selected in the top right to see its effect on evaluation:

modelTestEvaLossFunc
Model test details: Evaluation loss function select
Job log

A completion bar at the top shows test progress. The job log lists each step carried out to test the model. Steps can be expanded for details.

modelTestJobLog
Model test details: Job log
  • Processing time: Duration of the step.
  • Date and time: When the step was processed.
  • Type: Event type.
  • Event: Event name.
  • Microservice: Responsible microservice.
  • Full event: Full event stack trace.

Training loss

Shows model performance during training relative to loss. The x-axis indicates the number of batches trained.

Training loss visualization
Model details: Training loss

Hovering over a datapoint provides additional details.

The graph supports zoom, reset zoom, and save as image. A bar below the graph also provides data zoom with start and end values.

Job log

A completion bar at the top shows training progress. The job log lists each step carried out to create the model. Steps can be expanded for details.

modelsJobLog
Model details: Job log
  • Processing time: Duration of the step.
  • Date and time: When the step was processed.
  • Type: Event type.
  • Event: Event name.
  • Microservice: Responsible microservice.
  • Full event: Full event stack trace.

Delete a model

Delete a model by clicking the bin icon.