Electronics in Advanced Research Industries. Alessandro Massaro
Чтение книги онлайн.

Читать онлайн книгу Electronics in Advanced Research Industries - Alessandro Massaro страница 27

СКАЧАТЬ 1.20). In particular, the function of the training dataset is to fit the model; the validation set is a small partition of the full dataset able to previously estimate prediction error of the selected model; finally, the test set is used for testing the final model. A correct choice of the three parts depends on the SNR of the full dataset.

Schematic illustration of supervised artificial network model: partitioning of the available dataset into training set, validation set, and test set. Schematic illustration of algorithm classification and Industry 5.0 facilities. Schematic illustration of (a) regression analysis, (b) data classification, and (c) data clustering.

      The ensemble approach is an alternative method for data classification. An ensemble is a set of classifiers that learn a target function. By combining different outputs of several classifiers, the risk of selecting a poorly performing classifier is reduced. The typical ensemble procedure is provided by the following pseudocode where T denotes the original training dataset, κ is the number of base classifiers, and B is the test data:

Schematic illustration of ensemble method and classification.

       F input features are randomly selected to split at each node (step 1 of creation of random vectors).

       A linear combination of the input features is created to split at each node (step 2 of using a random vector to build multiple DTs).

       A combination of DTs is created (step 3).

Schematic illustration of ensemble method and classification.

      The RFo classification technique is also applied in image processing detecting defect features. The logic of the DT algorithm is reported by the following pseudocode:

      Decision_Tree Function. 1. Compute Gain values for all attributes and select an attribute having the highest value creating a node for that attribute. 2. Make a branch from this node for every value of the attribute. 3. Assign all possible values of the attributes to branches.

      Follow each branch partitioning the dataset to be only instances whereby the value of the branch is present (or for similar values) and then go back to 1.

Schematic illustration of (a) LSTM unit cell. (b) LSTM network and its memory.