Supervised machine learning describes the recognition of connections in data sets. In contrast to unsupervised machine learning, both the input and the output are already available as a data set. The algorithms learn (“train”) in this data the connection between input (features) and output (e.g., label).
The algorithm trained in this way can be applied to new input data to predict a result based on the learned relationships. Accordingly, these algorithms serve the purpose of carrying out regressions or classifying data. They are often used in forecasts in particular.
A particular challenge with this form of technology is generating a sufficiently large data set with input and output. In many cases, this is still associated with manual work.
Many use cases of AI discussed today fall into supervised learning – often combined with (complex) artificial neural networks (deep learning). A property of these algorithms is that their training with large amounts of data is time-consuming, but in many cases, a trained model can calculate an output for new input much faster than classic methods.
This leads, among other things, to the simulation of physical effects, e.g., B. from fluids (see non-Newtonian fluids like honey or the interaction of water with solids ), light rays, or other material properties to significant progress in near real-time. Accordingly, the potential of this application area is also very large in the energy industry.
The ANN is a simple algorithm that can be implemented quickly to assign data points to exist clusters/groups. An (e.g., Euclidean) distance to all data points in the training data set is calculated, and the respective k-nearest neighbors (with the smallest distance) are selected.
The labels of the selected k-nearest neighbors serve as the basis for the classification of the data point to be classified. The mean of the k-nearest neighbors is used in regressions and the mode in classifications.
Decision trees are ordered, directed graphs used to visualize decision rules. In the training phase, decision rules are formed by the algorithm from the features of a data set. This is achieved by breaking the features into subsets using “questions.” These “questions” (mathematically, these are simple (in)equations) aim to achieve the clearest possible separation of labels into the respective subsets. Their order is determined by the information gained. This is high when, for example, the entropy or variance of the subsets is as low as possible, i.e., they are as homogeneous as possible.
Artificial neural networks aim to replicate the neural structure of biological life. For this purpose, neurons are arranged in layers. Various layers are connected, transporting information between the neurons from the input layer via the hidden layer to the output layer. The connections between the neurons are provided with so-called “weights” and “biases,” which in turn are offset against the respective information. A neuron only passes on information to the downstream layer if the incoming information exceeds a certain threshold value (specified by an activation function). As part of the training, these “weights” and “biases” are set iteratively between the layers so that the desired output is generated. Of all the methods presented so far, ANN has the highest complexity but can, therefore, cope with very complex challenges.
The advantage of these algorithms is that the quality of the learning process results can be checked using the already-known outputs. The disadvantage is that neural networks, in particular, are a “black box,” which makes traceability difficult. In principle, however, supervised machine learning algorithms are very versatile, and neural networks, in particular, have great potential in many application areas.
The success of a company also depends on the quality of customer experiences. However, many…
Whether it's Amazon, Apple, Google, or Microsoft, each big tech giant wants to claim the…
Companies are currently implementing various sustainability measures. However, internal IT is rarely considered. The new…
AI can help companies save valuable resources by uncovering optimization potential. Using self-learning algorithms, it…
More and more companies in the finance sector are facing considerable challenges with cloud transformation.…
The number of cyber attacks on companies is increasing alarmingly. Every company is affected, and…