# Algorithms grouped by purpose

Explore the following categories to find the appropriate algorithms for processing your data.

Use clustering algorithms to assign each data point to a group (cluster) based on the similarities with other data points. For some algorithms, you need to specify the number of clusters to be found as a parameter. Other algorithms detect the number of clusters themselves.

The following algorithms are available:

- DCPY.AFFINITYPROPCLUST(max_iter, convergence_iter, damping, preference, columns)
- DCPY.AGGLOCLUST(n_clusters, affinity, linkage, columns)
- DCPY.AUTOCLUST(max_clusters, columns)
- DCPY.BIRCHCLUST(n_clusters, threshold, branching_factor, columns)
- DCPY.DBSCANCLUST(min_samples, eps, columns)
- DCPY.KMEANSCLUST(n_clusters, random_state, init, n_init, max_iter, columns)
- DCPY.MEANSHIFTCLUST(cluster_all, bandwidth, columns)
- DCPY.MINIBATCHKMEANSCLUST(n_clusters, random_state, init, n_init, batch_size, max_no_improvement, max_iter, columns)
- DCPY.SPECTRALCLUST(n_clusters, random_state, n_init, columns)
- MLLIB.BICLUSTER(imputer, n_clusters, seed, columns)
- MLLIB.CLUSTER(imputer, n_clusters, n_iter, columns)
- MLLIB.GMMCLUSTER(imputer, n_clusters, columns)

Use correlation algorithms to decide whether there is any correlation between the data from two columns. The resulted correlation coefficient can have values between -1 and 1. A value closer to 1 indicates a strong positive linear correlation. A value around 0 indicates no correlation. A value close to -1 indicates a strong negative linear correlation.

The following algorithms are available:

Use the algorithms for outliers detection to identify the data points that deviate from the overall pattern of your data points. The algorithm assigns a value to the outlier data points and another value to the inlier data points. For example, the DCPY.LOFOUTLIER algorithm assigns the value '-1' to outliers and the value '1' to inliers.

The following algorithms are available:

Use regression algorithms to predict the values of the dependent variable from independent variables.

The MLLIB.LINEARREG(fit_intercept, standardize, columns) algorithm is available.

Use scaling algorithms to reduce the data values to a specific range using a mathematical transformation. For example, the DCPY.MAXABSSCALER algorithm reduces the absolute data values to the [0, 1] range by dividing each value by the largest absolute value in the column.

The following algorithms are available:

Use transformation algorithms to apply various transformations to your data, such as shrinking the distances between marginal outliers and inliers (DCPY.QUANTILETRANSFORM) or imputing missing values in a column based on a specific strategy (DCPY.IMPUTER).

The following algorithms are available:

The SCRIPT function is used to run your custom algorithm based on the available AI connection. For details, see Add script calculations.