Statistics Toolbox

Machine Learning

Machine learning algorithms use computational methods to "learn" information directly from data without assuming a predetermined equation as a model. They can adaptively improve their performance as you increase the number of samples available for learning.

Machine Learning with MATLAB 3:02
Prepare data and train machine learning models with MATLAB®.

Classification

Classification algorithms enable you to model a categorical response variable as a function of one or more predictors. Statistics Toolbox offers a wide variety of parametric and nonparametric classification algorithms, such as:

An Introduction to Classification 9:00
Develop predictive models for classifying data.

You can evaluate goodness of fit for the resulting classification models using techniques such as:

Cluster Analysis

Statistics Toolbox offers multiple algorithms to analyze data using k-means, hierarchical clustering, Gaussian mixture models, or hidden Markov models. When the number of clusters is unknown, the toolbox offers cluster evaluation techniques to determine the number of clusters present in the data based on a specified metric.

Plot showing natural patterns in gene expression profiles obtained from baker’s yeast.
Plot showing natural patterns in gene expression profiles obtained from baker’s yeast. Principal component analysis (PCA) and k-means clustering algorithms are used to find clusters in the profile data.

Cluster Genes Using K-Means and Self-Organizing Maps (Example)
Learn how to detect patterns in gene expression profiles by examining gene expression data

Two-component Gaussian mixture model fit to a mixture of bivariate Gaussians.
Two-component Gaussian mixture model fit to a mixture of bivariate Gaussians.
Output from applying a clustering algorithm to the same example.
Output from applying a clustering algorithm to the same example.
Dendrogram plot showing a model with four clusters.
Dendrogram plot showing a model with four clusters.

Cluster Analysis (Example)
Use k-means and hierarchical clustering to discover natural groupings in data.

Regression

Regression algorithms enable you to model a continuous response variable as a function of one or more predictors. Statistics Toolbox offers a wide variety of parametric and nonparametric classification algorithms, such as:

Computational Statistics: Feature Selection, Regularization, and Shrinkage with MATLAB 36:51
In this webinar, you will learn how to use Statistics Toolbox to generate accurate predictive models from data sets that contain large numbers of correlated variables.

Next: Multivariate Statistics

Try Statistics Toolbox

Get trial software

Multilevel Mixed-Effects Modeling Using MATLAB

View webinar

Apply Machine Learning Techniques to Classify Data

Learn how