### What is Classification?

Classification is a form of supervised learning in which the objective is to classify observations into a discrete number of categories based on their input features.

- Machine Learning 101 (30)
- Statistics 101 (38)
- Supervised Learning (114)
- Regression (42)
- Classification (46)
- Logistic Regression (10)
- Support Vector Machine (10)
- Naive Bayes (4)
- Discriminant Analysis (5)
- Classification Evaluations (9)

- Classification & Regression Trees (CART) (23)

- Unsupervised Learning (55)
- Clustering (28)
- Distance Measures (9)
- Dimensionality Reduction (9)

- Deep Learning (23)
- Data Preparation (34)
- General (5)
- Standardization (6)
- Missing data (7)
- Textual Data (16)

Classification is a form of supervised learning in which the objective is to classify observations into a discrete number of categories based on their input features.

Most of the concepts of binary classification transfer over to the situation of an outcome with more than two levels, which is referred to as multi-class classification.

Logistic regression is the most traditional classification algorithm and preserves many of the advantages in interpretation as linear regression for a continuous outcome.

If the overall distribution of the outcome is heavily tilted towards one class compared to the other, the classification problem is considered to be imbalanced.

One of the decisions that often has to be made in a classification problem is determining what threshold to use in order to classify observations into appropriate class designations.

Both discriminative and generative models have the ability to learn model parameters in a classification setting, but they are based on entirely different mechanisms and serve different purposes.

Logistic Regression is a case of Generalized Linear Model used in situations where the response variable is a binary outcome.

Logistic regression uses a logistic loss function, where the cost for a single observation is represented by:

SVM is a classification algorithm that seeks to determine a decision boundary by maximizing the distance between points of different classes.

Naive Bayes uses the framework of Bayes Theorem and the assumption of conditional independence between all pairs of predictors

Find out all the ways

that you can