### What is IDF? Why do we need IDF?

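Inverse Document Frequency (IDF) builds upon Term Frequency by down-weighting words that appear in many documents of the corpus. A word that occurs in nearly every document says little about any one of them, so IDF gives it a low weight, while a word concentrated in only a few documents receives a high weight.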
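
To make the weighting concrete, here is a minimal sketch that assumes the common unsmoothed definition idf(t) = log(N / df(t)), where N is the number of documents and df(t) is the number of documents containing term t. The function name `inverse_document_frequency` and the toy corpus are illustrative assumptions, not part of the original text. Libraries often use smoothed variants of this formula.

```python
import math


def inverse_document_frequency(tokenized_docs):
    """Return {term: idf}, assuming idf(t) = log(N / df(t)) (unsmoothed)."""
    n_docs = len(tokenized_docs)
    doc_freq = {}
    for tokens in tokenized_docs:
        # set() so a term counts once per document, however often it repeats
        for term in set(tokens):
            doc_freq[term] = doc_freq.get(term, 0) + 1
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


# Hypothetical toy corpus, tokenized by whitespace for simplicity
corpus = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "the bird sang".split(),
]

idf = inverse_document_frequency(corpus)
for term in ("the", "cat", "bird"):
    print(term, round(idf[term], 3))
```

In this toy corpus "the" appears in every document and gets a weight of zero, "cat" appears in two of the three documents and gets a small weight, and "bird" appears in only one and gets the largest weight of the three.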

A Term Frequency matrix has the IDs of the documents in the corpus as its rows and the terms as its columns, with each cell holding the number of times that term occurs in that document.

Vector space models are a family of models that represent data as vectors within space.

Tokenization is the process of separating text within documents into its smallest building blocks.

A corpus of text is the entire set of documents considered.
