03.machine learning

  • Machine Learning is a method of data analysis that automates analytical model building.
  • Using algorithms that iteratively learn from data, machine learning allows computers to find hidden insights without being explicitly programmed where to look.

ML

Supervised Learning

Spark's MLlib is mainly designed for supervised and unsupervised learning tasks, with most of its alogorithms falling under those two categories.

Supervised learning is commonly used in applications where historical data predicts likely future events.

Unsupervised Learning

Unsupervised learning is used against data that has no historical labels.
The goal is to explore the data and find some structure within.

One issue is that it can be difficult to evaluate the results of an unsupervised model.