09.natural language processing

Natural language processing is a field of machine learning that focuses on creating models from a text data source (straight from articles of words).

Examples of NLP

  • Clustering News Articles
  • Suggesting similar books
  • Grouping legal documents
  • Analyzing consumer feedback
  • Spam Email Detection

Basic process of any NLP task:

  • Compile all documents
  • Featurize the words to numerics
  • Compare features of documents

A standard way of doing this is through the use of what is known as 'TF-IDF' methods.
'TF-IDF' stands for term frequency inverse documented frequency.

nlp-1

nlp-2

nlp-3

TF-IDF