Feature importance and model interpretation in Python

A practical course on feature importance and model interpretation using the Python programming language and scikit-learn

In this practical course, we focus on feature importance and model interpretation in supervised machine learning using the Python programming language.

What you’ll learn

  • How to calculate feature importance according to several models.
  • How to use the SHAP technique to calculate feature importance for any model.
  • Recursive Feature Elimination.
  • How to apply RFE with and without cross-validation.

Course Content

  • Introduction –> 2 lectures • 9min.
  • Feature importance and model interpretation –> 4 lectures • 1hr 13min.
  • Recursive Feature Elimination –> 2 lectures • 24min.


Requirements

  • Knowledge of the Python programming language.


Feature importance helps us better understand the information in our data and lets us reduce the dimensionality of a problem by keeping only the relevant features and discarding the uninformative ones. A common dimensionality reduction technique based on feature importance is Recursive Feature Elimination (RFE).
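As a minimal sketch of what "feature importance according to a model" can look like in scikit-learn, the example below fits a random forest on synthetic data and ranks the features by their impurity-based importance. The dataset, the choice of random forest, and all parameter values are assumptions made for illustration, not material taken from the course notebooks.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Synthetic data (assumed setup): 8 features, only 3 of them informative.
# shuffle=False keeps the informative features in columns 0, 1 and 2.
X, y = make_classification(
    n_samples=500,
    n_features=8,
    n_informative=3,
    n_redundant=0,
    shuffle=False,
    random_state=0,
)

model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X, y)

# Impurity-based importances: one non-negative score per feature, summing to 1.
importances = model.feature_importances_

# Rank features from most to least important.
ranking = sorted(enumerate(importances), key=lambda pair: pair[1], reverse=True)
for index, score in ranking:
    print(f"feature {index}: {score:.3f}")
```

Features with scores close to zero are candidates for removal, which is the idea that Recursive Feature Elimination automates.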

Model interpretation helps us correctly analyze and interpret the results of a model. A common approach to model interpretation is SHAP (SHapley Additive exPlanations), which attributes each prediction to the contributions of the individual features.
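To give a feel for what SHAP computes, here is a small sketch that calculates exact Shapley values by brute force in plain Python. It deliberately does not use the shap library; the toy model, the background rows, and the helper names (`model`, `coalition_value`, `shapley_values`) are all hypothetical and chosen only for illustration. In practice the shap package uses much faster algorithms, since this enumeration grows exponentially with the number of features.

```python
from itertools import combinations
from math import factorial


def model(row):
    """Hypothetical toy model: linear terms plus one interaction.
    The fourth feature (row[3]) is intentionally ignored."""
    return 2.0 * row[0] + 1.0 * row[1] + row[0] * row[2]


def coalition_value(predict, x, background, subset):
    """Average prediction when the features in `subset` take x's values
    and the remaining features come from each background row."""
    total = 0.0
    for b in background:
        mixed = [x[j] if j in subset else b[j] for j in range(len(x))]
        total += predict(mixed)
    return total / len(background)


def shapley_values(predict, x, background):
    """Exact Shapley values by enumerating every coalition of features."""
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                with_i = coalition_value(predict, x, background, set(subset) | {i})
                without_i = coalition_value(predict, x, background, set(subset))
                phi[i] += weight * (with_i - without_i)
    return phi


# Assumed background data and the single row we want to explain.
background = [
    [0.0, 0.0, 0.0, 5.0],
    [1.0, 2.0, 1.0, 1.0],
    [2.0, 1.0, 3.0, 0.0],
    [1.0, 1.0, 0.0, 2.0],
]
x = [3.0, 1.0, 2.0, 4.0]

phi = shapley_values(model, x, background)
baseline = sum(model(b) for b in background) / len(background)

print("contributions:", phi)
print("baseline + contributions:", baseline + sum(phi))
print("model prediction:", model(x))
```

Two properties are worth checking in the output: the contributions add up to the difference between the prediction and the baseline, and the feature the model ignores receives a contribution of zero.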

In this course, you will learn:

  1. How to calculate feature importance according to a model
  2. How to use the SHAP technique to calculate feature importance for any model
  3. How to use Recursive Feature Elimination for dimensionality reduction, with and without cross-validation
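The last point can be sketched with scikit-learn's `RFE` and `RFECV` classes. The synthetic dataset, the logistic regression estimator, and the parameter values below are assumptions for illustration only. Without cross-validation we have to decide in advance how many features to keep; with cross-validation that number is picked from the validation scores.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE, RFECV
from sklearn.linear_model import LogisticRegression

# Synthetic data (assumed setup): 10 features, 3 of them informative.
X, y = make_classification(
    n_samples=500,
    n_features=10,
    n_informative=3,
    n_redundant=0,
    shuffle=False,
    random_state=0,
)

estimator = LogisticRegression(max_iter=1000)

# Without cross-validation: we choose the number of features ourselves.
rfe = RFE(estimator, n_features_to_select=3, step=1)
rfe.fit(X, y)
print("RFE kept columns:", rfe.support_.nonzero()[0])

# With cross-validation: the number of features is chosen by the CV score.
rfecv = RFECV(estimator, step=1, cv=5, scoring="accuracy")
rfecv.fit(X, y)
print("RFECV kept", rfecv.n_features_, "columns:", rfecv.support_.nonzero()[0])

# Reduce the dataset to the selected columns.
X_reduced = rfecv.transform(X)
print("reduced shape:", X_reduced.shape)
```

In both cases `step=1` removes one feature per iteration, refitting the estimator each time and dropping the feature with the smallest coefficient magnitude.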

All the lessons of this course start with a brief introduction and end with a practical example in the Python programming language using its powerful scikit-learn library. The environment used is Jupyter, a standard tool in the data science industry. All the Jupyter notebooks are downloadable.

This course is part of my Supervised Machine Learning in Python online course, so you’ll find some lessons that are already included in the larger course.