Watch the intro video
Note: if you can't see the video, you may need to allow cookies or disable your ad blocker.
Soledad Galli, PhD
Instructor
Sole is a lead data scientist, instructor, and open-source software developer. She created and maintains Feature-engine, a Python library for feature engineering that allows us to impute data, encode categorical variables, and transform, create, and select features. Sole is also the author of the book "Python Feature Engineering Cookbook", published by Packt.
Course description
Welcome to the most comprehensive course on feature engineering available online. In this course, you will learn about variable imputation, variable encoding, feature transformation, discretization, and how to create new features from your data.
Specifically, you will learn:
- How to impute missing data
- How to encode categorical variables
- How to transform numerical variables and change their distribution
- How to perform discretization
- How to remove outliers
- How to extract features from date and time
- How to create new features from existing ones
While most online courses teach only the very basics of feature engineering, like imputing variables with the mean or encoding categorical variables with one-hot encoding, this course will teach you that, and much, much more.
In this course, you will first learn the most popular and widely used feature engineering techniques, like mean and median imputation, one-hot encoding, logarithmic transformation, and discretization. Then, you will discover more advanced methods that capture information while encoding or transforming your variables to improve the performance of machine learning models.
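To give a flavor of the basic techniques mentioned above, here is a minimal pure-Python sketch of mean imputation, one-hot encoding, and the logarithmic transformation. The data and variable names are invented for illustration; the course itself demonstrates these methods with pandas, Scikit-learn, and Feature-engine.

```python
import math

# Toy numeric column with missing values (None); data invented for illustration.
ages = [25, 32, None, 47, None, 51]

# Mean imputation: replace missing values with the mean of the observed ones.
observed = [a for a in ages if a is not None]
mean_age = sum(observed) / len(observed)
imputed = [a if a is not None else mean_age for a in ages]

# One-hot encoding: one binary column per category.
colors = ["red", "blue", "red", "green"]
categories = sorted(set(colors))
one_hot = [[int(c == cat) for cat in categories] for c in colors]

# Logarithmic transformation: reduces right skew (needs strictly positive values).
incomes = [20_000, 35_000, 120_000]
log_incomes = [math.log(v) for v in incomes]
```

The more advanced methods covered later in the course build on the same idea: replace or re-express a variable's values so that models can use them more effectively.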
The methods that you will learn were described in scientific articles, are used in data science competitions, and are commonly utilized in organizations. And what’s more, they can be easily implemented by utilizing Python's open-source libraries!
Throughout the lectures, you’ll find detailed explanations of each technique and a discussion about their advantages, limitations, and underlying assumptions, followed by the best programming practices to implement them in Python.
By the end of the course, you will be able to decide which feature engineering technique you need based on the variable characteristics and the models you wish to train. And you will also be well placed to test various transformation methods and let your models decide which ones work best.
This comprehensive feature engineering course contains over 100 lectures spread across approximately 10 hours of video, and ALL topics include hands-on Python code examples that you can use for reference, practice, and reuse in your own projects.
Course Curriculum
- Variable characteristics (2:44)
- Missing data (6:46)
- Cardinality - categorical variables (5:04)
- Rare labels - categorical variables (4:54)
- Linear model assumptions (9:13)
- Linear model assumptions - additional reading resources (optional)
- Variable distribution (5:08)
- Outliers (8:27)
- Variable magnitude (3:09)
- Variable characteristics and machine learning models
- Additional reading resources
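The variable characteristics listed in this section (missing data, cardinality, rare labels, outliers) can each be measured with a few lines of code. Below is a hedged pure-Python sketch, with toy data invented for illustration and crude index-based quartiles that are adequate for a sketch; the lectures work through these diagnostics with pandas.

```python
# Toy categorical and numeric columns, invented for illustration.
city = ["London", "Paris", "London", "Oslo", "London", None]
price = [10, 12, 11, 13, 95, 12]

# Fraction of missing data.
missing_frac = sum(v is None for v in city) / len(city)

# Cardinality: number of distinct categories (ignoring missing values).
cardinality = len({v for v in city if v is not None})

# Rare labels: categories below a frequency threshold (here 20%).
counts = {}
for v in city:
    if v is not None:
        counts[v] = counts.get(v, 0) + 1
rare = [k for k, n in counts.items() if n / len(city) < 0.2]

# Outlier boundary with the IQR proximity rule
# (crude index-based quartiles, adequate for a sketch).
s = sorted(price)
q1, q3 = s[len(s) // 4], s[3 * len(s) // 4]
upper = q3 + 1.5 * (q3 - q1)
outliers = [x for x in price if x > upper]
```

Knowing these characteristics up front is what lets you choose among the imputation, encoding, and transformation techniques that the following sections cover.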
- Introduction to missing data imputation (3:51)
- Complete Case Analysis (6:46)
- Mean or median imputation (7:53)
- Arbitrary value imputation (6:39)
- End of distribution imputation (4:53)
- Frequent category imputation (6:56)
- Missing category imputation (4:05)
- Random sample imputation (14:14)
- Adding a missing indicator (5:25)
- Imputation with Scikit-learn (3:43)
- Mean or median imputation with Scikit-learn (5:27)
- Arbitrary value imputation with Scikit-learn (5:04)
- Frequent category imputation with Scikit-learn (5:09)
- Missing category imputation with Scikit-learn (2:50)
- Adding a missing indicator with Scikit-learn (4:06)
- Automatic determination of imputation method with Scikit-learn (8:24)
- Introduction to Feature-engine (6:25)
- Mean or median imputation with Feature-engine (4:51)
- Arbitrary value imputation with Feature-engine (3:30)
- End of distribution imputation with Feature-engine (4:46)
- Frequent category imputation with Feature-engine (1:38)
- Missing category imputation with Feature-engine (3:04)
- Random sample imputation with Feature-engine (2:00)
- Adding a missing indicator with Feature-engine (4:06)
- Complete Case Analysis (CCA) with Feature-engine (6:47)
- Overview of missing value imputation methods
- Conclusion: when to use each missing data imputation method
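The imputation methods listed above follow a handful of simple recipes. As a hedged pure-Python sketch of four of them (toy data invented for illustration; the course implements these with pandas, Scikit-learn, and Feature-engine):

```python
# Toy numeric and categorical columns with missing values, invented for illustration.
values = [3.0, None, 5.0, 4.0, None]
labels = ["a", "b", None, "a", "a"]

observed = [v for v in values if v is not None]
mean = sum(observed) / len(observed)
std = (sum((v - mean) ** 2 for v in observed) / len(observed)) ** 0.5

# End-of-distribution imputation: fill with mean + 3 standard deviations.
end_value = mean + 3 * std
end_imputed = [v if v is not None else end_value for v in values]

# Arbitrary value imputation: fill with a fixed sentinel such as -1.
arbitrary_imputed = [v if v is not None else -1 for v in values]

# Frequent category imputation: fill with the mode.
mode = max({l for l in labels if l is not None},
           key=lambda c: labels.count(c))
label_imputed = [l if l is not None else mode for l in labels]

# Missing indicator: a binary flag preserving where data was absent.
indicator = [int(v is None) for v in values]
```

Each recipe makes a different assumption about why the data are missing, which is exactly the trade-off the "when to use each method" lecture discusses.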
- Categorical encoding | Introduction (6:49)
- One-hot encoding (6:09)
- Important: Feature-engine version 1.0.0
- One-hot encoding | Demo (14:12)
- One-hot encoding of top categories (3:06)
- One-hot encoding of top categories | Demo (8:35)
- Ordinal encoding | Label encoding (1:50)
- Ordinal encoding | Demo (8:08)
- Count or frequency encoding (3:11)
- Count encoding | Demo (4:33)
- Target guided ordinal encoding (2:41)
- Target guided ordinal encoding | Demo (8:30)
- Mean encoding (2:16)
- Mean encoding | Demo (5:31)
- Probability ratio encoding (6:13)
- Weight of evidence (WoE) (4:36)
- Weight of Evidence | Demo (12:38)
- Comparison of categorical variable encoding (10:36)
- Rare label encoding (4:31)
- Rare label encoding | Demo (10:25)
- Binary encoding and feature hashing (6:12)
- Additional reading resources
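Two of the target-guided encodings listed above, mean encoding and weight of evidence, can be sketched in plain Python. The data below are invented for illustration, and the formulas shown (category mean of the target; ln of the ratio of a category's share of positives to its share of negatives) match the standard definitions covered in the lectures:

```python
import math

# Toy categorical variable and binary target, invented for illustration.
cat = ["a", "b", "a", "b", "a", "b"]
target = [1, 0, 1, 0, 0, 1]

# Group the target values by category.
groups = {}
for c, t in zip(cat, target):
    groups.setdefault(c, []).append(t)

# Mean (target) encoding: replace each category by the mean of the target.
means = {c: sum(ts) / len(ts) for c, ts in groups.items()}
mean_encoded = [means[c] for c in cat]

# Weight of evidence: ln( P(category | y=1) / P(category | y=0) ).
pos = sum(target)
neg = len(target) - pos
woe = {c: math.log((sum(ts) / pos) / ((len(ts) - sum(ts)) / neg))
       for c, ts in groups.items()}
woe_encoded = [woe[c] for c in cat]
```

Because both encodings use the target, they must be learned on the training set only, a point the demo lectures return to repeatedly.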
- Discretisation | Introduction (3:01)
- Equal-width discretisation (4:06)
- Important: Feature-engine version 1.0.0
- Equal-width discretisation | Demo (11:18)
- Equal-frequency discretisation (4:13)
- Equal-frequency discretisation | Demo (7:16)
- K-means discretisation (4:13)
- K-means discretisation | Demo (2:43)
- Discretisation plus categorical encoding (2:54)
- Discretisation plus encoding | Demo (5:45)
- Discretisation with classification trees (5:05)
- Discretisation with decision trees using Scikit-learn (11:55)
- Discretisation with decision trees using Feature-engine (3:48)
- Domain knowledge discretisation (3:52)
- Additional reading resources
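The contrast between the first two discretisation methods above is easy to see in code. In this hedged pure-Python sketch (toy, right-skewed data invented for illustration), equal-width bins span identical ranges while equal-frequency bins hold roughly equal counts; the lectures implement both with pandas, Scikit-learn, and Feature-engine:

```python
# Toy right-skewed variable, invented for illustration.
x = [1, 2, 2, 3, 3, 4, 10, 20]
n_bins = 2

# Equal-width discretisation: bins of identical range.
lo, hi = min(x), max(x)
width = (hi - lo) / n_bins
equal_width = [min(int((v - lo) / width), n_bins - 1) for v in x]

# Equal-frequency discretisation: bins with (roughly) equal counts,
# here via rank-based assignment.
order = sorted(range(len(x)), key=lambda i: x[i])
per_bin = len(x) // n_bins
equal_freq = [0] * len(x)
for rank, i in enumerate(order):
    equal_freq[i] = min(rank // per_bin, n_bins - 1)
```

On skewed data, equal-width bins leave almost all observations in one interval, while equal-frequency bins split them evenly, which is why the choice between them matters.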
- Feature scaling | Introduction (3:44)
- Standardisation (5:31)
- Standardisation | Demo (4:39)
- Mean normalisation (4:02)
- Mean normalisation | Demo (5:21)
- Scaling to minimum and maximum values (3:24)
- MinMaxScaling | Demo (3:01)
- Maximum absolute scaling (3:01)
- MaxAbsScaling | Demo (3:45)
- Scaling to median and quantiles (2:46)
- Robust Scaling | Demo (2:04)
- Scaling to vector unit length (5:51)
- Scaling to vector unit length | Demo (5:18)
- Additional reading resources
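The two most common scaling methods in this section, standardisation and min-max scaling, reduce to one formula each. A hedged pure-Python sketch with toy data invented for illustration (the course uses Scikit-learn's scalers):

```python
# Toy numeric variable, invented for illustration.
x = [2.0, 4.0, 6.0, 8.0]

# Standardisation: subtract the mean, divide by the standard deviation,
# giving a variable with mean 0 and unit variance.
mean = sum(x) / len(x)
std = (sum((v - mean) ** 2 for v in x) / len(x)) ** 0.5
standardised = [(v - mean) / std for v in x]

# Min-max scaling: map the variable onto the [0, 1] interval.
lo, hi = min(x), max(x)
minmax = [(v - lo) / (hi - lo) for v in x]
```

Like the encoders, scalers learn their parameters (mean, standard deviation, minimum, maximum) from the training set and then apply them unchanged to new data.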