Data Pre-processing and Feature Engineering for Machine Learning

September 29, 2021 | 8:50 am
Soledad Galli, Lead Data Scientist, Train in Data
Share

Data in its raw format is almost never suitable for use to train Machine Learning Models. In fact, Data Scientist devote a big part of their time to clean and pre-process data. Feature engineering refers to the various processes and techniques that we can use to pre-process variables for use in machine learning modelling. Feature engineering includes transformations like filling missing values, encoding categorical variables, transforming variables mathematically, and creating new variables from existing ones, just to name a few. There are multiple feature engineering techniques that we can use to extract maximum value from features. When should we use each technique, and why? What are their advantages, assumptions and limitations? Are they suitable for every algorithm? In this video, I will discuss various feature engineering techniques, and compare their implementation in open source Python libraries.

Let’s
chat

Leave us a message using our contact form and we’ll get back to you straight away.

If you’re eager to get started, give us a call now on 01908 465 570

Thanks

for reaching out, 🙏

A member of our team will be in touch shortly to arrange our chat.

Mmm 🍪cookies!

We use cookies to make your experience on this website better, and we have a variety to choose from. Use the toggles below to customise your selection or click 'Save my cookies' to get straight to the content.