Data Processing and Feature Engineering for Machine Learning

September 29, 2021 | 8:50 am
Soledad Galli, Lead Data Scientist, Train in Data



    You can watch the video below. Let us know if you learn something new or useful by tweeting us @DataIdols.

    Data in its raw format is almost never suitable for training machine learning models. In fact, data scientists devote a large part of their time to cleaning and pre-processing data. Feature engineering refers to the processes and techniques we use to prepare variables for machine learning models. It includes transformations such as filling in missing values, encoding categorical variables, transforming variables mathematically, and creating new variables from existing ones, to name just a few. There are many feature engineering techniques that we can use to extract the most value from our features. When should we use each technique, and why? What are their advantages, assumptions and limitations? Are they suitable for every algorithm? In this video, I will discuss various feature engineering techniques and compare their implementations in open source Python libraries.
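    To make the four kinds of transformation named above concrete, here is a minimal sketch using pandas and NumPy. It is not taken from the webinar: the toy data, the column names and the helper `engineer_features` are all hypothetical, and the specific choices (median imputation, a "Missing" label, a log transform, one-hot encoding) are just one reasonable option among the techniques the talk compares.

```python
import numpy as np
import pandas as pd

# Toy data invented for illustration only (not from the webinar).
raw = pd.DataFrame({
    "age": [25.0, np.nan, 40.0, 31.0],
    "city": ["London", "Madrid", None, "London"],
    "income": [30000.0, 52000.0, 100000.0, 45000.0],
    "debt": [5000.0, 0.0, 20000.0, 9000.0],
})


def engineer_features(df):
    """Hypothetical helper applying one example of each transformation type."""
    out = df.copy()

    # 1. Fill missing values: median for the numeric variable,
    #    an explicit "Missing" label for the categorical one.
    out["age"] = out["age"].fillna(out["age"].median())
    out["city"] = out["city"].fillna("Missing")

    # 2. Mathematical transformation: a log compresses the long right tail
    #    of a strictly positive variable such as income.
    out["income_log"] = np.log(out["income"])

    # 3. New variable created from existing ones.
    out["debt_to_income"] = out["debt"] / out["income"]

    # 4. Encode the categorical variable as one 0/1 column per category.
    out = pd.get_dummies(out, columns=["city"], dtype=int)
    return out


features = engineer_features(raw)
print(features)
```

    In a real project the imputation values and the set of categories would be learned from the training set only and then applied unchanged to new data, which is what the fit/transform design of the open source libraries is for.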
