Needles & Haystacks: Machine Learning with Imbalanced Datasets

December 20, 2022 | 8:00 am
Matt How, Adatis
Share

Please complete the form to watch the webinar recording

    Thanks

    You can watch the video below. Let us know if you learn something new or useful by tweeting us @DataIdols.

    The unique skill of a Machine Learning model is to identify specific instances that humans would easily miss. That needle in a haystack can make or break your business. But how can you train a machine learning model when the data points you need to identify are extremely rare? Fortunately, this problem is not new and there are several strategies that can be employed to tackle this common scenario. From synthetic data generation techniques, algorithms, and metric selection to over and under-sampling, this session will cover a variety of approaches that can be implemented through common libraries and toolsets.

    You will leave with an understanding of how to identify imbalanced classification problems and a myriad of resolution approaches to experiment with. This session would suit those with a basic knowledge of data science although all the basics will be covered in brief.

    Mmm 🍪cookies!

    We use cookies to make your experience on this website better, and we have a variety to choose from. Use the toggles below to customise your selection or click 'Save my cookies' to get straight to the content.