Anomaly Detection in Python
Detect anomalies in your data analysis and expand your Python statistical toolkit in this four-hour course.
Course Description
Extreme values or anomalies are present in almost any dataset, and it is critical to detect and deal with them before continuing statistical exploration. When left untouched, anomalies can easily disrupt your analyses and skew the performance of machine learning models.
In this course, you’ll leverage Python to implement a variety of anomaly detection methods. You’ll spot extreme values visually and use tested statistical techniques like Median Absolute Deviation for univariate datasets. For multivariate data, you’ll learn to use estimators such as Isolation Forest, k-Nearest-Neighbors, and Local Outlier Factor. You’ll also learn how to ensemble multiple outlier classifiers into a low-risk final estimator. You’ll walk away with an essential data science tool in your belt: anomaly detection with Python.
Better anomaly detection means better understanding of your data, and particularly, better root cause analysis and communication around system behaviour. Adding this skill to your existing Python repertoire will help you with data cleaning, fraud detection, and identifying system disturbances.
What You’ll Learn
Detecting Univariate Outliers
This chapter covers techniques to detect outliers in 1-dimensional data using histograms, scatterplots, box plots, z-scores, and modified z-scores.
Distance and Density-based Algorithms
After a tree-based outlier classifier, you will explore a class of distance and density-based detectors. KNN and Local Outlier Factor classifiers have been proven highly effective in this area, and you will learn how to use them.
Isolation Forests with PyOD
In this chapter, you’ll learn the ins and outs of how the Isolation Forest algorithm works. Explore how Isolation Trees are built, the essential parameters of PyOD’s IForest and how to tune them, and how to interpret the output of IForest using outlier probability scores.
Time Series Anomaly Detection and Outlier Ensembles
In this chapter, you’ll learn how to perform anomaly detection on time series datasets and make your predictions more stable and trustworthy using outlier ensembles.
User Reviews
Be the first to review “Anomaly Detection in Python”
You must be logged in to post a review.
There are no reviews yet.