Category Archives: Pandas
Detect and Remove Outliers from Pandas DataFrame
Z-score re-scale and center(Normalize) the data and look for data points which are too far from zero(center). Data points far from zero will be treated as the outliers. In most of the cases, a threshold of 3 or -3 is used i.e if the Z-score value is greater than or less than 3 or -3 respectively, that data point will be identified as outliers.