molprop.utils.data_transform.remove_outliers

molprop.utils.data_transform.remove_outliers(df: DataFrame, columns: list[str], std_devs: int = 1)

Remove outliers from a dataframe based on the number of standard deviations.

Parameters:
  • df – pd.DataFrame Dataframe to remove outliers from.

  • columns – List[str] List of columns to remove outliers from.

  • std_devs – int, optional Number of standard deviations to use as a threshold, by default 1

Returns:

pd.DataFrame

Dataframe with outliers removed.