githubEdit

fishStage IV: Anomaly Detection & Handling

Outlier detection cases.

Base Framework

  • Statistical Methods

  • Machine Learning based methods e.g. OLS and random forest

  • Time-Series specific (only applicable for continuous data types)

During EDA, statistical methods are generally preferable because they are faster, interpretable, and more aligned with exploratory decision-making. ML methods are better reserved for confirmatory or production contexts where their complexity pays off.

Last updated

Was this helpful?