Winsorized Mean: A Robust Approach to Handling Outliers
A winsorized mean reduces the influence of outliers by capping extreme values at specific percentiles rather than discarding them, which preserves the size and overall structure of the dataset. Read on to learn how to calculate the winsorized mean in Python.
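To make the capping idea concrete, here is a minimal sketch using only the Python standard library. It assumes a simple count-based rule: the `k` smallest and `k` largest values are replaced by their nearest retained neighbors, where `k` is the chosen proportion of the sample size. The function name `winsorized_mean`, the `proportion` parameter, and the sample data are illustrative assumptions, not taken from the article. Percentile-based implementations may cap at slightly different thresholds.

```python
from statistics import mean


def winsorized_mean(values, proportion=0.1):
    """Mean after capping the k lowest and k highest values.

    Hypothetical helper for illustration: k = int(proportion * n) values
    in each tail are replaced by the nearest value that is kept.
    """
    if not 0 <= proportion < 0.5:
        raise ValueError("proportion must be in [0, 0.5)")
    data = sorted(values)
    n = len(data)
    if n == 0:
        raise ValueError("values must not be empty")

    k = int(proportion * n)  # number of values to cap in each tail
    if k > 0:
        low, high = data[k], data[n - k - 1]  # capping thresholds
        data = [min(max(x, low), high) for x in data]
    return mean(data)


# Illustrative data: one extreme value (95) pulls the ordinary mean upward.
sample = [2, 4, 5, 6, 7, 8, 9, 10, 11, 95]

print(mean(sample))                  # 15.7
print(winsorized_mean(sample, 0.1))  # 7.5
```

With 10 observations and a proportion of 0.1, one value is capped in each tail: 2 becomes 4 and 95 becomes 11, so all ten observations still contribute to the result. For array data, `scipy.stats.mstats.winsorize` offers a library alternative to this hand-written version.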
Sep 10, 2024 · 7 min read
What is a winsorized mean?
When should I use the winsorized mean over the standard mean?
How does the winsorized mean differ from the trimmed mean?
What are the advantages of using the winsorized mean?
What are other winsorized statistical measures besides the mean?